Publications

HAL publications of the ANR project ANR-17-CE23-0018

2020

Journal articles

Title
Encoding high-cardinality string categorical variables
Authors
Patricio Cerda, Gaël Varoquaux
Article
IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TKDE.2020.2992529⟩
Short abstract
Statistical models usually require vector representations of categorical variables, using for instan …..
Full text and BibTeX
https://hal.inria.fr/hal-02171256/file/article.pdf BibTex
Title
Linear predictor on linearly-generated data with missing values: non consistency and solutions
Authors
Marine Le Morvan, Nicolas Prost, Julie Josse, Erwan Scornet, Gaël Varoquaux
Article
Proceedings of Machine Learning Research, PMLR, In press
Short abstract
We consider building predictors when the data have missing values. We study the seemingly-simple cas …..
Full text and BibTeX
https://hal.archives-ouvertes.fr/hal-02464569/file/aistats.pdf BibTex

Preprints, Working Papers, …

Title
Neumann networks: differential programming for supervised learning with missing values
Authors
Marine Le Morvan, Julie Josse, Thomas Moreau, Erwan Scornet, Gaël Varoquaux
Article
2020
Short abstract
The presence of missing values makes supervised learning much more challenging. Indeed, previous wor …..
Full text and BibTeX
https://hal.archives-ouvertes.fr/hal-02888867/file/main.pdf BibTex
Title
On the consistency of supervised learning with missing values
Authors
Julie Josse, Nicolas Prost, Erwan Scornet, Gaël Varoquaux
Article
2020
Short abstract
In many application settings, the data have missing entries which make analysis challenging. An abun …..
Full text and BibTeX
https://hal.archives-ouvertes.fr/hal-02024202/file/main.pdf BibTex

2019

Conference papers

Title
Comparing distributions: $\ell_1$ geometry improves kernel two-sample testing
Authors
Meyer Scetbon, Gaël Varoquaux
Article
NeurIPS 2019 – 33rd Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada
Full text and BibTeX
https://hal.inria.fr/hal-02292545/file/NIPS_L1_test-HAL-v2%20%281%29.pdf BibTex

2018

Journal articles

Title
Atlases of cognition with large-scale human brain mapping
Authors
Gaël Varoquaux, Yannick Schwartz, Russell Poldrack, Baptiste Gauthier, Danilo Bzdok, Jean-Baptiste Poline, Bertrand Thirion
Article
PLoS Computational Biology, Public Library of Science, 2018, 14 (11), pp.e1006565. ⟨10.1371/journal.pcbi.1006565⟩
Short abstract
To map the neural substrate of mental function, cognitive neuroimaging relies on controlled psycholo …..
Full text and BibTeX
https://www.hal.inserm.fr/inserm-02146700/file/journal.pcbi.1006565.pdf BibTex
Title
Similarity encoding for learning with dirty categorical variables
Authors
Patricio Cerda, Gaël Varoquaux, Balázs Kégl
Article
Machine Learning, Springer Verlag, 2018, ⟨10.1007/s10994-018-5724-2⟩
Short abstract
For statistical learning, categorical variables in a table are usually considered as discrete entiti …..
Full text and BibTeX
https://hal.inria.fr/hal-01806175/file/article_hal.pdf BibTex
