Pedro Ortiz Suarez
Pedro Ortiz Suarez
Home
Publications
Talks
Projects
Contact
CV
Light
Dark
Automatic
English
English
Deutsch
Español
Français
7
A Data-driven Approach to Natural Language Processing for Contemporary and Historical French
We determine that the importance of the pre-training dataset size was largely overestimated, as we are able to repeatedly show that language models can be pre-trained with corpora of a modest size.
Pedro Ortiz Suarez
PDF
Cite
Theses
TEL
Cite
×