Pedro Ortiz Suarez

Pedro Ortiz Suarez

Senior Research Scientist

Common Crawl Foundation

About me

I’m a Senior Research Scientist at the Common Crawl Foundation.

I am interested in large corpora for training language models, specially for under resourced languages and historical languages. I am interested in tasks such as Name Entity Recognition (NER), Dependency Parsing and Part-of-Speech tagging, Machine Translation and Document structuration.

I love coffee, cookies and maths. ☕🍪

Interests
  • Language modeling
  • Corpus linguistics
  • Named Entity Recognition
  • Computational Linguistics
  • Machine Translation
Education
  • PhD in Computer Science, 2022

    Sorbonne Université

  • BASc MIASHS, 2018

    Université Paris 8

  • MSc in Mathematics, 2017

    Aix-Marseille Université

  • BSc in Mathematics, 2016

    Universidad Nacional de Colombia

Recent Publications

Contact