Résumé


Background

I received my Ph.D. in Computer Science (2018) from Paris-Saclay University, France, where my thesis proposed several neural architectures for event extraction from unstructured text.

Before that, I earned:

During my Ph.D., I spent three years at CEA, List (Laboratory for Integration of Systems and Technology) and five years at LIMSI (now LISN) in Orsay, France.

I then worked as a research engineer at the LAL (Linear Accelerator Laboratory), within the Paris-Saclay Center for Data Science, focusing on NLP applications for French law (case outcome prediction) and fake news detection.

From 2017 to 2020, I worked at Teklia as a machine learning scientist, developing technologies for understanding digitised and historical documents, including handwriting recognition, article separation, and named entity recognition.

From 2020 to 2022, I worked as a postdoctoral researcher at the University of La Rochelle, within the L3i Laboratory (IT, Image & Interaction).

From 2023 to 2026, I was a scientist at the Digital Humanities Laboratory (DHLAB) at École Polytechnique Fédérale de Lausanne (EPFL), where I worked on the Impresso project, focusing on large-scale multilingual historical newspaper processing, named entity recognition and linking, OCR post-correction, semantic enrichment, and the development of evaluation benchmarks for historical NLP.

My main research interests include:


Education
======

Work experience
======

Publications
======

For a complete list of publications, please visit my Google Scholar profile.

Talks
======

November 7, 2024
🎤 Keynote speaker at the GDR TAL CNRS 2024 annual meeting: “Traitement Automatique des Langues et les Humanités Numériques”, La Rochelle, France.
Talk: The Ongoing Struggle for Alleviating Digitisation Errors in Historical Document Processing: A Necessary Effort?
🔗 Event page


September 16, 2022
🎤 Invited speaker at the NER for OCR’ed Historical Documents Seminar Series, Maison de la Recherche, Paris-Sorbonne, France.
Talk (with Antoine Doucet): Impact of Optical Character Recognition on Named Entity Recognition
🔗 Event site


March 3, 2022
🎤 Invited speaker at the CERES study day on digital methods for humanities, La Rochelle, France.
Talk: Named entity recognition and event extraction in historical documents
(Fr: Reconnaissance d’entités nommées et extraction d’événements dans les documents historiques)
🔗 CERES


November 8, 2017
🎤 Presentation: Fake News Detection at the Paris-Saclay Center for Data Science (CDS) Annual Pitching Day.
The talk covered how to define fake news, detection tactics, and the evaluation metrics used in a student competition.
🔗 Pitching Day Info


October 15, 2014
🎤 Presentation: Learning word representations for event extraction from text
(Fr: Apprentissage des représentations de mots pour l’extraction d’événements à partir de texte)
At Paris Machine Learning Group #2, Season 2: Learning Causality, Words, the Higgs & more.
This talk was based on my first-year Ph.D. research, later published at EMNLP (CORE A).
🔗 Group page
🔗 Meetup event

Teaching
======

Courses and Responsibilities

2023–2025 · EPFL – École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland


2020–2022 · University of La Rochelle, France – L3i Laboratory


2019 · French Institute of Hanoi (Remote)


2017–2019 · University of Rouen Normandy, France


2013 · Paris-Sud University – IUT d’Orsay, France


Scientific Organization and Mentorship

Program Committee Member / Organizational Roles / Reviewer

ARR (ACL Rolling Review) (every year since 2019), CLEF (2020, 2026), SIGIR (2022, 2025, 2026), SIGIR-AP (2025), CIKM (2023–2026), ECIR (2023–2026), CHR (Computational Humanities Research) (2021–2025), ICADL (2022–2024; 2024 Program Chair), ISCRAM (2022), ICPRAI (2022), WWW (The Web Conference) (2026), LREC (2025 Reviewer, 2026 Area Chair), DAS (2022 – Reviewer, Organizing Committee, Chair), ICFHR (2018), ASAR (IEEE) (2018), SwissText (2024), HIP (2023), SoICT (2022), RobustAL (2022).

Reviewer / External Reviewer

ACL (ARR, Main Conference), EMNLP (including Industry Track), NAACL-HLT, COLING (including Demos), EACL, LREC-COLING, LaTeCH-CLfL, SemEval, CoNLL, TALN, CORIA, FinNLP, Hackashop, IEEE Transactions on Knowledge and Data Engineering (TKDE), Journal on Computing and Cultural Heritage (JOCCH), Language Resources and Evaluation (Springer).

Public Outreach


Awards and Competitions


Fun facts