Maud Ehrmann

EPFL CDH DHI DHLAB
INN 116 (Bâtiment INN)
Station 14
1015 Lausanne

Web site:  Web site:  https://dhlab.epfl.ch

vCard
Administrative data

Fields of expertise

With a background in both natural language processing (NLP) and the humanities, my expertise is in the area of historical and multilingual NLP, with particular focus on historical document processing, information extraction, named entity processing, multilingual and historical resource creation, NLP system evaluation, and large-scale infrastructure. In recent years, I have worked and coordinated work on these topics in research projects at the intersection of computer science and cultural heritage - an interdisciplinary setting in which I have often acted as an intermediary between computer scientists, humanities scholars, engineers, and representatives of cultural heritage institutions.

Highlights:


impresso. Media Monitoring of the Past. How can newspaper archives help understand the past? How to explore them? This large-scale, impact-driven project aims to enable critical mining of newspaper archives by integrating robust content mining and innovative data visualisation and exploration into a powerful user interface that can support digital scholarship.

The HIPE Evaluation Campaigns. What is the ability of machines to recognise and disambiguate entities (e.g. people, places, organisations) in multilingual historical documents? The series of HIPE shared tasks aims to assess and advance the development of robust, adaptable and transferable approaches to named entity processing in historical documents to foster efficient semantic indexing of digitised cultural heritage collections. See the HIPE-2020 and HIPE-2022 websites, the HIPE-eval GitHub organisation, the HIPE-2022 dataset, and the DHLAB web page.

Publications

Infoscience publications

Selected publications

Teaching & PhD

Teaching

Humanities and Social Sciences Program

Courses

Digital humanities

Digital Humanities is a discipline at the crossroads of the information sciences and the humanities and social sciences. In this course, students discover this new field of research by learning how to extract information from millions of press articles.

Press and digital history I

Combining digital technology and history, this course offers a new approach to the history of media and journalism. By exploring digitised newspaper archives using digital tools, students will learn to critically analyse vast amounts of data from the past.

Press and digital history II

Combining digital technology and history, this course offers a new approach to the history of media and journalism. By exploring digitised newspaper archives using digital tools, students will learn to critically analyse vast amounts of data from the past.