Maud Ehrmann

EPFL CDH DHI DHLAB
INN 116 (Bâtiment INN)
Station 14
CH-1015 Lausanne

Web site:  Web site:  https://dhlab.epfl.ch

vCard
Administrative data

Fields of expertise

With backgrounds in both natural language processing (NLP) and humanities, my expertise is in the domain of historical document and multilingual NLP, with special focus on historical document processing, information extraction, named entity processing, multilingual and historical resources creation, NLP system evaluation, and large-scale infrastructure.

Over the past few years I worked and coordinated work on these topics in research projects at the intersection of computer science and cultural heritage. In the context of various interdisciplinary settings I could leverage both my backgrounds in NLP and humanities, and often acted as an intermediary between computer scientists, humanity scholars, engineers and representatives of cultural heritage institutions.

Highlights:


impresso. Media Monitoring of the Past. How can newspaper archives help understand the past? How to explore them? This large-scale, impact-driven project aims to enable critical mining of newspaper archives by integrating robust content mining and innovative data visualisation and exploration into a powerful user interface that can support digital scholarship.

The HIPE Evaluation Campaigns. What is the ability of machines to recognise and disambiguate entities (e.g. people, places, organisations) in multilingual historical documents? The series of HIPE shared tasks aims to assess and advance the development of robust, adaptable and transferable approaches to named entity processing in historical documents to foster efficient semantic indexing of digitised cultural heritage collections. See the HIPE-2020 and HIPE-2022 websites, the HIPE-eval GitHub organisation, the HIPE-2022 dataset, and the DHLAB web page.

Publications

Infoscience publications

Selected publications

Teaching & PhD

Teaching

Humanities and Social Sciences Program

Courses

Digital humanities

Digital Humanities is a discipline at the crossroads of the information sciences and the humanities and social sciences. In this course, students discover this new field of research by learning how to extract information from millions of press articles.

Press and digital history II

Combining digital technology and history, this course offers a new approach to the history of the press and of Switzerland in the 20th century. By exploring digitized newspaper archives with the help of digital tools, students will learn to critically analyze massive data from the past.