Working meeting on AI in historical dictionaries and text corpora

Akademie der Wissenschaften und der Literatur

10 December 2025

11:00 – 16:00

The Academy project Mittelhochdeutsches Wörterbuch is holding a working meeting on AI in historical dictionaries and text corpora. The aim of the meeting is research-focused exchange and networking among researchers.

Topics and content:

  • Discussion of previous experiences and experiments with AI approaches
  • Exchange on the requirements that historical dictionaries and text corpora place on AI approaches, owing to
    • the great variety of forms of individual lemmas, caused by the absence of a standardised orthography, regional peculiarities, etc.
    • the underrepresentation of the relevant language stages in models that use “the Internet” as training data

Contributions:

  • Grouping of lemmas by similarity
  • Distributional semantics/word embeddings as the basis for today's AI approaches
  • Lemmatisation of lemmas in preparation for data-driven dictionary work
  • Semantic tagging and indexing of historical texts and dictionary articles

     

Organisers:

Patrick D. Brookshire (Digital Academy of the AdWL)

Jonas Richter (Lower Saxony Academy of Sciences in Göttingen) 

 

Participating researchers:

Niels Bohnert (Trier Office of the AdWL)

Dr Luise Borek (Technical University of Darmstadt & Union of Academies)

Julia Hintersteiner (Paris Lodron University of Salzburg, Austria)

Dr Nora Ketschik (University of Stuttgart)

Sarah Oberbichler (Leibniz Institute for European History in Mainz)

Ismail Prada Ziegler (University of Bern, Switzerland)

Ute Recker-Hamm (Trier Office of the AdWL)

Jan Schaffert (Lower Saxony Academy of Sciences in Göttingen)

Dr Tobias Streck (University of Freiburg)

Dr Stefan Tomasek (Julius Maximilian University of Würzburg)