eLex 2025 proceedings are published by Lexical Computing CZ, s.r.o., and are available below as the complete set, or as individual papers.

How to cite the proceedings:

Kosem, I., Jakubíček, M., Medveď, M. Zgaga, K. Arhar Holdt, Š., Munda, T. & Salgado, A. (eds.) (2025). Electronic lexicography in the 21st century (eLex 2023): Intelligent lexicography. Proceedings of the eLex 2025 conference. Bled, 18–20 November 2025. Bled: Lexical Computing CZ s.r.o.

Download individual papers

Bridging human and AI perspectives: semantic annotation of generic nouns in German

Iván Arias-Arias, Elena Martín-Cancela

Choosing Suitable Text Corpora for Identifying Collocations – A Case Study of a Large Reference Dictionary of Contemporary German

Luise Köhler, Gregor Middell, Alexander Geyken

A Pipeline for Automated Dictionary Creation with Optional Human Intervention

Thomas Widmann

From Word of the Year to Word of the Week: Daily-updated Monitor Corpora for 25 Languages

Ondřej Herman, Miloš Jakubíček, Jan Kraus, Vít Suchomel

The Dictionary of Contemporary Serbian Language (RSSJ): Advanced Automation and Other Challenges

Ranka Stanković, Rada Stijović, Mihailo Škorić, Cvetana Krstev

So Close but Still Far: Case Study on Application of LLMs in Idioms Identification, Definition, and Generation of Illustrative Examples

Aleksandra Marković, Ranka Stanković

AI- and Corpus-Based Strategies for Identifying Phraseme Constructions: A Pilot Study on Croatian Repetitive Constructions

Slobodan Beliga, Ivana Filipović Petrović

Exploring the power of generative artificial intelligence for automatic term extraction from small samples

Lena De Pourcq, Marie Grégoire, Leonardo Zilio

Lexicom at 25: reflections on the changing world of lexicography and language technology

Michael Rundell, Miloš Jakubíček, Vojtěch Kovář, Ondřej Matuška, Michal Cukr

Vocabulary Size of Czech Native Speakers: A Statistical Approach

Marek Blahuš, Miloš Jakubíček, Vojtěch Kovář, František Kovařík

Automatic Non-recorded Sense Detection for Swedish through Word Sense Induction with fine-tuned Word-in-Context models

Dominik Schlechtweg, Emma Sköldberg, Shafqat Mumtaz Virk, James White, Simon Hengchen

DMLEX on Wikibase: Legacy dictionaries as collaboratively editable dataset

Simon Krek, Primož Ponikvar, Andraž Repar, Iztok Kosem, David Lindemann

Handling abstract constructions in a dictionary-based constructicon

Bálint Sass, Éva Dömötör, Balázs Indig, Mátyás Lagos Cortes, Veronika Lipp, Márton Makrai, Gergely Pethő

Up to No Good: Exploiting Word Embeddings for an Automatic Extraction of Candidates for a Lexicon of Slovene Taboo Language

Jaka Čibej

Automatically Updated Corpora of EU National Parliaments with Terminology Extraction in Twenty Languages

Marek Blahuš, Ota Mikušek

Automatic Detection of Word Sense Shift from Corpus Data

Ondřej Herman

The Challenges of Syntactic Descriptions of Multiword Expressions in Electronic Lexicography

Verginica Barbu Mititelu, Voula Giouli, Gražina Korvel, Chaya Liebeskind, Irina Lobzhanidze, Rusudan Makhachashvili, Stella Markantonatou, Alexandra Markovic, Ivelina Stoyanova

Automated Transcription of Mixed-Script Dialectal Materials

Markus Kunzmann

ENEOLI Wikibase: A collaborative working platform for the European Network on Lexical Innovation

David Lindemann, Ana Salgado

The Lexicographic Treatment of Homophonic Neologisms in Chinese Dictionaries

Jiang Li, Wang Yi

The challenge of AI-generated neology

Cécile Poix, Natalya Shevchenko

The role of subjectivity in lexicography: Experiments towards data-driven labeling of informality

Lydia Risberg, Eleri Aedmaa, Maria Tuulik, Margit Langemets, Ene Vainik, Esta Prangel, Kristina Koppel, Hanna Pook

Mapping Slovene Learner Vocabulary to CEFR Scales with AI-assisted Methods

Mojca Stritar Kučuk

Corpus-Based Methods and AI-Assisted Terminography for Contextonym Analysis

Antonio San Martín

Contrasting a new AI-powered dictionary designed for on-screen reading with electronic dictionaries that have evolved from print editions

Ana Frankenberg-Garcia

Implementing Frames in the Phrase-based Active Dictionary: why Frames are needed but FrameNet can
only be a partial solution

Laura Rebosio

Toward a corpus-based multilingual terminology database for Intercultural Communication

María Iglesias Vázquez, Charlotte Venema, Marie Steffens

LLM-Assisted Dialect Lexicography: Challenges and Opportunities in Processing Historical Bavarian Dialects

Philipp Stöckle, Daniel Elsner, Wolfgang Koppensteiner, Katharina Korecky-Kröll

Corpus-Based Vocabulary Profiling for Ukrainian: From Lexical Analysis to the PULS Digital Learning Platform

Olena Synchak, Vasyl Starko, Mariana Burak, Mykhaylo Svystun

How Effective is AI as a Language Consultant?

Urška Vranjek Ošlak

Lexical-Semantic Resources as a Culture-Aware Basis for Benchmarking and Evaluation of LLMs

Nathalie Norman, Sanni Nimb, Sussi Olsen, Nina Schneidermann, Bolette S. Pedersen

Better something than nothing: Analysis of GPT-4 performance in identifying Croatian proverbs

Nikola Bakarić

Exploring Derivational Families through Intelligent Lexicography

Krešimir Šojat, Kristina Kocijan

A Corpus-Based Dictionary for the Endangered Megrelian Language

Irina Lobzhanidze, Rusudan Gersamia

Comparative Analysis of Medical Adjectives in Croatian General Dictionaries

Martina Pavić, Daša Farkaš

Automating Adjectival Microstructures in Monolingual Dictionaries: A New Method Combining Embeddings and LLMs

Enikő Héja, László Simon, Veronika Lipp

Using Large Language Models to Generate Distractors for Language Games

Iztok Kosem, Špela Arhar Holdt

CJVT Igre: New Word Games Based on the Digital Dictionary Database of Slovene

Špela Arhar Holdt, Iztok Kosem

Neology in Practice:Lexicographic and Terminological Approaches to Lexical Innovation

Jelena Kallas, Kristina Koppel, Kris Heylen, Ilan Kernerman, Ana Ostroški Anić, Federica Vezzani, Špela Arhar Holdt

Exploring the constructicographic potential of lexicographic data and language models: The case of the Estonian Nominal Quantifier Construction

Heete Sahkai, Geda Paulsen, Ene Vainik, Jelena Kallas, Ahto Kiil, Katrin Tsepelina, Kertu Saul, and Arvi Tavast

Compiling bilingual dictionaries: AI-Assisted translation of Italian Multiword Expressions into English and French

Annalisa Greco, Matteo Delsanto, Andrea Di Fabio, Lorenzo Mori2 Cristina Onesti, Daniele Paolo Radicioni, Calogero Jerik Scozzaro

Documenting the Final Days of Monolingual English Learners’ Dictionaries Using the Archived Web

Geraint Paul Rees

Compiling a candidate list of taboo constructions for an under-resourced language

Monique Rabé, Martin J Puttkammer, Gerhard B van Huyssteen

The Mangalam Dictionary of Buddhist Sanskrit: automating lexicographic data with generative LLMs

Ligeia Lugli

You get it through lexicography: extracting suppressed language from LLMs using lexicographic scenarios as
jailbreaking tools

Esra Abdelzaher, Ágoston Tóth

Identifying the Most Representative Phraseological Units Using Language Corpora and Artificial Intelligence for Lexicography: The Case of Slovenian Comparative Phrasemes

Matej Meterc, Nataša Jakop

An Electronic Ukrainian Dictionary as a Derussification Tool

Vasyl Starko, Andriy Rysin

Modeling and structuring of a bilingual French-Chinese phraseological dictionary: neural automatic approach for ontology and lexicography

Lian Chen

Image-to-Sense Alignment Using AI Tools

Andrej Perdih, Dejan Gabrovšek, Janoš Ježovnik

Inductive Categorization for Conceptual Analysis with LLMs: A Case Study from the Humanitarian Encyclopedia

Loryn Isaacs, Santiago Chambó, Pilar León-Araúz

Navigating linguistic diversity: modelling diatopic and bibliographic information with TEI Lex-0

Veronika Engler, Karlheinz Mörth, Stephan Procházka, Michaela Rausch-Supola, Daniel Schopper