Proceedings - eLex 2025

eLex 2025 proceedings are published by Lexical Computing CZ, s.r.o., and are available below as the complete set, or as individual papers.

How to cite the proceedings:

Kosem, I., Jakubíček, M., Medveď, M., Zgaga, K., Arhar Holdt, Š., Munda, T. & Salgado, A. (eds.) (2025). Electronic lexicography in the 21st century (eLex 2025): Intelligent lexicography. Proceedings of the eLex 2025 conference. Bled, 18–20 November 2025. Bled: Lexical Computing CZ s.r.o.

Download complete proceedings

Impressum

Download individual papers

Bridging Human and AI Perspectives: Semantic Annotation of Generic Nouns in German

Iván Arias-Arias, Elena Martín-Cancela

p 1-18

Choosing Suitable Text Corpora for Identifying Collocations – A Case Study of a Large Reference Dictionary of Contemporary German

Luise Köhler, Gregor Middell, Alexander Geyken

p 19-29

A Pipeline for Automated Dictionary Creation with Optional Human Intervention

Thomas Widmann

p 30-40

From Word of the Year to Word of the Week: Daily-updated Monitor Corpora for 25 Languages

Ondřej Herman, Miloš Jakubíček, Jan Kraus, Vít Suchomel

p 41-58

The Dictionary of Contemporary Serbian Language (RSSJ): Advanced Automation and Other Challenges

Ranka Stanković, Rada Stijović, Mihailo Škorić, Cvetana Krstev

p 59-75

So Close but Still Far: Case Study on Application of LLMs in Idioms Identification, Definition, and Generation of Illustrative Examples

Aleksandra Marković, Ranka Stanković

p 76-91

AI- and Corpus-Based Strategies for Identifying Phraseme Constructions: A Pilot Study on Croatian Repetitive Constructions

Slobodan Beliga, Ivana Filipović Petrović

p 92-112

Exploring the Power of Generative Artificial Intelligence for Automatic Term Extraction from Small Samples

Lena De Pourcq, Marie Grégoire, Leonardo Zilio

p 113-135

Lexicom at 25: Reflections on the Changing World of Lexicography and Language Technology

Michael Rundell, Miloš Jakubíček, Vojtěch Kovář, Ondřej Matuška, Michal Cukr

p 136-149

Passive Vocabulary Size of Czech Native Speakers: A Statistical Approach

Marek Blahuš, Miloš Jakubíček, Vojtěch Kovář, František Kovařík

p 150-159

Automatic Non-recorded Sense Detection for Swedish through Word Sense Induction with fine-tuned Word-in-Context models

Dominik Schlechtweg, Emma Sköldberg, Shafqat Mumtaz Virk, James White, Simon Hengchen

p 160-174

DMLex on Wikibase: Legacy Dictionaries as Collaboratively Editable Dataset

Simon Krek, Primož Ponikvar, Andraž Repar, Iztok Kosem, David Lindemann

p 175-189

Handling Abstract Constructions in a Dictionary-Based Constructicon

Bálint Sass, Éva Dömötör, Balázs Indig, Mátyás Lagos Cortes, Veronika Lipp, Márton Makrai, Gergely Pethő

p 190-203

Up to No Good: Exploiting Word Embeddings for an Automatic Extraction of Candidates for a Lexicon of Slovene Taboo Language

Jaka Čibej

p 204-223

Automatically Updated Corpora of EU National Parliaments with Terminology Extraction in Twenty Languages

Marek Blahuš, Ota Mikušek

p 224-237

Automatic Detection of Word Sense Shift from Corpus Data

Ondřej Herman

p 238-252

The Challenges of Syntactic Descriptions of Multiword Expressions in Electronic Lexicography

Verginica Barbu Mititelu, Voula Giouli, Gražina Korvel, Chaya Liebeskind, Irina Lobzhanidze, Rusudan Makhachashvili, Stella Markantonatou, Alexandra Markovic, Ivelina Stoyanova

p 253-273

Automated Transcription of Mixed-Script Dialectal Materials

Markus Kunzmann

p 274-288

ENEOLI Wikibase: A Collaborative Working Platform for the European Network on Lexical Innovation

David Lindemann, Ana Salgado

p 289-300

The Lexicographic Treatment of Homophonic Neologisms in Chinese Dictionaries

Jiang Li, Wang Yi

p 301-321

The Challenge of AI-Generated Neology

Cécile Poix, Natalya Shevchenko

p 322-335

The Role of Subjectivity in Lexicography: Experiments Towards Data-Driven Labeling of Informality

Lydia Risberg, Eleri Aedmaa, Maria Tuulik, Margit Langemets, Ene Vainik, Esta Prangel, Kristina Koppel, Hanna Pook

p 336-356

Mapping Slovene Learner Vocabulary to CEFR Scales with AI-assisted Methods

Mojca Stritar Kučuk

p 357-373

Corpus-Based Methods and AI-Assisted Terminography for Contextonym Analysis

Antonio San Martín

p 374-394

Contrasting a New AI-Powered Dictionary Designed for On-Screen Reading with Electronic Dictionaries That Have Evolved from Print Editions

Ana Frankenberg-Garcia

p 395-417

Implementing Frames in the Phrase-Based Active Dictionary: Why Frames Are Needed but FrameNet Can Only Be a Partial Solution

Laura Rebosio

p 418-438

Toward a Corpus-Based Multilingual Terminology Database for Intercultural Communication

María Iglesias Vázquez, Charlotte Venema, Marie Steffens

p 439-456

LLM-Assisted Dialect Lexicography: Challenges and Opportunities in Processing Historical Bavarian Dialects

Philipp Stöckle, Daniel Elsner, Wolfgang Koppensteiner, Katharina Korecky-Kröll

p 457-479

Corpus-Based Vocabulary Profiling for Ukrainian: From Lexical Analysis to the PULS Digital Learning Platform

Olena Synchak, Vasyl Starko, Mariana Burak, Mykhaylo Svystun

p 480-502

How Effective is AI as a Language Consultant?

Urška Vranjek Ošlak

p 503-516

Lexical-Semantic Resources as a Culture-Aware Basis for Benchmarking and Evaluation of LLMs

Nathalie Norman, Sanni Nimb, Sussi Olsen, Nina Schneidermann, Bolette S. Pedersen

p 517-533

Better Something Than Nothing: Analysis of GPT-4 Performance in Identifying Croatian Proverbs

Nikola Bakarić

p 534-544

Exploring Derivational Families through Intelligent Lexicography

Krešimir Šojat, Kristina Kocijan

p 545-564

A Corpus-Based Dictionary for the Endangered Megrelian Language

Irina Lobzhanidze, Rusudan Gersamia

p 565-586

Comparative Analysis of Medical Adjectives in Croatian General Dictionaries

Martina Pavić, Daša Farkaš

p 587-610

Automating Adjectival Microstructures in Monolingual Dictionaries: A New Method Combining Embeddings and LLMs

Enikő Héja, László Simon, Veronika Lipp

p 611-628

Using Large Language Models to Generate Distractors for Language Games

Iztok Kosem, Špela Arhar Holdt

p 629-644

CJVT Igre: New Word Games Based on the Digital Dictionary Database of Slovene

Špela Arhar Holdt, Iztok Kosem

p 645-660

Neology in Practice: Lexicographic and Terminological Approaches to Lexical Innovation

Jelena Kallas, Kristina Koppel, Kris Heylen, Ilan Kernerman, Ana Ostroški Anić, Federica Vezzani, Špela Arhar Holdt

p 661-681

Exploring the Constructicographic Potential of Lexicographic Data and Language Models: The Case of the Estonian Nominal Quantifier Construction

Heete Sahkai, Geda Paulsen, Ene Vainik, Jelena Kallas, Ahto Kiil, Katrin Tsepelina, Kertu Saul, Arvi Tavast

p 682-701

Compiling Bilingual Dictionaries: AI-Assisted Translation of Italian Multiword Expressions into English and French

Annalisa Greco, Matteo Delsanto, Andrea Di Fabio, Lorenzo Mori, Cristina Onesti, Daniele Paolo Radicioni, Calogero Jerik Scozzaro

p 702-728

Documenting the Final Days of Monolingual English Learners’ Dictionaries Using the Archived Web

Geraint Paul Rees

p 729-748

Compiling a Candidate List of Taboo Constructions for an Under-Resourced Language

Monique Rabé, Martin J. Puttkammer, Gerhard B. van Huyssteen

p 749-765

The Mangalam Dictionary of Buddhist Sanskrit: Automating Lexicographic Data with Generative LLMs

Ligeia Lugli

p 766-782

You Get It Through Lexicography: Extracting Suppressed Language from LLMs Using Lexicographic Scenarios as Jailbreaking Tools

Esra Abdelzaher, Ágoston Tóth

p 783-803

Identifying the Most Representative Phraseological Units Using Language Corpora and Artificial Intelligence for Lexicography: The Case of Slovenian Comparative Phrasemes

Matej Meterc, Nataša Jakop

p 804-823

An Electronic Ukrainian Dictionary as a Derussification Tool

Vasyl Starko, Andriy Rysin

p 824-838

Modeling and Structuring of a Bilingual French-Chinese Phraseological Dictionary: Neural Automatic Approach for Ontology and Lexicography

Lian Chen

p 839-860

Image-to-Sense Alignment Using AI Tools

Andrej Perdih, Dejan Gabrovšek, Janoš Ježovnik

p 861-874

Inductive Categorization for Conceptual Analysis with LLMs: A Case Study from the Humanitarian Encyclopedia