Towards a Comprehensive Dictionary of Middle Persian

Authors

  • Francisco Mondaca
  • Kianoosh Rezania
  • Slavomír Čéplö
  • Claes Neuefeind

Keywords:

corpus-based dictionary, Middle Persian, API, REST, GraphQL

Abstract

This paper discusses the development of a flexible and comprehensive model for a bilingual corpus-based dictionary whose source language is a dead language, in this case Zoroastrian Middle Persian, with a particular focus on accommodating termini technici and multi-word expressions. Advanced search capabilities are achieved through the integration of state-of-the-art technologies, and there are plans to enhance the system further with advanced natural language processing techniques. The project offers two distinct APIs to cater to diverse user needs and ensure efficient access to lexical data: a dedicated API designed specifically for the web application, and a REST API that simplifies data access and promotes scalability. The project also acknowledges the potential for future integration with large language models. This approach encourages collaboration and innovation in historical linguistics and highlights the crucial role of adaptable, cutting-edge technologies in developing a robust lexicon for historical languages.

Published

2023-06-29