paper-17

Editing an automatically-generated index with K Index Editing Tool

Authors: Kseniya Egorova

Abstract:
This paper presents the editing process of a new Russian–English index using dedicated software. The initial index was generated automatically from the semi-bilingual Password English learner’s dictionary for speakers of Russian and the editing was carried out with K Index Editing Tool (KIET). Initially, the editor was provided with the raw index produced according to a set of pre-established principles. It contained all the Russian translations from the Password database, converted to potential Russian headwords arranged in alphabetical order and accompanied by the part of speech of the original English equivalents. The revision process then consisted of modifying, removing or adding headwords, confirming or amending their automatically associated part of speech, and matching and re-ordering links to their English equivalents. At the final stage the index was proofread line by line for spelling and grammar mistakes, resulting in a change in index size from 31,666 to 29,039 headwords with 45,929 senses. The paper also demonstrates the main features of KIET and highlights some of the problem areas and major challenges we faced while revising the index.

Keywords: Russian–English index; automatically-generated index; editorial tool

Reference: In Kosem, I., Jakubiček, M., Kallas, J., Krek, S. (eds.) Electronic lexicography in the 21st century: linking lexical data in the digital age. Proceedings of the eLex 2015 conference, 11-13 August 2015, Herstmonceux Castle, United Kingdom. Ljubljana/Brighton: Trojina, Institute for Applied Slovene Studies/Lexical Computing Ltd., pp. 268-280.

URL: https://elex.link/elex2015/proceedings/eLex_2015_17_Egorova.pdf

Published: 2015