{"id":387,"date":"2015-07-26T22:11:20","date_gmt":"2015-07-26T22:11:20","guid":{"rendered":"https:\/\/elex.link\/elex2015\/?page_id=387"},"modified":"2015-08-08T17:42:05","modified_gmt":"2015-08-08T17:42:05","slug":"paper-19","status":"publish","type":"page","link":"https:\/\/elex.link\/elex2015\/conference-proceedings\/paper-19\/","title":{"rendered":"paper-19"},"content":{"rendered":"<h3><strong>Using a Maximum Entropy Classifier to link \u201cgood\u201d corpus examples to dictionary senses<\/strong><\/h3>\n<p><strong>Authors:<\/strong> Alexander Geyken, Christian P\u00f6litz, Thomas Bartz<\/p>\n<p style=\"text-align: justify;\"><strong>Abstract:<\/strong><br \/>\nA particular problem of maintaining dictionaries consists of replacing outdated example sentences by corpus examples that are up-to-date. Extraction methods such as the good example finder (GDEX; Kilgarriff, 2008) have been developed to tackle this problem. We extend GDEX to polysemous entries by applying machine learning techniques in order to map the example sentences to the appropriate dictionary senses. The idea is to enrich our knowledge base by computing the set of all collocations and to use a maximum entropy classifier (MEC; Nigam, 1999) to learn the correct mapping between corpus sentence and its correct dictionary sense. Our method is based on hand labeled sense annotations. Results reveal an accuracy of 49.16% (MEC) which is significantly better than the Lesk algorithm (31.17%).<\/p>\n<p><strong>Keywords:<\/strong> WSD; maximum entropy; collocations; legacy dictionaries; example sentences<\/p>\n<p><strong>Reference:<\/strong> In Kosem, I., Jakubi\u010dek, M., Kallas, J.,\u00a0Krek, S. (eds.) <em>Electronic lexicography in the 21st century: linking lexical data in the digital age. Proceedings of the eLex 2015 conference, 11-13 August 2015, Herstmonceux Castle, United Kingdom<\/em>. Ljubljana\/Brighton: Trojina, Institute for Applied Slovene Studies\/Lexical Computing Ltd., pp. 304-314.<\/p>\n<p><strong>URL:<\/strong> <a href=\"https:\/\/elex.link\/elex2015\/proceedings\/eLex_2015_19_Geyken+Politz+Bartz.pdf\">https:\/\/elex.link\/elex2015\/proceedings\/eLex_2015_19_Geyken+Politz+Bartz.pdf<\/a><\/p>\n<p><strong>P<\/strong><strong>ublished: <\/strong>2015<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Using a Maximum Entropy Classifier to link \u201cgood\u201d corpus examples to dictionary senses Authors: Alexander Geyken, Christian P\u00f6litz, Thomas Bartz Abstract: A particular problem of maintaining dictionaries consists of replacing outdated example sentences by corpus examples that are up-to-date. Extraction &hellip; <a class=\"more-link\" href=\"https:\/\/elex.link\/elex2015\/conference-proceedings\/paper-19\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":4,"featured_media":0,"parent":327,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-387","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/pages\/387","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/comments?post=387"}],"version-history":[{"count":4,"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/pages\/387\/revisions"}],"predecessor-version":[{"id":551,"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/pages\/387\/revisions\/551"}],"up":[{"embeddable":true,"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/pages\/327"}],"wp:attachment":[{"href":"https:\/\/elex.link\/elex2015\/wp-json\/wp\/v2\/media?parent=387"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}