Towards Electronic Lexicography for the Kurdish Language

Authors:

Sina Ahmadi, Hossein Hassani, John P. McCrae

Publication Type:

Refereed Conference Meeting Proceeding

Abstract:

This paper describes the development of lexicographic resources for Kurdish and provides a lexical model for this language. Kurdish is considered a less-resourced language, and currently, lacks the machine-readable lexicon resources. The unique potential which Linked Data and the Semantic Web offer to e-lexicography enables interoperability across lexical resources by elevating the traditional linguistic data to machine-processable semantic formats. Therefore, we present our lexicon in Ontolex-Lemon ontology as a standard model for sharing lexical information on the Semantic Web. The research covers Sorani, Kurmanji, and Hawrami dialects of Kurdish. This research suggests that although Kurdish is a less-resourced language, in terms of documented lexicons, it owns a wide range of resources, but because they are machine-readable, they could not contribute to the language processing. The outcome of this project, which is made publicly available, assists scholars in their efforts towards making Kurdish a resource-rich language.

Conference Name:

Sixth Biennial Conference on Electronic Lexicography, eLex 2019

Digital Object Identifer (DOI):

10.5281/zenodo.3518950

Publication Date:

03/10/2019

Conference Location:

Portugal

Research Group:

Linked Data

Institution:

National University of Ireland, Galway (NUIG)

Open access repository:

Yes

Publication document:

Towards Electronic Lexicography for the Kurdish Language