H B Patil, A S Patil and B V Pawar. Article: Part-of-Speech Tagger for Marathi Language using Limited Training Corpora. IJCA Proceedings on National Conference on Recent Advances in Information Technology NCRAIT(4):33-37, February 2014. BibTeX
@article{key:article,
  author = {H. B. Patil and A. S. Patil and B. V. Pawar},
  title = {Article: Part-of-Speech Tagger for Marathi Language using Limited Training Corpora},
  journal = {IJCA Proceedings on National Conference on Recent Advances in Information Technology},
  year = {2014},
  volume = {NCRAIT},
  number = {4},
  pages = {33-37},
  month = {February},
  note = {Full text available}
}
Part-of-speech tagging for Marathi is a complex task because Marathi is a highly inflectional language with free word order. In this paper we present a rule-based part-of-speech tagger for Marathi. The tagger uses hand-constructed rules, some derived from a corpus and others added manually after studying the grammar of Marathi. Disambiguation is performed by analyzing the linguistic features of the word, its preceding word, its following word, etc. Tested on three different data sets, the system achieved an average accuracy of 78.82%.
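To illustrate the kind of context-based disambiguation the abstract describes, the following is a minimal sketch of a rule-based tagger that looks at a word's own candidate tags, the tag of the preceding word, and the candidate tags of the following word. It is not the authors' implementation: the abstract does not give the tagset, lexicon, or rules, so the `LEXICON`, the tag names, the three rules, and the function names `candidates`, `disambiguate` and `tag` are all hypothetical, and English toy words stand in for Marathi tokens.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy lexicon: token -> candidate tags.
# Placeholder English entries, NOT data or tags from the paper.
LEXICON: Dict[str, Set[str]] = {
    "the": {"DET"},
    "they": {"PRON"},
    "can": {"NOUN", "VERB", "AUX"},
    "fish": {"NOUN", "VERB"},
}


def candidates(word: str) -> Set[str]:
    """Return the candidate tags for a word; unknown words default to NOUN (assumption)."""
    return set(LEXICON.get(word.lower(), {"NOUN"}))


def disambiguate(cands: Set[str], prev_tag: Optional[str], next_cands: Set[str]) -> str:
    """Pick one tag using hand-written context rules (all rules are illustrative)."""
    if len(cands) == 1:
        return next(iter(cands))
    # Rule 1: after a determiner, prefer the noun reading.
    if prev_tag == "DET" and "NOUN" in cands:
        return "NOUN"
    # Rule 2: after a pronoun, prefer AUX when the following word can be a verb.
    if prev_tag == "PRON" and "AUX" in cands and "VERB" in next_cands:
        return "AUX"
    # Rule 3: after a pronoun or auxiliary, prefer the verb reading.
    if prev_tag in {"PRON", "AUX"} and "VERB" in cands:
        return "VERB"
    # Fallback: deterministic choice when no rule fires.
    return sorted(cands)[0]


def tag(tokens: List[str]) -> List[str]:
    """Tag tokens left to right, using the previous tag and the next word's candidates."""
    tags: List[str] = []
    for i, token in enumerate(tokens):
        prev_tag = tags[-1] if tags else None
        next_cands = candidates(tokens[i + 1]) if i + 1 < len(tokens) else set()
        tags.append(disambiguate(candidates(token), prev_tag, next_cands))
    return tags


if __name__ == "__main__":
    print(tag(["they", "can", "fish"]))
    print(tag(["the", "can"]))
```

In this sketch the ambiguous word "can" receives a different tag depending on its neighbours, which is the general idea behind context-based disambiguation. A tagger for a highly inflectional language would presumably also need morphological analysis of suffixes, which this example leaves out.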