Conference Proceedings

Third International Conference on Advances In Computing, Control And Networking - ACCN 2015

A Rule-Based Setswana Verb Lemmatizer

Author(s) : G. A. MALEMA, M. LEFOANE, N. P. MOTLOGELWA

Abstract

Lemmatization is a pre-processing stage in several natural language processing applications, such as data retrieval. There have been a few attempts at Setswana word lemmatization. However, existing Setswana lemmatizers do not show in detail where lemmatization fails and thereby reduces performance. This paper presents a detailed rule-based Setswana verb lemmatizer. Challenges in verb lemmatization are pointed out by word category. The overall results show that rule-based Setswana verb lemmatization performs well, at 87%. However, reflexive verbs have a significantly large percentage of exceptions.

Conference Title : Third International Conference on Advances In Computing, Control And Networking - ACCN 2015
Conference Date(s) : 28-29 December, 2015
Place : Hotel Lebua at State Tower, Bangkok, Thailand
No. of Author(s) : 3
DOI : 10.15224/978-1-63248-082-8-08
Page(s) : 38 - 45
Electronic ISBN : 978-1-63248-082-8