Malay Named Entity Recognition Using Rule Based Approach

Ulfa Nadia, Nazlia Omar

Abstract


Named Entity Recognition (NER) research based on rule is widely investigated and is used in various languages mainly English. However, the English NER rules are different with Malay language due to different morphology. Some of challenging issue in Malay is cross reference between named entities, and entity repetition. This paper proposes to solve the issues in Malay NER. This study starts by providing Malay online news corpus, gazeteer development, rules development and evaluation. This study focus on nine name entities i.e person, organization, position, date, time, currency, measurement and percentage. Overall the experimental result shows 90.23% precision, 92.13% recall and 91.05% f-measure. The outcome from this research is expected to help other researchers in implementing the Malay NER using rule based approach through the addition of new rules to achieve higher accuracy.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.


e-ISSN : 2289-2192

For any inquiry regarding our journal please contact our editorial board by email apjitm@ukm.edu.my