Kannada Named Entity Recognition and Classification using Support Vector Machine

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2017, Vol 5, Issue 1

Abstract

Named Entity Recognition and Classification (NERC) is a process of identification of proper nouns in the text and classification of those nouns into certain predefined categories like person name, location, organization, date, time etc. Kannada NERC is an essential and challenging work which aims at developing a novel model based on Support Vector Machine. In this paper, tf-idf and POS features are used, which are extracted from a training corpus created manually. Furthermore, the model is trained and tested with different kernels: polynomial, rbf, sigmoid and linear kernels. The details of implementation and performance evaluation are discussed. The experiments are conducted on a training corpus of size 1, 51,440 tokens and test corpus of 7,000, 11,000, 15,000, 20,000, 30,000, 40,000 and 50,000 tokens. It is observed that the model works with an average precision, recall and F1-measure of 87%, 88% and 87.5% respectively for a linear kernel SVM on the test corpus of 7,000 tokens.

Authors and Affiliations

S Amarappa, S V Sathyanarayana

Keywords

Related Articles

SAAS Cloud security : Attacks and Proposed Solutions

Nowadays the Cloud has started to gain ground even in SMEs, in spite of that the Cloud is still unknown for several ... for others few reliable. SaaS represents a promising technology, which grows each year rapidly. Only...

Difficulty-Level Classification for English Writings

The popularity of e-books has grown recently. As the number of e-books continues to increase, the task of categorizing all books manually requires a significant amount of time. If English sentences can be categorized acc...

Image Category Recognition using Bag of Visual Words Representation

Image category recognition is one of the challenging tasks due to difference in image background, illumination, scale, clutter, rotation, etc. Bag-of-Visual-Words (BoVW) model is considered as the standard approach for i...

Learning Style Classification Based on Student's Behavior in Moodle Learning Management System

In learning field, each student has his own learning style that affects his way of get, process, understand and percept information. Determining the learning style of students enhances the performance of learning process...

Data Analysis Application to Investigate Relationships between Metacognition and Learning Styles

The purpose of this study is to examine if learning styles predict effectiveness in learning. Participants of the study consisted of 80 students selected from different classes of the university of science and Technology...

Download PDF file
  • EP ID EP277053
  • DOI 10.14738/tmlai.51.2549
  • Views 59
  • Downloads 0

How To Cite

S Amarappa, S V Sathyanarayana (2017). Kannada Named Entity Recognition and Classification using Support Vector Machine. Transactions on Machine Learning and Artificial Intelligence, 5(1), 43-63. https://europub.co.uk/articles/-A-277053