Kannada Named Entity Recognition and Classification using Support Vector Machine
Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2017, Vol 5, Issue 1
Abstract
Named Entity Recognition and Classification (NERC) is a process of identification of proper nouns in the text and classification of those nouns into certain predefined categories like person name, location, organization, date, time etc. Kannada NERC is an essential and challenging work which aims at developing a novel model based on Support Vector Machine. In this paper, tf-idf and POS features are used, which are extracted from a training corpus created manually. Furthermore, the model is trained and tested with different kernels: polynomial, rbf, sigmoid and linear kernels. The details of implementation and performance evaluation are discussed. The experiments are conducted on a training corpus of size 1, 51,440 tokens and test corpus of 7,000, 11,000, 15,000, 20,000, 30,000, 40,000 and 50,000 tokens. It is observed that the model works with an average precision, recall and F1-measure of 87%, 88% and 87.5% respectively for a linear kernel SVM on the test corpus of 7,000 tokens.
Authors and Affiliations
S Amarappa, S V Sathyanarayana
Unied Acoustic Modeling using Deep Conditional Random Fields
Acoustic models based on Deep Neural Networks (DNNs) lead to sig- nicant improvement in the recognition accuracy. In these methods, Hid- den Markov Models (HMMs) state scores are computed using exible dis- criminant DN...
Design of a Smart Model For Geolocalisation and E-commerce in the Semantic Web
Currently, ecommerce has become a pillar of the economy; there are huge growths of websites that offer different products to sell.This variety of web portals and products requires intelligent and autonomous operation to...
Research on Linear Fractional Town Traffic Flow Model Tactic
Traffic flow is a worldwide problem. It has many influencing factors and it is the complex system. Fractional calculus is a powerful tool for dealing with complex systems. Fractional calculus is a direct way of extending...
Integration of the Cloud Environment in E-learning Systems
Nowadays, elearning systems have known a major revolution, especially with the emergence of new information and communication technologies and the enormous growth in the number of learners, educational content and resour...
Improved HMM for Cursive Arabic Handwriting Recognition System Using MLP Classifier
Recognizing unconstrained cursive Arabic handwritten text is a very challenging task the use of hybrid classification to take advantage of the strong modeling of Hidden Markov Models (HMM) and the large capacity of discr...