Named entities identification

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2014, Vol 7, Issue 4

Abstract

An important topic in natural language processing is represented by named entities recognition inside texts. This article describes a novel approach used for detecting named entities that tries to improve the results obtained with the named entity recognition module from Stanford NLP library. In order to determine and classify named entities, this new model uses the Naive Bayes classifier. Our method is focused on named entities of type person and organization but it can be easily extended to other types of named entities. As training data we are using text that is manually annotated, text annotated with Stanford NLP toolkit and a set of XML files containing rules that describe different patterns. After the training, we are using the naive Bayes classifier in order to classify new entities. As test data we are using a Reuters collection of approximate 25000 articles among which 150 articles were manually annotated and used as training data. In order to evaluate the method we are computing the precision, the recall and the F1 factor.

Authors and Affiliations

Liviu Sebastian Matei ,Ştefan Trăuşan-Matu

Keywords

Related Articles

An Analysis Of The Quality And Accessibility Of Suicide Information Available To The Romanian-Speaking User

As the potential impact of Internet use on suicidal behaviour is currently under questioned, experts have yet not conclusively ruled on the extent of this problem. At the moment, no one really knows what kind of informat...

Testing A Component For Learning Mathematics With Users

The usability and accessibility are considered two major attributes in designing of necessary software for computer assisted learning through assistive technology (for disabled). Assessed separately, most evaluations are...

Interactive marketing system on Internet

This paper presents IMM-Market, an interactive marketing system on Internet, that can be used by any organization for the future development of business in a modern manner. The system was developed as a web portal and in...

Virtual Reality Model in Geographical Information Systems

The paper presents a software architecture to implement a virtual reality model inside the Geographical Information Systems (GIS). Spatial data provides a schematic view of reality, so it is necessary to use raster data...

Reading Space Secrets - A Serious Game Centered on Reading Strategies

Serious games based on reading strategies are an efficient alternative for improving students’ capabilities of text understanding. In today’s industry and academic environments, there are various games that target readin...

Download PDF file
  • EP ID EP28955
  • DOI -
  • Views 356
  • Downloads 8

How To Cite

Liviu Sebastian Matei, Ştefan Trăuşan-Matu (2014). Named entities identification. Romanian Journal of Human - Computer Interaction, 7(4), -. https://europub.co.uk/articles/-A-28955