Named entities identification

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2014, Vol 7, Issue 4

Abstract

Named entity recognition in text is an important topic in natural language processing. This article describes a novel approach to detecting named entities that aims to improve on the results obtained with the named entity recognition module of the Stanford NLP library. To identify and classify named entities, the new model uses a naive Bayes classifier. Our method focuses on named entities of type person and organization, but it can easily be extended to other entity types. The training data consists of manually annotated text, text annotated with the Stanford NLP toolkit, and a set of XML files containing rules that describe different patterns. After training, the naive Bayes classifier is used to classify new entities. The test data is a Reuters collection of approximately 25,000 articles, 150 of which were manually annotated and used as training data. To evaluate the method, we compute precision, recall, and the F1 score.
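The abstract does not give implementation details, so the following is only a minimal sketch of the general technique it names: a naive Bayes classifier with Laplace smoothing that labels candidate tokens as person or organization, plus per-class precision, recall, and F1. The feature set (`features`), the class and function names, and the toy training pairs are all hypothetical illustrations and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def features(token, prev_token):
    """Hypothetical features for a candidate token (not the paper's feature set)."""
    return [
        f"cap={token[:1].isupper()}",
        f"prev={prev_token.lower()}",
        f"suffix={token[-3:].lower()}",
    ]


class NaiveBayes:
    """Naive Bayes over categorical features with Laplace (add-alpha) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.label_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()

    def fit(self, samples):
        # samples: iterable of (feature_list, label) pairs
        for feats, label in samples:
            self.label_counts[label] += 1
            for f in feats:
                self.feature_counts[label][f] += 1
                self.vocab.add(f)

    def predict(self, feats):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(count / total)
            denom = (sum(self.feature_counts[label].values())
                     + self.alpha * len(self.vocab))
            for f in feats:
                score += math.log(
                    (self.feature_counts[label][f] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def precision_recall_f1(gold, predicted, target):
    """Precision, recall and F1 for one target class."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == target and p == target)
    fp = sum(1 for g, p in pairs if g != target and p == target)
    fn = sum(1 for g, p in pairs if g == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Toy training pairs, invented purely for illustration.
train = [
    (features("Smith", "Mr."), "PERSON"),
    (features("Jones", "Mrs."), "PERSON"),
    (features("Corp", "Acme"), "ORGANIZATION"),
    (features("Inc", "Globex"), "ORGANIZATION"),
]
model = NaiveBayes()
model.fit(train)
```

Calling `model.predict(features("Brown", "Mr."))` on this toy model classifies an unseen token from its context, and `precision_recall_f1(gold, predicted, "PERSON")` scores a list of predictions against gold labels for one class. A real system would need far richer features and training data than this sketch uses.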

Authors and Affiliations

Liviu Sebastian Matei, Ştefan Trăuşan-Matu

How To Cite

Liviu Sebastian Matei, Ştefan Trăuşan-Matu (2014). Named entities identification. Romanian Journal of Human - Computer Interaction, 7(4), -. https://europub.co.uk/articles/-A-28955