Named entities identification

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2014, Vol 7, Issue 4

Abstract

An important topic in natural language processing is represented by named entities recognition inside texts. This article describes a novel approach used for detecting named entities that tries to improve the results obtained with the named entity recognition module from Stanford NLP library. In order to determine and classify named entities, this new model uses the Naive Bayes classifier. Our method is focused on named entities of type person and organization but it can be easily extended to other types of named entities. As training data we are using text that is manually annotated, text annotated with Stanford NLP toolkit and a set of XML files containing rules that describe different patterns. After the training, we are using the naive Bayes classifier in order to classify new entities. As test data we are using a Reuters collection of approximate 25000 articles among which 150 articles were manually annotated and used as training data. In order to evaluate the method we are computing the precision, the recall and the F1 factor.

Authors and Affiliations

Liviu Sebastian Matei ,Ştefan Trăuşan-Matu

Keywords

Related Articles

Model-Driven Engineering of User Interfaces: Promises, Successes, Failures, and Challenges

Model-driven engineering (MDE) of user interfaces consists in describing a user interface and aspects involved in it (e.g., task, domain, context of use) in models from which a final interface is produced. With one bi...

Aggregating textual and video data from movies

In this paper, we present an automatically annotated corpus based on movie screenplays (script) and subtitles. We extract the relevant textual information from movie screenplays and subtitles using a regular expression a...

Controlling The Applications Running On A Windows System By Means Of Android Devices

This article presents an application that the authors have developed for the Android platform, which allows a user to remotely control the applications on a computer which has the operating system Microsoft Windows. Ther...

UsiGesture: Test and Evaluation of an Environment for Integrating Gestures in User Interfaces

User interfaces allowing gesture recognition and manipulation are becoming more and more popular these last years. It however remains a hard task for programmers to developer such interfaces : some knowledge of recogniti...

An Intra and Inter-Topic Evaluation and Cleansing Method

Topic modeling is a growing research field and novel ways of interpreting and evaluating results are necessary. We propose a method for evaluating and improving the performance of topic models generating algorithms relyi...

Download PDF file
  • EP ID EP28955
  • DOI -
  • Views 371
  • Downloads 8

How To Cite

Liviu Sebastian Matei, Ştefan Trăuşan-Matu (2014). Named entities identification. Romanian Journal of Human - Computer Interaction, 7(4), -. https://europub.co.uk/articles/-A-28955