DISCOVERY OF ALIASES NAME FROM THE WEB

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

An individual is typically referred by numerous name aliases on the web. Accurate identification of aliases of a given person name is useful in various web related tasks such as information retrieval, sentiment analysis, personal name disambiguation, and relation extraction. We propose a method to extract aliases of a given personal name from the web. Given a personal name, the proposed method first extracts a set of candidate aliases. Second, we rank the extracted candidates according to the likelihood of a candidate being a correct alias of the given name. We propose a novel, automatically extracted lexical pattern-based approach to efficiently extract a large set of candidate aliases from snippets retrieved from a web search engine. We define numerous ranking scores to evaluate candidate aliases using three approaches: lexical pattern frequency, word co-occurrences in an anchor text graph, and page counts on the web. To construct a robust alias detection system, we integrate the different ranking scores into a single ranking function using ranking support vector machines. We evaluate the proposed method on three data sets: an English personal names data set, an English place names data set, and a Japanese personal names data set. The proposed method outperforms numerous baselines and previously proposed name alias extraction methods, achieving a statistically significant mean reciprocal rank (MRR) of 0.67. Experiments carried out using location names and Japanese personal names suggest the possibility of extending the proposed method to extract aliases for different types of named entities, and for different languages. Moreover, the aliases extracted using the proposed method are successfully utilized in an information retrieval task and improve recall by 20 percent in a relation detection task.

Authors and Affiliations

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar

Keywords

Related Articles

slugMicro finance – Role of Banking intermediaries in Inclusive Economic Growth

Micro finance is most challenging financial act of modern banks in India. It is a scaleable antipoverty solution to rural credit barriers. The main objective of this study is to conceptualize the operational methodologi...

An Application of Porters Stemming Algorithm for Text Mining in Healthcare

Text mining has diverse applications in variety of fields where manual analysis and generating effective knowledge discovery from information is not possible because of huge availability of information on website. Ther...

slugComparing Search Algorithms of Unstructured P2P Networks

Computing has passed through many stages since the birth of the first computing machines. A centralized solution has one component that is shared by users all the time. All resources are accessible, but there is a sing...

DEVELOPMENTAL COMPETENCE MAPPING OF UTTARAKHAND AS A TOURIST DESTINATION IN INDIA: A CRITIQUE

In the present times the tourism industry across the globe is the sector which has the topmost growth rate. This sector has seen miraculous advancement in the revenues and profits for various economics throughout the w...

Static and Dynamic analysis of rectangular isotropic plate using multiquadric radial basis function

This paper presents a methodology based on the collocation multiquadric radial basis functions to analyze the static and dynamic behavior of isotropic rectangular plates. The inertia and dissipative terms are evaluated...

Download PDF file
  • EP ID EP18501
  • DOI -
  • Views 355
  • Downloads 14

How To Cite

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar (2012). DISCOVERY OF ALIASES NAME FROM THE WEB. International Journal of Management, IT and Engineering, 2(8), -. https://europub.co.uk/articles/-A-18501