DISCOVERY OF ALIASES NAME FROM THE WEB

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

An individual is typically referred by numerous name aliases on the web. Accurate identification of aliases of a given person name is useful in various web related tasks such as information retrieval, sentiment analysis, personal name disambiguation, and relation extraction. We propose a method to extract aliases of a given personal name from the web. Given a personal name, the proposed method first extracts a set of candidate aliases. Second, we rank the extracted candidates according to the likelihood of a candidate being a correct alias of the given name. We propose a novel, automatically extracted lexical pattern-based approach to efficiently extract a large set of candidate aliases from snippets retrieved from a web search engine. We define numerous ranking scores to evaluate candidate aliases using three approaches: lexical pattern frequency, word co-occurrences in an anchor text graph, and page counts on the web. To construct a robust alias detection system, we integrate the different ranking scores into a single ranking function using ranking support vector machines. We evaluate the proposed method on three data sets: an English personal names data set, an English place names data set, and a Japanese personal names data set. The proposed method outperforms numerous baselines and previously proposed name alias extraction methods, achieving a statistically significant mean reciprocal rank (MRR) of 0.67. Experiments carried out using location names and Japanese personal names suggest the possibility of extending the proposed method to extract aliases for different types of named entities, and for different languages. Moreover, the aliases extracted using the proposed method are successfully utilized in an information retrieval task and improve recall by 20 percent in a relation detection task.

Authors and Affiliations

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar

Keywords

Related Articles

Marketing strategies of Patanjali Ayurved (FMCG) in present market scenario

India is one of the biggest developing business sector with an aggregate populace over one billion. After post-progression the nearness of MNC indicating extraordinary rivalry among organizations for their item. They a...

A STUDY ON CASH FLOW STATEMENT ANALYSIS WITH SPECIAL REFERENCE TO JET AIRWAYS

In the developing world the are many firms which has been opened but there are only few firms which is able to withstand. Few firms has more assets and less cash and vice versa (i.e, the working capital will be in a go...

Role of Mobile computing in developing technologies

Mobile computing has changed the complete panorama of our everyday lifestyles. It is fitting most important due to the upward thrust within the number of transportable computer systems and the wish to have steady commu...

SPORTING EVENTS OPENING HORIZONS FOR INDIA UNDER THE INTERNATIONAL UMBRELLA

Over the past there has been a developing familiarity with the huge effect that facilitating sporting events occasions can have on a country.Thispaperdiscussesthecontextof hosting a sporting event and its impact on tou...

RELATIONSHIP BETWEEN EMOTIONAL INTELLIGENCE AND ETHICAL COMPETENCE: AN EMPIRICAL STUDY

Researchers have stated that the attitudes and behaviors of future organizations leaders depend on the current university students. Students need to have a proper understanding of ethical behavior that will provide the...

Download PDF file
  • EP ID EP18501
  • DOI -
  • Views 332
  • Downloads 14

How To Cite

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar (2012). DISCOVERY OF ALIASES NAME FROM THE WEB. International Journal of Management, IT and Engineering, 2(8), -. https://europub.co.uk/articles/-A-18501