DISCOVERY OF ALIASES NAME FROM THE WEB
Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8
Abstract
An individual is typically referred by numerous name aliases on the web. Accurate identification of aliases of a given person name is useful in various web related tasks such as information retrieval, sentiment analysis, personal name disambiguation, and relation extraction. We propose a method to extract aliases of a given personal name from the web. Given a personal name, the proposed method first extracts a set of candidate aliases. Second, we rank the extracted candidates according to the likelihood of a candidate being a correct alias of the given name. We propose a novel, automatically extracted lexical pattern-based approach to efficiently extract a large set of candidate aliases from snippets retrieved from a web search engine. We define numerous ranking scores to evaluate candidate aliases using three approaches: lexical pattern frequency, word co-occurrences in an anchor text graph, and page counts on the web. To construct a robust alias detection system, we integrate the different ranking scores into a single ranking function using ranking support vector machines. We evaluate the proposed method on three data sets: an English personal names data set, an English place names data set, and a Japanese personal names data set. The proposed method outperforms numerous baselines and previously proposed name alias extraction methods, achieving a statistically significant mean reciprocal rank (MRR) of 0.67. Experiments carried out using location names and Japanese personal names suggest the possibility of extending the proposed method to extract aliases for different types of named entities, and for different languages. Moreover, the aliases extracted using the proposed method are successfully utilized in an information retrieval task and improve recall by 20 percent in a relation detection task.
Authors and Affiliations
N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar
slugA STUDY ON MBC ALGORITHM WITH GOODNESS FUNCTION
In Data Mining, clustering is one of the efficient techniques used to extract useful information from large quantities of data. A cluster is a collection of data objects relatively similar to one another in some respec...
CONSUMER PERCEPTION TOWARDS HATSUN AGRO PRODUCTS LIMITED, CHENNAI – A CONCEPTUAL STUDY
The study is about the consumer perception towards Hatsun Agro Products (HAP) is a dairy products with special reference to Chennai city. This study brings the information culled from various sources, it includes diffe...
slugAPPLICATION AND IMPLEMENTATION OF CRM IN HOTELS OF DEVELOPING CITIES - A CASE STUDY OF RANCHI
Hotel sells room to the guest. It is the main product that Hotel sells and with the sale of this product, other hotel products like food, beverage, laundry services etc. also get sold. Earlier when the numbers of hotel...
slugSystematic Design of High-Speed and LowPower Digit-Serial Multipliers VLSI Based
Terms of both latency and power Digit-serial implementation styles are best suited for implementation of digital signal processing systems which require moderate sampling rates. Digit-serial architectures obtain using...
Competency Mapping: A conceptual Perspective
Every organization should have well defined roles and responsibilities as well as list of competencies that are required to perform each role efficiently and effectively. Such list of competencies should be used for pe...