Automation and Validation of Annotation for Hindi Anaphora Resolution

Abstract

The process of labelling any language genre by which one can extract useful information is called annotation. This provides syntactic information about a word or a word phrase. In this paper, an effort has been made to provide the algorithm for semiautomatic annotation for Hindi text to cater anaphora resolution only. The study was conducted on twelve files of Ranchi Express available in EMILLE corpus. The corpus is originally tagged for demonstrative pronouns. The detection of the pronouns is supported by the incorporation of seven tags. However the semantic interpretation of the demonstrative pronoun is not supported in the original corpus. In this paper an effort has been made to automate the process of tagging as well as the handling of semantic information through addition tags. It was conducted on 1485 demonstrative pronouns. The average accuracy of precision, recall and F measure is 74, 71 and 72 respectively.

Authors and Affiliations

Pardeep Singh, Kamlesh Dutta

Keywords

Related Articles

A Novel Mapreduce Lift Association Rule Mining Algorithm (MRLAR) for Big Data

Big Data mining is an analytic process used to discover the hidden knowledge and patterns from a massive, complex, and multi-dimensional dataset. Single-processor's memory and CPU resources are very limited, which makes...

A Mobile Device Software to Improve Construction Sites Communications "MoSIC"

Effective communication among project participants in construction sites is a real dilemma for construction projects productivity. To improve the efficiency of participants in construction projects and have a speedy deli...

RSECM: Robust Search Engine using Context-based Mining for Educational Big Data

With an accelerating growth in the educational sector along with the aid of ICT and cloud-based services, there is a consistent rise of educational big data, where storage and processing become the prime matter of challe...

Monitoring Vaccine Cold Chain Model with Coloured Petri Net

To protect and prevent vaccines from excessively high or low temperatures throughout the supply chain, from manufacturing to administration, it is necessary to monitor and evaluate vaccine cold chain performance in real...

Autonomic Computing for Business Applications

Autonomic computing, a new deployment technology introduced by IBM a decade ago, to manage the ever increasing complexity of IT systems, has become a part of many large scale deployments today. A lot of inroads have been...

Download PDF file
  • EP ID EP127919
  • DOI 10.14569/IJACSA.2015.061025
  • Views 119
  • Downloads 0

How To Cite

Pardeep Singh, Kamlesh Dutta (2015). Automation and Validation of Annotation for Hindi Anaphora Resolution. International Journal of Advanced Computer Science & Applications, 6(10), 179-185. https://europub.co.uk/articles/-A-127919