An Efficient Method for Noisy Annotation Data Modeling
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5
Abstract
Abstract : Probabilistic topic models are used for analyzing and extracting content-related annotations from noisy annotated discrete data like WebPages on WWW and these WebPages are stored using social bookmarking services with the help of social bookmarking services, reason behind this process most of time users can attach annotations freely, some annotations do not describe the semantics of the content, therefore they are noisy, simply they are not content related. The extraction of content-related annotations can be used as a prepossessing step in machine learning. Prepossessing step in machine learning is like text classification and image recognition, and can improve information retrieval performance. The proposed model is a generative model for content and annotations, where annotations are assumed to be originated either from topics that generated the content or from a general distribution unrelated to the content. We demonstrate the effectiveness of the proposed method with the help of synthetic data and real social annotation data for text and images
Authors and Affiliations
Sushama Shinde , Shyam Gupta
The Incidence And Associated Factors of Obstetric Near Miss in A Referral Institution in Manipur
Background: The World Health Organization (WHO) estimated that, in the year 2000, 20 million women suffered acute complications in pregnancy with the occurrence of 529,000maternal deaths. Nevertheless, in a systema...
Effect of Angle Orientation of Flat Mirror Concentrator on Solar Panel System Output
Abstract: In this research two flat glass mirrors is used as concentrator of solar panel system. The mirrors increase's the concentration of sun light ray on the solar module. Anew model of solar panel system is designed...
Association Rule Mining using Apriori Algorithm for Distributed System: a Survey
Abstract : Data mining technologies provided through Cloud computing is an absolutely necessary characteristic for today’s businesses to make proactive, knowledge driven decisions, as it helps them have future tr...
A Statistical Approach to perform Web Based Summarization
Over the past decade more and more users of the Internet rely on the search engines to help them find the information they need. However the information they find depends to a large extent, on the ranking mechanism o...
Study on Live analysis of Windows Physical Memory
Memory forensics and data carving methods are usually used during volatile investigation and is nowadays a big area of interest. Volatile memory dump is used for offline analysis of live data. Live analysis of &...