Empirical Analysis of Document Similarity Using Statistical Model

Abstract

Information retrieval is the technology behind web search services. This paper presents a statistical method for content-based information retrieval. Three main paradigms of models are used for retrieving information: the Boolean, probabilistic, and vector space models. The paper also presents empirical studies of document similarity and discusses issues in information retrieval systems that use a statistical model. The vector space model is the classical and most widely used retrieval model. Retrieval is performed by computing the cosine similarity between the query vector and the set of document vectors. Finally, we conclude with the results alongside human scores for various types of documents, such as sports, politics, and short stories.
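The cosine-similarity retrieval step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the raw term-frequency weighting, the whitespace tokenizer, and the names term_vector, cosine_similarity, and rank_documents are assumptions made for illustration, since the abstract does not specify them.

```python
import math
from collections import Counter


def term_vector(text):
    """Build a raw term-frequency vector.

    Assumption: lowercase whitespace tokens with raw counts; the
    abstract does not say which term weighting the paper uses.
    """
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def rank_documents(query, documents):
    """Return (score, document index) pairs, best match first."""
    q = term_vector(query)
    scored = [
        (cosine_similarity(q, term_vector(doc)), i)
        for i, doc in enumerate(documents)
    ]
    # Sort by descending score; ties keep the original document order.
    return sorted(scored, key=lambda s: (-s[0], s[1]))


if __name__ == "__main__":
    # Hypothetical mini-corpus, one document per topic named in the abstract.
    docs = [
        "the team won the football match",
        "the parliament passed the election bill",
        "a short story about a quiet village",
    ]
    for score, idx in rank_documents("football match", docs):
        print(f"{score:.3f}  {docs[idx]}")
```

A production system would more likely use TF-IDF weights and a proper tokenizer; raw counts are used here only to keep the sketch short.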

Authors and Affiliations

Jyoti Phogat, Atul Kumar

Download PDF file
  • EP ID EP391519
  • DOI 10.9790/9622-0706074650.

How To Cite

Jyoti Phogat, Atul Kumar (2017). Empirical Analysis of Document Similarity Using Statistical Model. International Journal of Engineering Research and Applications, 7(6), 46-50. https://europub.co.uk/articles/-A-391519