Survey on Data Preprocessing Concept Applicable in Data Mining

Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 2

Abstract

Abstract: Real world data is highly prone to outliers commonly known as data noise. This occurrence usually causes a problem of missing values or maybe data full of inconsistencies thus resulting to a poor quality data. Poor quality data is unreliable and fake since it never upholds data integrity issues. Principally, computer users wish to harvest data that is reliable and of high integrity and that’s where the concept of data preprocessing comes in since quality decisions are directly proportional to quality data. Data preprocessing deals with data preparation and data transformation, and seeks to improve the overall process of data mining and at the same time make the process of knowledge discovery more efficient. This paper therefore focuses on surveying different data preprocessing techniques as used in data mining, exhaustively outlining their major purposes in knowledge discovery process.

Authors and Affiliations

Keywords

Related Articles

Novel Scoring System for Identify Accurate Answers for Factoid Questions

Question and Answer System (QAS) are some of the many challenges for natural language understanding and interfaces. In this paper we have develop a new scoring mathematical model that works on the five types of questions...

Dispersion and Confinement Loss Analysis of Nonlinear Square Lattice Photonic Crystal Fibers Employing Air Holes in the Cladding Region

Dispersion and Confinement Loss Analysis of Nonlinear Square Lattice Photonic Crystal Fibers Employing Air Holes in the Cladding Region

Comparison of Various Short Range Wireless Communication Technologies with NFC

Comparison of Various Short Range Wireless Communication Technologies with NFC

Cloud-Based Mobile Multimedia to Design a Distributed Recommendation Cache

As we know that cloud is upcoming technology in the world. It provide several types of services for the user, one of them is Storage as a Service. Our project is related to the Saas services of the cloud. Multimedia is t...

Comparative Analysis of Cascaded H- Bridge Inverter and RV Inverter Topology

This paper presents a comparative analysis of seven levels Cascaded-H-bridge inverter and RV topology based on SPWM control techniques. Three distinctive major multilevel inverter is flying capacitor multilevel inverter...

Download PDF file
  • EP ID EP356625
  • DOI -
  • Views 91
  • Downloads 0

How To Cite

(2015). Survey on Data Preprocessing Concept Applicable in Data Mining. UNKNOWN, 4(2), -. https://europub.co.uk/articles/-A-356625