Data Quality in Data warehouse: problems and solution

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 1

Abstract

 In recent years, corporate scandals, regulatory changes, and the collapse of major financial institutions have brought much warranted attention to the quality of enterprise data if we can better understand the problems of quality issues, then we can develop a plan of action to address the problem that is both proactive and strategic. Each instance of a quality issue presents challenges in both identifying where problems exist and in quantifying the extent of the problems. Quantifying the issues is important in order to determine where our efforts should be focused. It is reported that more than $2 billion of U.S. federal loan money had been lost because of poor data quality at a single agency. It also reported that manufacturing companies spent over 25% of their sales on wasteful practices. Over the period of time many researchers have contributed to the data quality issues, but no research has collectively gathered all the causes of data quality problems at all the phases of data warehousing along with their possible solution. problems in different phase of data warehouse i.e.; data sources, data integration & data profiling, Data staging and ETL, data warehouse modeling & schema design are discussed in this paper. The purpose of the paper is to identify the reasons for data deficiencies, non-availability or reach ability problems at all the aforementioned stages of data warehousing and to give some classification of these causes as well as solution for improving data quality through Statistical Process Control (SPC),Quality engineering management . etc I have identified possible set of causes of data quality issues from the extensive literature review and with consultation of the data warehouse practitioners working in renowned IT company on India. I hope this will help developers & Implementers of warehouse to examine and analyze these issues before moving ahead for data integration and data warehouse solutions for quality decision oriented and business intelligence oriented applications.

Authors and Affiliations

Rahul Kumar Pandey

Keywords

Related Articles

 More General Sophisticated Method of Implementation of Fiber to the Homes

 Fiber to the Homes (FTTH) is one of the most important fiber optic applications, since FTTH provides huge bandwidth. The single fiber offering multi services such as :( Data, Voice, Video etc.).Comparing FTTH and c...

 Improvement of limited Storage Placement in Wireless Sensor  Network

 In sensor network a large amount of data need to be collected for future information retrieval. The data centric storage has become an important issue in sensor network. Storage nodes are used in this paper to &...

 Performance Analysis for Audio Streaming in Cloud

 Abstract: Audio streaming has been a major application, which has become popular in past few years especially amongst youth both in professional and personal lives. Various organizations to develop and deploythei...

Web-Based Decision Support System for Water Quality Monitoring and Prediction for Outdoor Microalgae Cultivation

Abstract: In outdoor microalgae cultivation, water quality monitoring is essential for identifying any existing problem or any issues that could emerge in the future. It is very helpful in maintaining the water parameter...

 High Speed and Time Efficient 1-D DWT on Xilinx Virtex4 DWT Using 9/7 Filter Based NEDA Technique

 In this paper, we describe an efficient Xilinx Virtix4 discrete wavelet transform (DWT) using 9/7 filter based new efficient distributed arithmetic (NEDA) Technique. We demonstrate that NEDA is a very efficie...

Download PDF file
  • EP ID EP152060
  • DOI -
  • Views 105
  • Downloads 0

How To Cite

Rahul Kumar Pandey (2014).  Data Quality in Data warehouse: problems and solution. IOSR Journals (IOSR Journal of Computer Engineering), 16(1), 18-24. https://europub.co.uk/articles/-A-152060