Application of imputation methods for missing values of PM10 and O3 data: Interpolation, moving average and K-nearest neighbor methods

Journal Title: Environmental Health Engineering and Management Journal - Year 2021, Vol 8, Issue 3

Abstract

Background: PIn air quality studies, it is very often to have missing data due to reasons such as machine failure or human error. The approach used in dealing with such missing data can affect the results of the analysis. The main aim of this study was to review the types of missing mechanism, imputation methods, application of some of them in imputation of missing of PM10 and O3 in Tabriz, and compare their efficiency. Methods: Methods of mean, EM algorithm, regression, classification and regression tree, predictive mean matching (PMM), interpolation, moving average, and K-nearest neighbor (KNN) were used. PMM was investigated by considering the spatial and temporal dependencies in the model. Missing data were randomly simulated with 10, 20, and 30% missing values. The efficiency of methods was compared using coefficient of determination (R2), mean absolute error (MAE) and root mean square error (RMSE). Results: Based on the results for all indicators, interpolation, moving average, and KNN had the best performance, respectively. PMM did not perform well with and without spatio-temporal information. Conclusion: Given that the nature of pollution data always depends on next and previous information, methods that their computational nature is based on before and after information indicated better performance than others, so in the case of pollutant data, it is recommended to use these methods.

Authors and Affiliations

Parisa Saeipourdizaj, Parvin Sarbakhsh, Akbar Gholampour

Keywords

Related Articles

Evaluation and spatial noise mapping using geographical information system (GIS): A case study in Zaria city, Kaduna State, Nigeria

Background: Spatial noise level mapping using a geographical information system (GIS) is essential for the visual colour representation of noise analysis, which is a necessity for strategic planning and mitigating meas...

Feasibility of natural wastewater treatment systems and life cycle assessment (LCA) for aquatic systems

Background: Natural wastewater treatment systems (NWTSs) in small villages are a major challenge for European water authorities. With growing social demands for environmental practices, evaluating the feasibility and e...

Face mask use among pedestrians during the COVID-19 pandemic in Northeast Iran: A survey on 223,848 pedestrians

Background: Despite the mass vaccination of people in countries, preventive health guidelines of coronavirus disease 2019 (COVID-19) are still one of the most critical factors for pandemic control. The objectives of th...

Determination of heavy metals including Hg, Pb, Cd, and Cr in edible fishes Liza abu, Brachirus orientalis and attributed cancer and non-cancer risk assessment

Background: Heavy metals are considered as pollutants polluting aquatic ecosystems because of their toxic effects and bioaccumulation in organisms. They can cause chronic poisoning when ingested by human. The present stu...

Biomedical waste disposal systems of health facilities in Ethiopia

Background: Biomedical waste generated from health and health-related activities can be grouped as general waste and hazardous waste. This remains true if and only if there is proper on-site handling, such as the segrega...

Download PDF file
  • EP ID EP696923
  • DOI 10.34172/EHEM.2021.25
  • Views 93
  • Downloads 0

How To Cite

Parisa Saeipourdizaj, Parvin Sarbakhsh, Akbar Gholampour (2021). Application of imputation methods for missing values of PM10 and O3 data: Interpolation, moving average and K-nearest neighbor methods. Environmental Health Engineering and Management Journal, 8(3), -. https://europub.co.uk/articles/-A-696923