A PRINCIPAL COMPONENT REGRESSION MODEL, FOR FORECASTING DAILY PEAK AMBIENT GROUND LEVEL OZONE CONCENTRATIONS, IN THE PRESENCE OF MULTICOLLINEARITY AMONGST PRECURSOR AIR POLLUTANTS AND LOCAL METEOROLOGICAL CONDITIONS: A CASE STUDY OF MAUN
Journal Title: International Journal of Applied Mathematics & Statistical Sciences - Year 2018, Vol 7, Issue 1
Abstract
The increasing public awareness of the negative health effects of exposure to peak air pollution levels, particularly to the most sensitive population sub-groups like children and the elderly, has made short-term forecasts of episodes of peak concentrations of air pollutants at a local level, increasingly necessary. The main objective of the present study is to develop a statistical model for predicting a day in advance the daily maximum 1- hour average ambient ground level ozone concentration for Maun town, using principal component regression (PCR) technique. The predictor variables are the precursor air pollutants of ground level ozone, namely, nitrogen oxides, nitrogen dioxide and the previous day’s ground level ozone concentration, on the one hand, and meteorological variables that include wind speed, wind direction, relative humidity, surface temperature, atmospheric pressure and solar radiation. The data consist of maximum 1-hour interval concentrations every day, on the response and each of these predictor variables collected from 1 May 2014 to 30 September 2015. A biased regression method of PCR is applied to try and minimise the problem of multicollinearity, usually associated with multiple regression models. The detection of multicollinearity is performed by using the Pearson partial correlation matrix, and variance inflation factor (VIF). Model assessment tools include the tests for significance of individual regression coefficients in the PCR model, the coefficient of determination and F test to test for the validity of the overall model. It is found that the estimated PCR model is based on principal components that are highly correlated with maxima of the ozone concentration the day before, nitrogen oxides concentrations and surface temperature. Furthermore, wind speed, wind direction, relative humidity and nitrogen dioxide are identified as possible causes of multicollinearity, in the available data.
Authors and Affiliations
W. M. Thupeng, T. Mothupi, B. Mokgweetsi
ANALYSIS OF MULTIPLE LINEAR REGRESSION MODELS USING SYMBOLIC INTERVAL-VALUED VARIABLES
In this paper, a proposed form of dependent regression models are introduced to study symbolic data. The estimation of the proposed linear regression models are based on interval valued data, for which we have lower and...
THERMAL RADIATION AND CONVECTIVE HEATING ON HYDROMAGNETIC BOUNDARY LAYER FLOW OF NANOFLUID PAST A PERMEABLE STRETCHING SURFACE
This research work studies the hydromagnetic boundary layer flow of Nanofluid past a permeable stretching surface with the introduction of both thermal radiation and Newtonian heating. The Nanoparticles considered here a...
INVARIANCE PROPERTIES AND ESTIMATING TASK SOLUTION OF BIOLOGICAL POPULATION IN THE TWO-DIMENSIONAL CASE
Considered methods of solving Kolmogorov - Fisher type task of reacting with diffusion, in the two-dimensional case, obtained invariant properties of solutions and two way estimation of the solution. Given are the result...
PROGRESSIVE ACHIEVEMENT OF HORTICULTURAL CROPS IN INDIA
Fruits, Vegetables, Flowers, Plantation crops and Spices are major Horticultural crops in India. The horticulture production is estimated as 283 million tonnes in year 2015- 16. China ranked first in fruit production wit...
PRE-HARVEST WHEAT YIELD FORECAST THROUGH AGRO-METEOROLOGICAL INDICES FOR NORTHERN REGION OF HARYANA
Parameter estimation in statistical modeling plays a crucial role in the real world phenomena. Several alternative analyses may be required for the purpose. An attempt has been made in this paper to assess the impact of...