Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probability Distributions

Journal Title: Annals. Computer Science Series - Year 2018, Vol 16, Issue 1

Abstract

This study uses simulated data from three probability distributions (gamma, beta and chi-square) to assess the prediction performance of partial least squares regression (PLSR), ridge regression (RR) and LASSO regression (LR). Ordinary least squares (OLS) may fail when the predictor variables are collinear or nearly so. The three methods were therefore compared on simulated data following gamma, beta and chi-square distributions, with P = 4 and 10 predictor variables and sample sizes n = 60 and 90. The comparison used mean squared log error (MSLE), mean absolute error (MAE) and R-squared (R2). With P = 4 and n = 60, RR performed better under the gamma distribution, while PLSR performed better under the chi-square distribution. With P = 4 and n = 90, RR performed better under both the gamma and beta distributions, while under the chi-square distribution all methods had equal predictive ability. With P = 10 and n = 60, RR performed better under both the gamma and chi-square distributions, while under the beta distribution all methods had equal predictive ability. With P = 10 and n = 90, RR performed better under both the gamma and chi-square distributions, while PLSR performed better under the beta distribution.
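
The kind of comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not give the simulation design, so everything specific here is an assumption: the way collinearity is induced (a shared gamma factor mixed into every predictor with weight `rho`), the true coefficients, the positive gamma noise on the response, the train/test split, and the penalty values. The function names (`make_collinear_gamma`, `fit_ridge`, `msle`, `mae`, `r2`) are hypothetical. Only closed-form ridge regression is shown, with penalty 0 as the OLS baseline; PLSR and LASSO are omitted.

```python
import numpy as np


def make_collinear_gamma(n=60, p=4, rho=0.95, seed=0):
    """Simulate n rows of p nearly collinear predictors built from gamma draws.

    Assumed design (not from the paper): each column mixes one shared gamma
    factor with its own gamma noise, so columns are strongly correlated.
    """
    rng = np.random.default_rng(seed)
    shared = rng.gamma(shape=2.0, scale=1.0, size=(n, 1))
    noise = rng.gamma(shape=2.0, scale=1.0, size=(n, p))
    X = rho * shared + (1.0 - rho) * noise
    beta = np.ones(p)  # hypothetical true coefficients
    # Positive noise keeps y > 0, so the log-based error below is defined.
    y = X @ beta + rng.gamma(shape=2.0, scale=0.25, size=n)
    return X, y


def fit_ridge(X, y, lam):
    """Closed-form ridge fit on centered data; lam = 0 reduces to OLS."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    coef = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X.shape[1]), Xc.T @ yc)
    intercept = y_mean - x_mean @ coef
    return coef, intercept


def msle(y, y_hat):
    """Mean squared log error; predictions are clipped at 0 (assumes y >= 0)."""
    y_hat = np.clip(y_hat, 0.0, None)
    return float(np.mean((np.log1p(y) - np.log1p(y_hat)) ** 2))


def mae(y, y_hat):
    """Mean absolute error."""
    return float(np.mean(np.abs(y - y_hat)))


def r2(y, y_hat):
    """Coefficient of determination."""
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


if __name__ == "__main__":
    X, y = make_collinear_gamma(n=60, p=4)
    # Assumed split: first 40 rows for fitting, last 20 for evaluation.
    X_tr, y_tr, X_te, y_te = X[:40], y[:40], X[40:], y[40:]
    for lam in (0.0, 1.0):  # 0.0 is the OLS baseline
        coef, intercept = fit_ridge(X_tr, y_tr, lam)
        pred = X_te @ coef + intercept
        print(f"lambda={lam}: MSLE={msle(y_te, pred):.4f} "
              f"MAE={mae(y_te, pred):.4f} R2={r2(y_te, pred):.4f}")
```

Swapping the `rng.gamma` calls for `rng.beta` or `rng.chisquare` draws, and changing `n` and `p`, would give analogous scenarios for the other settings named in the abstract; the exact results depend on these assumed choices and are not expected to reproduce the paper's figures.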

Authors and Affiliations

ZAKARI Yahaya ZAKARI, S. A. Yau, U. USMAN

EP ID: EP521332

How To Cite

ZAKARI Yahaya ZAKARI, S. A. Yau, U. USMAN (2018). Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probability Distributions. Annals. Computer Science Series, 16(1), 15-21. https://europub.co.uk/articles/-A-521332