Breast Cancer Disease Prediction Using Random Forest Regression and Gradient Boosting Regression
Journal Title: International Journal of Experimental Research and Review - Year 2024, Vol 38, Issue 2
Abstract
The current research endeavors to evaluate the efficacy of regression-based machine learning algorithms through an assessment of their performance using diverse metrics. The focus of our study involves the implementation of the breast cancer Wisconsin (Diagnostic) dataset, employing both the random forest and gradient-boosting regression algorithms. In our comprehensive performance analysis, we utilized key metrics such as Mean Squared Error (MSE), R-squared, Mean Absolute Error (MAE), and Coefficient of Determination (COD), supplemented by additional metrics. The evaluation aimed to gauge the algorithms' accuracy and predictive capabilities. Notably, for continuous target variables, the gradient-boosting regression model emerged as particularly noteworthy in terms of performance when compared to other models. The gradient-boosting regression model exhibited remarkable results, highlighting its superiority in handling the breast cancer dataset. The model achieved an impressively low MSE value of 0.05, indicating minimal prediction errors. Furthermore, the R-squared value of 0.89 highlighted the model's ability to explain the variance in the data, affirming its robust predictive power. The Mean Absolute Error (MAE) of 0.14 reinforced the model's accuracy in predicting continuous outcomes. Beyond these core metrics, the study incorporated additional measures to provide a comprehensive understanding of the algorithms' performance. The findings underscore the potential of gradient-boosting regression in enhancing predictive accuracy for datasets with continuous target variables, particularly evident in the context of breast cancer diagnosis. This research contributes valuable insights to the ongoing exploration of machine learning algorithms, providing a basis for informed decision-making in medical and predictive analytics domains.
Authors and Affiliations
Pradeep Yadav, Chandra Prakash Bhargava, Deepak Gupta, Jyoti Kumari, Archana Acharya, Madhukar Dubey
The importance of financial liquidity analysis in an enterprise
The article attempts to present the importance of financial liquidity analysis and its impact on financial condition of an enterprise. It introduces financial analysis tools and effects of losing of financial liquidity i...
Study on the toxicity of neem (Azadirachta indica A. Juss) leaf extracts as phytopiscicide on three life stages of Mozambique tilapia (Oreochromis mossambicus Peters) with special reference to their ethological responses
Acute toxicity of leaf extracts of neem (Azadirachta indica A. Juss) on three different stages of fresh water weed fish Oreochromis mossambicus was investigated in the present study. The 24, 48, 72 and 96 h LC50 values f...
Documentation and diversity analysis by DNA fingerprinting of the indigenous Mango (Mangifera indica L.) germplasm of West Bengal
Mango (Mangifera indica L.) has been reported to have extensive diversity due to alloploidy, outbreeding, continuous grafting and phenotypic differences arising from varied agro climatic conditions in different mango gro...
Automatic ECG Arrhythmia Recognition using ANN and CNN
Present research highlights the need for more patient-oriented monitoring systems for cardiac health, especially in the aftermath of COVID-19. The study introduces a contactless and affordable ECG device capable of recor...
Ethnicity and Scientific validation of West Bengal Amla (Phyllanthus emblica L.) with special reference to GC-MS screening
The present investigation was carried out to analyze the active constituents present in fruit of Phyllanthus emblica L. ( Phyllanthaceae) consumed by the tribal people of west Bengal for the treatment of various disease...