MCIP: Mining Crop Image Data On pysparkdataframe Using Feature Selection and Cluster Based Techniques
Journal Title: International Journal of Experimental Research and Review - Year 2023, Vol 34, Issue 5
Abstract
Crop-related problems such as pests and diseases in India lead to yearly losses exceeding $500 billion. Leaf blight is identified as the principal factor responsible for the substantial financial losses amounting to $500 billion. Farmers engaged in the cultivation of forage and grain sorghum experience the greatest degree of hardship. This disease has a significant impact on various crops, including maize, rice, tomato, potato, millet, and onion. The timely detection and evaluation of disease in plants can contribute to mitigating the extent of associated losses. However, the task presents difficulties as a result of variations in crop species, varieteis of crop diseases, and environmental factors. The current methodologies lack generalizability in their ability to classify and predict diseases. All of the techniques employed in this study are applied to a dataset with predetermined input values and corresponding output values. The current methodologies involve preprocessing the images and performing segmentation for extracting the appropriate characteristics. The process of segmentation necessitates the implementation of pre-processing techniques, such as dilation and edge detection. As a consequence, the loss of crucial information occurs, which subsequently leads to inaccurate classification. Furthermore, the methodologies employed thus far have not been designed to evaluate the performance of the algorithm on specialised or specific datasets. Deep learning methodologies are susceptible to the issue of overfitting. This paper proposed an approach for extracting and analysing crop image data using the PySpark (MCIP) data frame. The MCIP framework employs Principal Component Analysis (PCA) as a method for selecting pertinent features. The PCA features that have been gathered are subsequently employed to identify homogeneous subgroups through the utilisation of the K-means algorithm. The utilisation of a categorised predictive output facilitates the identification and detection of diseases present in potato leaves. The utilisation of the Multispectral Crop Imaging Platform (MCIP) extends beyond the examination of potatoes exclusively, as it possesses the capability to identify diseases present in the foliage of various agricultural crops. In order to validate our assertion, we conducted an experiment utilising the MCIP algorithm on a dataset pertaining to rice diseases. In order to assess the robustness of MCIP, we conducted an evaluation of its Accuracy, Silhouette score, speed, and F1 score. The MCIP model demonstrated high performance in terms of both speed and accuracy compared to existed approaches. The level of accuracy is remarkably near 100 percent.
Authors and Affiliations
yashi chaudhary, Heman Pathak
Optimization and Removal of Heavy Metals from Groundwater Using Moringa Extracts and Coconut Shell Carbon Powder
This study focuses on enhancing the efficacy and elimination of heavy metals from groundwater by employing bio-absorbents generated from Moringa extracts and Coconut shell carbon powder. The green synthesis technique was...
A Bibliometric Analysis of Bougainvillea Plant: Research Trends, Geographic Distribution and Future Direction
The main aim of this paper is to conduct an exhaustive bibliometric analysis of Bougainvillea. A total of 624 publications on Bougainvillea were identified from Scopus data ranging from 1937 to 2024. The dataset download...
Assessment of Women's Online Shopping Behavior in India: Model Design and Analysis
The exponential increase in internet usage in India has driven the swift growth of e-commerce, with women playing a crucial role in this expanding digital economy. This research presents a thorough literature analysis an...
Monitoring and assessment of flood risk in lower Damodar basin of Bengal delta, India
The river Damodar is known as ‘sorrow of Bengal’ due to its flood ravages in entire Damodar valley caused much unhappiness and distress in lower Damodar region. The intense rainfall during monsoon and discharge from upla...
Biochemical profile of Cashew nut
Cashew is a kidney-shaped nut that commercially grows on a tropical evergreen tree. In recent times, the commercial importance of cashew nut and apple in terms of human health is gaining great momentum. The kernels of 75...