Smart irrigation system and early plant disease detection using IoT and novel non-linear growing self-organizing map based artificial neural network
Deepthi Gorijavolu, Kapil Sharma, N. Srinivasa Rao

TL;DR
This paper introduces an IoT-based smart irrigation system and a novel neural network for early detection of sugarcane leaf diseases, improving crop yield and reducing false positives.
Contribution
A novel non-linear growing self-organizing map-based artificial neural network for plant disease detection and irrigation optimization.
Findings
The system achieves 95.6% accuracy in sugarcane disease detection.
False positives are reduced by 18.3% compared to traditional methods.
Early diagnosis combined with irrigation improves output projections by 22.4%.
Abstract
The safety of the global food supply depends heavily on effective crop management, making early diagnosis of plant diseases vital for improving agricultural productivity. This proposal outlines the development of an intelligent irrigation system that utilizes machine learning and the Internet of Things (IoT) for the early detection of sugarcane leaf diseases and assessment of their impact on crop yield. The system gathers and analyzes data on soil temperature, humidity, and leaf characteristics—specifically changes in texture and color—using high-resolution photography from unmanned aerial vehicles (UAVs) and IoT-connected sensors. To enhance feature extraction and classification, the system employs a non-linear growing self-organizing map (NG-SOM) embedded within the hidden layers of an artificial neural network (ANN). This advanced model effectively identifies complex patterns in the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Remote Sensing in Agriculture · Plant Surface Properties and Treatments
Introduction
The faster advancements in agricultural technology allow one to incorporate novel ideas aiming at improved crop management and disease prevention^1,2^. Given the increasing difficulties brought about by soil degradation, climate change, and rising food demand, effective plant disease detection systems have become absolutely critical^3^. Recent studies have made clear the opportunities for real-time monitoring of plant condition using Internet of Things (IoT) devices and machine learning algorithms. Along with early disease diagnosis, this enables most efficient resource management. Machine learning models, especially those using non-linear techniques, have shown interesting results in the detection and classification of plant diseases based on sensor data including temperature, humidity, soil moisture, and images taken by Unmanned Aerial Vehicles (UAVs)^3^.
Despite these advances, several challenges remain in the effective implementation of such systems. Extraction of relevant features from a large spectrum of data sources, including images of plant leaves and environmental variables, qualities required for disease prediction, is one of the most challenging tasks. Moreover, creating significant computational and processing challenges is the combination of high-dimensional data from UAV images and soil sensors with real-time IoT^4–7^ data. Many diseases have non-linear nature that makes the diagnosis challenging since several disease patterns are not linearly separable. Particularly with regard to the provision of accurate forecasts that enables the implementation of timely intervention actions, where researchers find a great difficulty in managing these complexities^8^.
This project intends to address the requirement for an integrated system capable of identifying sugarcane illnesses and estimating crop production by merging IoT sensors with high-resolution photos obtained by UAVs. Current solutions^9^ depend on discrete data sources, resulting to less accurate forecasts. Additionally, the absence of appropriate feature extraction and disease classification techniques further restricts the capacity to diagnose sugarcane diseases in their early stages. This, in turn, decreases the system’s ability to optimize watering techniques and minimize production loss. Therefore, accurate forecasts and the promotion of sustainable agricultural practices need an innovative system that incorporates different data sources^10–13^.
The major purpose is to build a smart irrigation system for early sugarcane disease detection utilizing a Machine Learning (ML) algorithm and IoT technology. The major purpose is to establish a framework that allows the collecting and analysis of data on temperature, humidity, soil moisture, and high-resolution pictures taken by UAVs. This framework will promote the seamless integration of environmental parameters, allowing for more exact prediction of sugarcane disease start. The second objective is the development and application of non-linear ML models for sugarcane disease classification and feature extraction, specifically through the implementation of advanced techniques such as a non-linear growing self-organizing map (NG-SOM) within Artificial Neural Networks (ANN).
The originality of our study comes in merging cutting-edge non-linear analytic methodologies, data from IoT sensors, and high-resolution photos from UAVs for sugarcane disease diagnosis and yield prediction. Unlike existing approaches, which often depend on a single data stream or simpler linear models, the suggested methodology is predicted to boost prediction accuracy substantially. Furthermore, this work contributes the following:
- The design of a robust IoT-enabled framework for sugarcane disease detection.
- The development of advanced ML models for feature extraction and classification.
- The validation of the proposed system against existing methods, demonstrating its superiority in terms of accuracy, precision, and other relevant performance metrics.
- The predicted output of this system promotes sustainable agricultural practices, increases sugarcane crop management, and offers farmers with real-time, actionable insights—ultimately enhancing farming efficiency.
After the introduction part, Sect. 2 discusses the related work, Sect. 3 highlights the proposed method in details, Sect. 4 shows the performance evaluation, and Sect. 5 concludes the presented work following by the references.
Related works
Following are the studies that shows an impact of IoT/ML in agriculture, which enables a smarter, efficient, and sustainable crop monitoring, disease prediction, irrigation, and resource management practices.
IoT-enabled systems for smart agriculture
IoT-based systems are extensively utilized in agriculture to enhance productivity and sustainability. A trained Convolutional Neural Network (CNN) model in^14^ facilitates the analysis of crop images captured by a solar sensor node integrated with a soil moisture sensor, microcontroller, and smartphone application, achieving an accuracy of 99.24% in plant disease prediction. Similarly^15^, demonstrates automation in greenhouse management through IoT integration for monitoring environmental conditions, irrigation management, and disease detection on leaf datasets, validating a smart greenhouse system design. The CROPCARE system in^16^ integrates mobile vision, IoT, and Google Cloud services, employing SRCNN and MobileNet-V2 for detecting crop diseases, along with additional functionalities like soil and weather condition analysis. Its support for multilingual dictionaries aids accessibility for diverse farmers, with performance validated using the PlantVillage dataset. In^18^, a LoRa-based ML irrigation system reduces water usage by 46%, using soil moisture sensors to schedule precise irrigation for crops like tomatoes and eggplants. Results showed better plant health with minimal water consumption, indicating the system’s efficiency.
Machine learning techniques in agriculture
Machine learning (ML) models are pivotal in modern agriculture, as evidenced by^17^, which employs ML models such as KNN, Logistic Regression, Neural Networks, SVM, and Naïve Bayes for data collected via IoT sensors. KNN outperformed other models with a 98.3% recognition rate, facilitating real-time supervision of environmental factors through a web application. Containerized microservices in^19^ enable lightweight CNN models like MobileNet for real-time disease and irrigation prediction on edge devices, achieving 99.87% accuracy on potato crop datasets. This eliminates the need for cloud-based processing, reducing network congestion and improving efficiency. In^20^, hybrid deep learning models like InceptionV3 with LSTM and VGG16 with dense layers are employed for anomaly detection and classification tasks on datasets like soil type and dry beans. MobileNetV2 attained an accuracy of 97% in soil type classification, highlighting its robustness. IoT-enabled systems in^21^ leverage K-means and SVM hybrid classifiers for clustering and classifying soil and humidity data, achieving an accuracy rate of 98.5%, optimizing water usage, and enhancing irrigation management.
Advanced frameworks for sustainable agriculture
Innovative systems further refine agriculture with advanced frameworks. A hybrid ML approach combining SVM and Decision Tree classifiers with Artificial Algae Optimization in^23^ achieves 99.5% accuracy in predicting soil wetness for smart irrigation, outperforming Random Forest, Fuzzy Logic, and KNN. The system in^22^ integrates multiple sensors to monitor environmental factors like soil moisture, wind speed, and temperature, with ML models like KNN and SVM achieving accuracy rates of up to 99.5% for automated irrigation decisions. Finally^24^, proposes a two-module system that utilizes IoT-based real-time data and ensemble ML algorithms to suggest suitable crops for maximizing yield, ensuring low-cost, efficient solutions for farmers. Results confirm its efficacy in predicting optimal crop types, contributing to sustainable farming practices. Table 1 shows the summarize the advanced frameworks for sustainable agriculture.
Table 1. Summary of existing state-of-the-art methods.Refs.MethodAlgorithmMethodologyOutcomes ^14^ Crop disease detectionCNNReal-time image analysis via solar sensor nodes with developed soil moisture sensors and smartphone app integration.Achieved 99.24% accuracy in plant disease prediction, demonstrating robust field performance. ^15^ IoT-enabled greenhouse automationIoT and ML algorithmsSmart greenhouse for monitoring environment, irrigation, image collection, and disease prediction.Effective greenhouse monitoring and leaf disease detection validated with collected datasets. ^16^ CROPCARE—Crop disease detection systemSRCNN, MobileNet-V2IoT-integrated mobile application with cloud-based disease detection and multilingual support.Achieved high performance using the PlantVillage dataset. ^17^ Smart irrigation systemKNN, LR, NN, SVM, Naïve BayesNode-RED and MongoDB-based IoT sensors monitor humidity, temperature, and rain to predict irrigation needs.KNN performed best with 98.3% accuracy and RMSE of 0.12. ^18^ IoT-based precision irrigationLoRa-enabled MLSoil moisture sensors collected data to minimize water usage in tomato and eggplant irrigation.Achieved 46% water usage reduction compared to traditional methods. ^19^ Containerized microservices for MLAlexNet, MobileNet, VGG16, SVM, LRDeployed lightweight CNN models on edge nodes for real-time disease and irrigation prediction in potato crops.Achieved 99.87% CNN accuracy and reduced network congestion. ^20^ Hybrid IoT-ML model for anomaly detectionMLP, SVM, NB, MobileNetV2, InceptionV3, VGG16Combined neural networks and Random Forest for anomaly detection and soil type classification using multiple datasets.Achieved 97% accuracy in soil type classification with MobileNetV2. ^21^ IoT-integrated irrigation systemK-means, SVM, RF, NBCollected soil and humidity data, performed clustering and classification, and implemented a hybrid K-means SVM classifier.Achieved 98.5% accuracy in predicting water demand. ^22^ Automated irrigation monitoring systemKNN, LR, NB, SVMIoT sensors collected and analyzed data on ThingSpeak cloud for soil and environmental monitoring.SVM achieved 99.5% accuracy with 0.5 RMSE. ^23^ Soil wetness predictionSVM, DT, Artificial Algae OptimizationHybrid SupportTree Algae Algorithm combining SVM and DT for soil wetness prediction.Achieved 99.5% accuracy and improved specificity, precision, and sensitivity. ^24^ Crop recommendation systemEnsemble Learning, ML algorithmsIntegrated static and IoT-based real-time data for crop prediction using Python.Demonstrated efficient crop prediction with validated accuracy.
Despite significant progress, challenges remain in deploying these methods at scale. Many systems are resource-intensive, requiring specialized sensors or extensive computational capabilities, which limit their accessibility for small-scale farmers. Additionally, model generalizability to diverse environmental and crop conditions is a concern, as most systems are validated on specific datasets or regions. Integration of multi-language support and cost-effective solutions that cater to low-resource settings remains a gap in existing research.
Proposed method
To identify sugarcane illnesses early and assess their influence on crop productivity, the suggested system incorporates IoT sensors, UAV photos, and a customized ML algorithm, as demonstrated in Fig. 1. The IoT devices monitor sugarcane leaf attributes and ambient factors, including soil temperature and humidity. Meanwhile, high-resolution UAV photos record leaf patterns and color variations. The data preparation phase consists of three steps: noise reduction, normalization, and enhancement. By implementing an NG-SOM architecture inside the hidden layers of an ANN, this study optimizes feature learning, dynamically modifying its structure to capture non-linear patterns, hence allowing effective categorization of sugarcane plant health stages. The system’s capacity to identify sugarcane disease types and estimate their potential influence on crop output permits early interventions in irrigation and disease control, making sustainable and precision farming more practical.
Fig. 1. Proposed NG-SOM based ANN for disease prediction and crop yield.
Data collection
The data collecting phase uses IoT sensors and UAV photography to acquire complete environmental and sugarcane plant health data. This approach consists of two important components: UAVs, which record high-resolution photographs of sugarcane leaves, and IoT devices in the field, which continually monitor soil and plant characteristics. The obtained data is critical for early sugarcane disease identification and yield prediction. IoT sensors are deployed in the agricultural field to detect real-time soil temperature (Ts), soil humidity (Hs), and sugarcane leaf characteristics such as shadow fluctuations (Cp). These sensors communicate the gathered data at one-hour intervals to a centralized system. Refer to Table 2, which provides examples of IoT sensor data gathered over three hours from three separate field sites.
Table 2. IoT sensor data collected over three hours in three different locations.TimestampLocationSoil temperature (T_s_)Soil humidity (H_s_)Leaf shade (C_p_)08:00
27 °C62%Dark Green08:00
29 °C58%Yellowish Green08:00
28 °C65%Light Green09:00
28 °C63%Dark Green09:00
30 °C57%Yellowish Green09:00
29 °C66%Light Green
High-resolution UAV developed with near-infrared and RGB cameras can capture plant leaf detail. These images show early disease indicators such changes in texture, discoloration, or lesions. Geotagging helps the images to be linked with particular sensor readings. Table 3 shows the metadata connected to UAV images:
Table 3UAV imagery Metadata.Image IDTimestampLocationRGB ImageNDVI
08:00Field1Available0.84
08:00Field2Available0.65
08:00Field3Available0.72
The approach includes syncing and transferring pictures and sensor data to a cloud-based platform for analysis and storage. By merging real-time environmental information with high-resolution photos, the system may offer significant insights for early sugarcane disease identification and crop production optimization.
Preprocessing
Preprocessing is a crucial first step to ensure that the collected data is clean, standardized, and ready for subsequent feature extraction and analysis. This phase involves noise elimination, normalization of sensor data, and enhancement of UAV imagery to improve the accuracy of sugarcane disease detection and crop yield prediction.
Noise removal
Table 4 shows the time environmental factors or device constraints produce noise in sensor data. Sensor data is filtered using median filtering; UAV images are noise-free using a Gaussian filtering technique instead. Using variation smoothing, Gaussian filtering aids to improve image clarity using Eq. (1).
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I^{\prime}(x,y)=\sum\limits_{{i= - k}}^{k} {\sum\limits_{{j= - k}}^{k} I } (x+i,y+j) \cdot G(i,j,\sigma )$$\end{document}Where:
I(x,* y*) - input image intensity,
I′(x,* y*) - filtered image,
G(i,* j*,σ) is the Gaussian kernel with standard deviation σ.
Table 4. Preprocessed image data.Image IDOriginal NDVIFiltered NDVI
0.840.85
0.650.67
0.720.73
Normalization of sensor data
Min-max normalizing helps to adjust for scale changes between variables including soil temperature (T_s), humidity (Hs), and leaf shade (Cp_). This computation in Eq. (2) helps to find the normalized value x′ (shown in Table 5):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x^{\prime}=\frac{{x - {x_{{\mathrm{min}}}}}}{{{x_{{\mathrm{max}}}} - {x_{{\mathrm{min}}}}}}$$\end{document}This scales all values to the range [0, 1] ensuring constant input for ML.
Table 5. Normalization of sensor data.LocationRaw T_s_ (°C)Normalized T_s_′Raw H_s_ (%)Normalized H_s_′Field1
270.33620.67Field2
290.67580.33Field3
280.50651.00
Feature enhancement for UAV image
The proposed research enhances image contrast to identify sugarcane disease features, such as discoloration and lesions, using contrast-limited adaptive histogram equalization (CLAHE). This process results in an improved image I_e_(x, y). Table 6 presents the synchronization of data into a unified dataset after noise removal, normalization, and enhancement.
Table 6. Synchronization into unified dataset.Disease typeNormalized T_s_′Normalized H_s_′Filtered NDVIEnhanced ImageBacterial leaf blight0.330.670.85
Brown spot0.670.330.67
Leaf smut0.501.000.73
The data is ready for feature extraction following preprocessing, so ensuring a low error count and increased relevance to the diagnosis of diseases.
Feature extraction
Feature extraction is the process of translating raw data into a collection of meaningful qualities suited for ML analysis. The study of UAV photos and data from IoT sensors helps the model find crucial indications for sugarcane disease identification via feature extraction. These indications include texture, color gradients, and environmental variables, which constitute the basis for estimating crop production and enable correct categorization of sugarcane plant health.
Texture analysis
Statistical methods based on gray-level co-occurrence matrix (GLCM) extracts texture features from UAV images. Apart from obtaining the spatial interactions between pixel intensities, GLCM generates important statistical features including homogeneity, contrast, correlation, and energy. For a given GLCM P(i,* j*), the features are defined as:
- Contrast:
- Correlation:
- Energy:
- Homogeneity:
This step computes these features (Eq. (3)-Eq. (5)) for every UAV image in order to ground disease indicators on texture.
Color feature analysis
Considering the diseases that cause discoloration, sugarcane leaf shade variations and color gradients serve as reliable indicators. This stage extracts RGB color space features, including standard deviation, average intensity, and color histograms. For each channel (R, G, B), the mean intensity is computed using Eq. (6):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu (C)=\frac{1}{N}\sum\limits_{{i=1}}^{N} {{C_i}}$$\end{document}Where C∈{R,* G*,B}, and N is the total number of pixels. Similarly, the standard deviation captures the variation in Eq. (7):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma (C)=\sqrt {\frac{1}{N}\sum\limits_{{i=1}}^{N} {({C_i} - \mu (} C){)^2}}$$\end{document}Environmental data features
IoT sensor data provides environmental features, including soil humidity (H_s_) and temperature (T_s_). These features are correlated with indices derived from UAV imagery, such as the Normalized Difference Vegetation Index (NDVI), to assess the health condition of sugarcane plants. NDVI is calculated using Eq. (8):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{NDVI}}=\frac{{NIR - R}}{{NIR+R}}$$\end{document}Where NIR - near-infrared reflectance, and R - red reflectance. Low NDVI values indicate stress or disease; high NDVI values indicate good state of the vegetation.
Combined feature matrix
Features obtained together create a feature matrix F. This matrix has each row corresponding to a data sample, field location or UAV image, and each column represents a feature, much as in Table 7.
Table 7. Combined feature matrix.FeatureField1Field2Field3Contrast12.315.710.2Energy0.820.760.89Mean Green (G)125134112Std Red (R)20.318.122.7NDVI0.850.670.73Soil Temp (Ts)27 °C29 °C28 °C
Dimensionality reduction
Principal Component Analysis (PCA) or a similar approach is employed to decrease computational cost and duplication within the feature matrix. By maintaining just the most important components for categorization and prediction, the study assures efficiency. The assurance that feature extraction allows the system to capture all key characteristics of sugarcane plant health provides the backbone of the ML model.
Model training
Figure 2 shows an NG-SOM inside an ANN constitutes the backbone of the proposed model for the detection of sugarcane diseases. Using unsupervised learning, feature mapping is separated into a two-stage training procedure, whereas supervised learning is employed for sugarcane disease diagnosis and crop production prediction. The training procedure strengthens feature representation and optimizes the weights, hence boosting the chance of correct illness detection.
Fig. 2NGSOM-ANN.
Growing self-organizing map (GSOM)
The technique of feature mapping leverages GSOM as a preprocessor. By dynamically extending its structure to fit the complexity of input data, this method helps eliminate duplication and increase feature representation for sugarcane disease identification. GSOM facilitates modifications in neuron weights during training using a neighborhood function. For an input vector and a weight vector Wi of the winning neuron iii, the weights are changed as in Eq. (9):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${W_i}(t+1)={W_i}(t)+\alpha (t) \cdot {h_{i,j}}(t) \cdot (X - {W_i}(t))$$\end{document}Where:
t
Current iteration,
α(t): Learning rate,
hi, j(t): Neighborhood function for neuron i and its neighbor j,
X − Wi(t): Adjustment based on input difference.
Based on similar data point clusters, GSOM creates a mapped feature representation. In order for supervised learning, the GSOM output forms the input for the ANN.
ANN training
The ANN comprises of an input layer processing GSOM-refined data, various hidden layers, and an output layer for sugarcane disease classification. Each neuron in the hidden layers is supplied with a non-linear activation function to capture complicated relationships within the dataset. Typically, a Sigmoid or ReLU function is utilized, described mathematically as in Eq. (10):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x)=\frac{1}{{1+{e^{ - x}}}}$$\end{document}The forward pass helps the research to compute the outputs (Ok) for each layer using Eq. (11):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${O_k}=f\left( {\sum\limits_{i} {{W_{ik}}} \cdot {X_i}+{b_k}} \right)$$\end{document}Where:
W_ik_: Weights between input i and output k,
bk
Bias term,
X_i_: Input from GSOM or previous layer.
To lower the observed error level, the model training uses a loss function, such as categorical cross-entropy for multi-class classification. The loss is defined as in Eq. (12):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathcal{L}= - \frac{1}{N}\sum\limits_{{i=1}}^{N} {\sum\limits_{{j=1}}^{C} {{y_{ij}}} } \log ({\hat {y}_{ij}})$$\end{document}Where:
N
Number of samples,
C
Number of classes,
yij
True label for sample i and class j,
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat {y}_{ij}}$$\end{document} : Predicted probability for sample i and class j.
Using an optimization technique such as stochastic gradient descent (SGD), the research can reduce the loss to an allowable possible. Weight update calculations during backpropagation are calculated as in Eq. (13):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${W_{ik}}(t+1)={W_{ik}}(t) - \eta \frac{{\partial \mathcal{L}}}{{\partial {W_{ik}}(t)}}$$\end{document}Where η is the learning rate.
Each iteration of the training process continues until the model converges, signaling that accuracy no longer drops and the loss function has achieved a stable state. The final trained ANN model generates class probabilities for distinct sugarcane disease categories and evaluates the effect of identified illnesses on crop production.
Table 8. Output from model training.IterationLoss ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathcal{L}$$\end{document} )Accuracy (%)11.2462.5500.3785.31000.1295.7
Table 8 shows that the combination of GSOM and ANN ensures strong learning which helps to handle non-linear patterns in the data and the exact classification of plant health conditions.
Non-linearity and its influence on model training
In this framework, non-linearity plays a significant role in capturing nuanced connections between the inputs (such as environmental parameters like temperature and humidity, UAV-derived visual data like texture and color) and the outputs linked to sugarcane disease diagnosis. The complicated relationships between sugarcane plant health and various inputs may generate non-linear patterns. Beyond properly handling this complexity, the suggested approach employs non-linear modifications during model training. Activation factors in the deep layers of the neural network add non-linearity, guaranteeing the model can handle non-linear correlations needed for sugarcane disease classification. Without non-linearity, the network would operate as a basic linear model, regardless of its depth, limiting its capacity to identify complicated disease patterns in sugarcane leaves. By using non-linear activation functions like sigmoid, the network approximates complicated processes, boosting the accuracy of sugarcane disease diagnosis. The output of a neuron Oₖ in a particular layer is stated in Eq. (14):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${O_k}=f\left( {\sum\limits_{i} {{W_{ik}}} {X_i}+{b_k}} \right)$$\end{document}Where:
f(x): Non-linear activation function,
Wik
Weight between input i and output k,
Xi
Input to the neuron,
bk
Bias term.
The research uses sigmoid non-linear activation functions (ref Eq. (10)). Sigmoid functions map inputs into the range [0,1], hence they fit activities involving probability. Non-linearity enables the model to:
- Capture Complex Patterns: The model captures complex patterns is in their ability to map intricate dependencies between features, such how soil temperature and leaf discoloration together indicate disease.
- Enhance Feature Representations: Non-linear transformations expand the feature space, which enables the model to more precisely separate between classes.
Consider a scenario where the relationship between temperature (T) and disease severity (D) is quadratic in Eq. (15):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=\alpha {T^2}+\beta T+\gamma$$\end{document}Unlike non-linear networks, which can more accurately reproduce this pattern, linear models fail do so as in Table 9. Non-linear activation two layered neural network will produce the following output in Eq. (16):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O={f_2}\left( {{W_2}{f_1}\left( {{W_1}X+{b_1}} \right)+{b_2}} \right)$$\end{document}Where f1, f2 are non-linear activation functions, W1, W_2_ are weight matrices, b1, b2 are biases, and X is the input. The research can solve a multi-class classification problem using the categorical cross-entropy loss, shown in Eq. (12).
Table 9. Linear vs. non-linear model training.Input featuresTrue labelLinear model outputNon-linear model outputError (Linear)Error (Non-linear)Temp = 28 °C, NDVI = 0.7Healthy0.40.950.60.05Temp = 35 °C, NDVI = 0.3Diseased0.60.870.40.13
Table 9 shows that by providing a more accurate representation of the complex relationship between variables and results, the non-linear model considerably lowers the error.
Disease detection and yield prediction
The coupling of feature-based classification with regression allows the proposed technique to combine sugarcane disease detection and yield prediction into a single functional component. This technique allows reliable diagnosis of sugarcane diseases and gives insights into their possible influence on crop productivity. The sugarcane disease detection procedure leverages a trained neural network for classification, while yield prediction assesses the likely loss in sugarcane output depending on disease severity and environmental factors using a regression model. The collected characteristics from the preprocessing step serve as input to the trained ANN, enabling exact categorization of sugarcane illnesses. For an input vector, the ANN computes the probability P(Ck∣X) for each illness class Ck in Eq. (17):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P({C_k}|X)=\frac{{{e^{{Z_k}}}}}{{\sum\limits_{{j=1}}^{C} {{e^{{Z_j}}}} }}$$\end{document}Where:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Z_k}=W_{k}^{T}X+{b_k},$$\end{document}
Wk
Weight vector for class C_k_,
bk
Bias term for class Ck,
C
Total number of disease classes.
The class with the highest probability is selected in Eq. (18):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat {C}=\arg {\hbox{max} _k}P({C_k}|X)$$\end{document}The output is a disease label (e.g., “Leaf Spot” or “Blight”) based on the features analyzed. Yield prediction is modeled as a regression problem, estimating the reduction in crop yield (Yr) due to disease severity. The regression model incorporates features such as disease severity index (DSI) (ϕ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=\frac{I}{{{T_p}}}$$\end{document} ), temperature (T), soil moisture (M), and leaf health index (Φ). The predicted yield (Y_p_) is calculated as in Eq. (19):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y_p}=\hat {Y} - {Y_r}$$\end{document}Where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat {Y}$$\end{document} is the maximum potential yield without disease, and Yr is computed as in Eq. (20):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y_r}={\beta _1} \cdot \varphi +{\beta _2} \cdot T+{\beta _3} \cdot M+{\beta _4} \cdot \Phi +\epsilon$$\end{document}Where, β_1_,β_2_,β_3_,β_4_: Coefficients determined during training and ϵ: Error term.
The DSI is calculated using the ratio of infected pixels (I) to total pixels (Tp) in UAV images as in Eq. (21):
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varphi=\frac{I}{{{T_p}}}$$\end{document}For instance, if 300 out of 1000 pixels are infected, DSI = 0.3. This index is a critical input for yield prediction.
Table 10. Results of predicted yield loss and estimated yield.Input featuresPredicted diseaseDSIPredicted yield loss (%)Estimated yield (kg/ha)Temp = 30 °C, NDVI = 0.6Leaf Spot0.415850Temp = 35 °C, NDVI = 0.3Blight0.625750Temp = 28 °C, NDVI = 0.8Healthy0.001000
Table 10 shows that the system provides insights of crop production in addition to detecting plant condition by combining disease detection and yield prediction as shown in Table 10, aiding in timely intervention.
Performance evaluation
The suggested technique supports both machine learning and deep learning model training using Python and TensorFlow. OpenCV and Scikit-learn were applied for data preprocessing and feature extraction on the sugarcane leaf dataset^25,26^. By merging IoT sensor data with UAV photos, a simulation environment was developed utilizing high-resolution drone photographs and environmental monitoring tools. For accelerated calculations, the workstation comprised of an Intel Core i9 CPU, 64 GB of RAM, and a GPU. The proposed method was further compared with various state-of-the-art techniques, including MobileNet-V2^16,20^, Hybrid Support Tree Algae Optimization Algorithm (HSTAOA)^23^, Ensemble Learning-Based Crop Recommendation System (ELCRS), and LoRa-Enabled ML-Based Precision Irrigation (LoRa-MLPI)^18^.
As shown in Table 11, the dataset used in this study comprises 10,000 plant samples collected from three major sugarcane-growing regions of India — Maharashtra, Uttar Pradesh, and Tamil Nadu — ensuring broad geographical diversity in climatic and soil conditions. The dataset includes four widely cultivated sugarcane varieties (Co 86032, Co 0238, Co 740, and Co C671), each exhibiting varying susceptibility to common diseases such as red rot, smut, and wilt. To capture different disease progression stages, UAV images and IoT sensor readings were recorded at multiple intervals spanning early infection, moderate spread, and advanced disease stages. This multi-stage sampling enhances the model’s ability to generalize across phenological variations. For validating the performance gains of the Deep Ensemble Learning (DEL) model, statistical significance testing was conducted using the paired t-test between the proposed DEL approach and baseline models (CNN, Bi-LSTM, and GRU). The results demonstrated that the improvement in classification accuracy was statistically significant (p < 0.05), confirming the robustness and reliability of the reported performance metrics.
Table 11. Experimental setup and parameters.ParameterValueDataset Size10,000 plant samplesInput FeaturesTemperature, humidity, NDVI, UAV images, soil moistureNeural Network Architecture3 hidden layers with ReLU activationLearning Rate0.001OptimizerAdamBatch Size64Training Epochs100UAV Image Resolution1024 × 1024 pixelsIoT Sensor Sampling Interval5 min
To ensure data reliability, sensor drift and calibration errors were addressed through periodic recalibration using benchmark soil samples and controlled environmental baselines. Anomaly detection was implemented through z-score and moving median filtering to identify and remove outlier readings from temperature, humidity, and soil-moisture sensors. The filtered data were further validated through temporal consistency checks before being input to the NG-SOM layer. These measures significantly minimized the effect of sensor noise, ensuring that only stable and representative data contributed to model training and prediction.
Performance metrics.
- Accuracy: In relation to the overall sample count used for disease detection, the term “accuracy” characterizes the percentage of appropriately classified samples.
- 2.Precision: The precision is the percentage of disease-positive cases efficiently found among the overall expected positive cases. It is given by:
- 3.Recall (Sensitivity): Recall (Sensitivity) indicates the degree of ability of the model to find every real case positive for the disease. It is given by:
- 4.F1-Score: The harmonic mean of recall and accuracy, the F1-score helps to balance false positives with false negatives.
- 5.Mean Absolute Percentage Error (MAPE): In yield prediction, the MAPE is a statistic used to find the average percentage variation between the projected and real yields.
Performance over various data split
With a test accuracy of 96.7%, the proposed method exceeded LoRa-MLPI by 3.2% and ELCRS by 5.5%, which attains the best accuracy overall. It reveals not only outstanding generalizing capacity but also continuous performance over training, testing, and validation datasets as in Fig. 3.
Fig. 3. Accuracy.
Table 12. Precision.MethodTrain (%)Test (%)Validation (%)MobileNet-V285.683.281.5HSTAOA89.588.486.7ELCRS91.390.189.0LoRa-MLPI93.492.791.1Proposed96.195.894.2
On the test set in Table 12, the proposed method achieved a precision of 95.8%, much above LoRa-MLPI by 3.1% and ELCRS by 5.7%. Apart from its increased accuracy, it can also help to effectively reduce the false positive count.
Table 13. Recall.MethodTrain (%)Test (%)Validation (%)MobileNet-V282.480.578.9HSTAOA87.987.185.4ELCRS89.889.388.1LoRa-MLPI92.591.890.6Proposed95.494.993.7
With improvements of 3.1% on LoRa-MLPI and 5.6% on ELCRS, the proposed method displayed the highest degree of recall, that of 94.9%. This performance shows its capacity since it can efficiently identify real positives in Table 13.
Table 14F1-Score.MethodTrain (%)Test (%)Validation (%)MobileNet-V284.081.880.2HSTAOA88.787.786.0ELCRS90.589.788.5LoRa-MLPI92.992.291.0Proposed95.795.394.0
With a F1-Score of 95.3%, the proposed method exceeded LoRa-MLPI by 2.1% and ELCRS by 5.6%. The research can reach consistent classification on the test set by falsifying recall with accuracy in Table 14.
Table 15MAPE.MethodTrain (%)Test (%)Validation (%)MobileNet-V213.112.313.5HSTAOA11.010.511.7ELCRS10.39.810.6LoRa-MLPI9.18.79.5Proposed5.95.66.4
On the test set, the proposed approach got the lowest MAPE (5.6%) outperforming LoRa-MLPI by 3.1% and ELCRS by 4.2%. This implies that the yield projections are rather accurate and with reduced error rate in Table 15.
Performance over various features
Fig. 4. Accuracy over various features.
The proposed method achieved the highest accuracy in all categories. For the UAV images, as in Fig. 4, it scored 97.8%, a 3.6% increase above the LoRa-MLPI. This implies that the predictive capacities are better and that the generalization of the model is of a better quality over a wide spectrum of input data types.
Table 16. Performance of the various methods over different features.MethodTemperatureHumidityNDVIUAV ImagesSoil MoisturePrecision (%)MobileNet-V282.680.978.284.577.4HSTAOA88.386.785.089.883.1ELCRS90.188.887.291.385.6LoRa-MLPI92.391.189.693.490.2Proposed96.495.794.296.694.8Recall (%)MobileNet-V278.576.374.080.372.7HSTAOA83.782.080.186.179.4ELCRS86.284.383.088.482.0LoRa-MLPI89.588.086.491.285.3Proposed93.292.390.794.591.6F1-Score (%)MobileNet-V280.378.576.182.775.1HSTAOA86.384.382.688.381.1ELCRS88.486.685.189.984.2LoRa-MLPI90.989.588.092.788.6Proposed96.095.194.496.794.0MAPE (%)MobileNet-V214.515.216.313.817.1HSTAOA11.712.413.210.812.5ELCRS10.511.111.99.911.2LoRa-MLPI8.69.29.68.28.9Proposed5.35.76.24.55.3
The suggested technique attained the maximum accuracy across all categories, as shown in Table 16, notably excelling in sugarcane leaf photos collected by UAV, with a 96.6% accuracy rate. Compared to LoRa-MLPI, it achieved a 3.2% performance gain. The higher accuracy, along with fewer false positives, greatly enhances the diagnosis of illnesses in sugarcane leaves. Additionally, the suggested technique exceeded LoRa-MLPI by 3.3%, obtaining an astounding 94.5% recall on UAV photos of sugarcane leaves, suggesting its greater effectiveness in recognizing true positives—a vital feature in early disease identification. The F1-score of the suggested technique obtained 96.7%, beating LoRa-MLPI by 4.0% on UAV photos of sugarcane leaves, indicating the model’s well-balanced performance with few false positives and false negatives. Furthermore, with a Mean Absolute Percentage Error (MAPE) of 4.5% on UAV photos of sugarcane leaves, the suggested technique greatly beat LoRa-MLPI, which had a MAPE of 8.2%. This shows that the proposed approach may give more exact yield projections for sugarcane, hence minimizing estimated errors regarding the influence of leaf diseases on crop production. The proposed model has a computational complexity of O(n × m × d), where n is the number of samples, m is the number of input features, and d represents the depth of the neural network.
Conclusion
Using IoT technology and ML, the proposed Smart Irrigation System for sugarcane leaf disease detection surpassed current approaches across key performance metrics, demonstrating its amazing efficiency and accuracy in predicting crop production and identifying plant illnesses. The suggested technique obtained the greatest accuracy (97.6%) across all data types, including temperature, humidity, NDVI, UAV photos of sugarcane leaves, and soil moisture. Additionally, it improves upon LoRa-MLPI by 3.6%. Specifically, the suggested technique obtained 96.6% accuracy on UAV photos of sugarcane leaves, outperforming LoRa-MLPI by 3.2%, suggesting increased disease identification and a considerable decrease in false positives. Furthermore, for UAV photos of sugarcane leaves, the suggested technique displayed a 3.3% recall increase than LoRa-MLPI, obtaining 94.5%, which demonstrates its greater capacity to recognize true positives. The F1-score, which combines both recall and accuracy, also demonstrated a boost, with the suggested technique attaining 96.7% on UAV images—4.0% higher than LoRa-MLPI. This improvement shows that the model maintains a better balance between false positives and false negatives. Regarding MAPE, the suggested technique displayed the lowest error rate (4.5%) for UAV photos of sugarcane leaves, greatly surpassing LoRa-MLPI by 3.7%, further confirming its accuracy. From an agricultural standpoint, these data indicate the system’s efficacy in anticipating the impact of sugarcane leaf diseases on crop productivity. This demonstrates that the suggested methodology, which incorporates non-linear algorithms, feature extraction, and IoT-based data collecting, offers a considerable improvement over previous approach.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Sharma, K. & Shivandu, S. K. Integrating artificial intelligence and Internet of Things (Io T) for enhanced crop monitoring and management in precision agriculture. Sens. Int.5, 100292 (2024).
- 2Narayanappa, G. B. C. et al. Revolutionizing UAV: experimental evaluation of Io T-enabled unmanned aerial vehicle-based agricultural field monitoring using remote sensing strategy. Remote Sens. Earth Syst. Sci. 7 (4), 411–425 (2024).
- 3Selvanarayanan, R., Rajendran, S. & Alotaibi, Y. Early detection of Colletotrichum Kahawae disease in coffee Cherry based on computer vision techniques. Comput. Model. Eng. Sci. 139 (1), 759–782 (2024).
- 4Kotwal, J. G. et al. SADCC Net: Self-attention-based dense cascaded capsule network for bone cancer detection using deep learning approach. Iran J. Comput. Sci. 1–21 (2025).
- 5Sellam, V., Kannan, N., Pandi, S. S. & Manju, I. Enhancing sustainable agriculture using attention convolutional bidirectional gatedrecurrent based modified leaf in wind algorithm: integrating AI and Io T for efficient farming. Sustain. Comput. Inf. Syst. 101160 (2025).
- 6Upadhyay, N. & Gupta, N. Seg Learner: A segmentation based approach for predicting disease severity in infected leaves. Multim. Tools Appl. 1–24 (2025).
- 7Abdel-Basset, M., Hawash, H. & Abdel-Fatah, L. Artificial Intelligence and Internet of Things in Smart Farming (CRC, 2024).
- 8Rathore, N. & Rajavat, A. Smart farming based on IOT-Edge computing: applying machine learning models for disease and irrigation water requirement prediction in potato crop using containerized microservices. in Precision Agriculture for Sustainability 399–424 (Apple Academic, 2024).
