Efficient Deep Learning Ensemble of Lightweight CNNs and Vision Transformers for Real-Time Plant Disease Diagnosis
Abstract
Timely identification of plant diseases plays a vital role in protecting crop yield and supporting effective decision-making in precision agriculture. Conventional computer vision models achieve high recognition accuracy but often require substantial computing power, making them impractical for the low-cost edge hardware widely used in rural areas. In this work, a compact deep learning ensemble is presented that combines three lightweight convolutional neural networks (MobileNetV3-Small, EfficientNet-B0, and ShuffleNetV2) with a Vision Transformer (ViT-B/16). The models operate in parallel, and their outputs are merged using weighted late fusion, with the fusion weights determined through a systematic grid search to achieve the best trade-off between predictive performance and processing speed. The PlantVillage dataset, consisting of 54,303 images across 38 healthy and diseased leaf categories, was used for evaluation. To improve robustness, the training data were augmented through geometric transformations, contrast adjustment, and controlled noise addition. When tested on a Raspberry Pi 4 device, the ensemble reached an accuracy of 97.85%, precision of 97.67%, recall of 97.92%, and F1-score of 97.79%, with an average inference time of 20.5 ms and a total size of 14.6 MB. These results surpassed those of all individual models and conventional machine-learning baselines. McNemar's test confirmed that the improvement was statistically significant (p < 0.05). Precision–Recall analysis indicated strong resistance to false positives, while accuracy–latency assessment confirmed suitability for real-time field operation. The proposed system offers a practical, resource-efficient framework for on-site plant disease diagnosis in areas with limited connectivity and computing resources.
Further development will focus on adaptation to field-captured imagery, hardware-aware model compression, and the integration of additional sensing modalities such as hyperspectral and thermal imaging.
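The weighted late fusion with grid-searched weights described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`fuse`, `weight_grid`, `grid_search`), the simplex grid with a fixed step, and the use of validation accuracy as the only selection criterion are assumptions made for illustration. The abstract states that the search also accounts for processing speed, which is omitted here.

```python
import itertools
import numpy as np


def fuse(probs, weights):
    """Weighted late fusion of class probabilities.

    probs: array of shape (n_models, n_samples, n_classes).
    weights: one non-negative weight per model.
    Returns the fused scores with shape (n_samples, n_classes).
    """
    w = np.asarray(weights, dtype=float)
    return np.tensordot(w, probs, axes=1)


def weight_grid(n_models, steps):
    """Yield weight vectors on a regular grid that sum to 1 (assumed grid layout)."""
    for combo in itertools.product(range(steps + 1), repeat=n_models):
        if sum(combo) == steps:
            yield tuple(c / steps for c in combo)


def grid_search(probs, labels, steps=10):
    """Return the weight vector with the highest accuracy on held-out data."""
    labels = np.asarray(labels)
    best_w, best_acc = None, -1.0
    for w in weight_grid(probs.shape[0], steps):
        preds = fuse(probs, w).argmax(axis=1)
        acc = float(np.mean(preds == labels))
        if acc > best_acc:
            best_w, best_acc = w, acc
    return best_w, best_acc


# Hypothetical toy data: 2 models, 3 samples, 2 classes.
toy_probs = np.array([
    [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8]],  # model 0
    [[0.3, 0.7], [0.8, 0.2], [0.6, 0.4]],  # model 1
])
toy_labels = np.array([0, 1, 1])
best_w, best_acc = grid_search(toy_probs, toy_labels, steps=4)
```

In practice `probs` would hold the softmax outputs of the four ensemble members on a validation split, and the selection score could be extended with a latency term to reflect the accuracy–speed trade-off mentioned above.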
DOI: https://doi.org/10.52088/ijesty.v5i4.1347
Copyright (c) 2025 Mruna Dubey, P.S.G. Aruna Sri, Suresh Kumar Jha, Nupur, Girish Bhiogade, Neeraj Kumar