Fruit Deformity Classification through Single-Input and Multi-Input   Architectures based on CNN Models using Real and Synthetic Images

Tommy D. Beltran; Raul J. Villao; Luis E. Chuquimarca; Boris X.; Vintimilla; and Sergio A. Velastin

arXiv:2412.12966·cs.CV·December 18, 2024

Fruit Deformity Classification through Single-Input and Multi-Input Architectures based on CNN Models using Real and Synthetic Images

Tommy D. Beltran, Raul J. Villao, Luis E. Chuquimarca, Boris X., Vintimilla, and Sergio A. Velastin

PDF

TL;DR

This study develops CNN-based single-input and multi-input architectures utilizing real and synthetic images to accurately classify fruit deformities, with the multi-input MobileNetV2 model achieving the highest accuracy across multiple fruit types.

Contribution

It introduces a novel multi-input CNN architecture combining RGB and silhouette images, demonstrating improved deformity classification accuracy over traditional single-input models.

Findings

01

Multi-input MobileNetV2 achieved up to 94% accuracy for mangoes.

02

Synthetic images improved CNN model performance.

03

Multi-input architecture outperformed single-input models.

Abstract

The present study focuses on detecting the degree of deformity in fruits such as apples, mangoes, and strawberries during the process of inspecting their external quality, employing Single-Input and Multi-Input architectures based on convolutional neural network (CNN) models using sets of real and synthetic images. The datasets are segmented using the Segment Anything Model (SAM), which provides the silhouette of the fruits. Regarding the single-input architecture, the evaluation of the CNN models is performed only with real images, but a methodology is proposed to improve these results using a pre-trained model with synthetic images. In the Multi-Input architecture, branches with RGB images and fruit silhouettes are implemented as inputs for evaluating CNN models such as VGG16, MobileNetV2, and CIDIS. However, the results revealed that the Multi-Input architecture with the MobileNetV2…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDepthwise Convolution · Batch Normalization · Convolution · Pointwise Convolution · 1x1 Convolution · Depthwise Separable Convolution · Inverted Residual Block · Average Pooling