Exploring PCA-based feature representations of image pixels via CNN to enhance food image segmentation
Ying Dai

TL;DR
This paper introduces an unsupervised CNN-based PCA feature extraction method for food ingredient segmentation, achieving stable, interpretable results with minimal labeled data on FoodSeg103.
Contribution
It proposes a novel PCA-based feature representation approach using CNNs for improved, unsupervised food image segmentation, emphasizing optimal feature selection and clustering without dataset fine-tuning.
Findings
Principal component maps enhance clustering quality.
Eigenvalue count correlates with optimal cluster number.
Achieved 0.5423 mIoU on FoodSeg103 dataset.
Abstract
For open vocabulary recognition of ingredients in food images, segmenting the ingredients is a crucial step. This paper proposes a novel approach that explores PCA-based feature representations of image pixels using a convolutional neural network (CNN) to enhance segmentation. An internal clustering metric based on the silhouette score is defined to evaluate the clustering quality of various pixel-level feature representations generated by different feature maps derived from various CNN backbones. Using this metric, the paper explores optimal feature representation selection and suitable clustering methods for ingredient segmentation. Additionally, it is found that principal component (PC) maps derived from concatenations of backbone feature maps improve the clustering quality of pixel-level feature representations, resulting in stable segmentation outcomes. Notably, the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Chemical Sensor Technologies · Smart Agriculture and AI · Food Supply Chain Traceability
