Identifying Object States in Cooking-Related Images
Ahmad Babaeian Jelodar, Md Sirajus Salekin, Yu Sun

TL;DR
This paper introduces a novel approach to identify object states in cooking images, creating a dataset and employing deep learning models to improve state recognition for robotic applications.
Contribution
To the authors' knowledge, the paper is the first to explicitly address object state identification in cooking images. It proposes a ResNet-based model and fine-tuning strategies to improve accuracy.
Findings
The ResNet-based model achieves high accuracy on the eleven-class state dataset.
Fine-tuning significantly improves object-specific state recognition.
The dataset and methodology support future research in robotic manipulation.
Abstract
Understanding object states is as important as object recognition for robotic task planning and manipulation. To our knowledge, this paper explicitly introduces and addresses the state identification problem in cooking-related images for the first time. In this paper, objects and ingredients in cooking videos are explored and the most frequent objects are analyzed. Eleven states from the most frequent cooking objects are examined and a dataset of images containing those objects and their states is created. As a solution to the state identification problem, a ResNet-based deep model is proposed. The model is initialized with ImageNet weights and trained on the dataset of eleven classes. The trained state identification model is evaluated on a subset of the ImageNet dataset and state labels are provided using a combination of the model with manual checking. Moreover, an individual model…
Taxonomy
Topics: Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Advanced Chemical Sensor Technologies
Methods: Average Pooling · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
