Master's Thesis : Deep Learning for Visual Recognition
R\'emi Cad\`ene, Nicolas Thome, Matthieu Cord

TL;DR
This thesis explores deep learning methods for visual recognition, focusing on low-data scenarios, introducing new models, and providing a flexible framework for training and testing CNNs across various datasets.
Contribution
It introduces novel CNN techniques for low-data visual recognition and develops a versatile Torch7-based framework for training deep models on diverse datasets.
Findings
Achieved high accuracy on food recipe dataset with 100k images
Won the DSG online satellite image classification challenge with 6,000 images
Reviewed state-of-the-art CNN architectures and training techniques
Abstract
The goal of our research is to develop methods advancing automatic visual recognition. In order to predict the unique or multiple labels associated to an image, we study different kind of Deep Neural Networks architectures and methods for supervised features learning. We first draw up a state-of-the-art review of the Convolutional Neural Networks aiming to understand the history behind this family of statistical models, the limit of modern architectures and the novel techniques currently used to train deep CNNs. The originality of our work lies in our approach focusing on tasks with a low amount of data. We introduce different models and techniques to achieve the best accuracy on several kind of datasets, such as a medium dataset of food recipes (100k images) for building a web API, or a small dataset of satellite images (6,000) for the DSG online challenge that we've won. We also draw…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Advanced Neural Network Applications
