An embarrassingly simple comparison of machine learning algorithms for indoor scene classification
Bhanuka Manesha Samarasekara Vitharana Gamage

TL;DR
This paper compares five machine learning algorithms for indoor scene classification, analyzing feature extractor impacts, and proposes a simple MnasNet-based system achieving 72% accuracy with low latency.
Contribution
It provides a straightforward comparison of classifiers and feature extractors, and introduces a lightweight MnasNet-based indoor scene classification system.
Findings
MnasNet-based system achieves 72% accuracy
Low latency feature extractors perform competitively
Comparison highlights pros and cons of classifiers
Abstract
With the emergence of autonomous indoor robots, the computer vision task of indoor scene recognition has gained the spotlight. Indoor scene recognition is a challenging problem in computer vision that relies on local and global features in a scene. This study aims to compare the performance of five machine learning algorithms on the task of indoor scene classification to identify the pros and cons of each classifier. It also provides a comparison of low latency feature extractors versus enormous feature extractors to understand the performance effects. Finally, a simple MnasNet based indoor classification system is proposed, which can achieve 72% accuracy at 23 ms latency.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Pointwise Convolution · Batch Normalization · Depthwise Separable Convolution · 1x1 Convolution · Inverted Residual Block · Softmax · Average Pooling · Sigmoid Activation
