Fast and Efficient Scene Categorization for Autonomous Driving using   VAEs

Saravanabalagi Ramachandran; Jonathan Horgan; Ganesh Sistu; and John; McDonald

arXiv:2210.14981·cs.CV·October 28, 2022

Fast and Efficient Scene Categorization for Autonomous Driving using VAEs

Saravanabalagi Ramachandran, Jonathan Horgan, Ganesh Sistu, and John, McDonald

PDF

TL;DR

This paper introduces a fast, compact, and robust scene categorization method for autonomous driving that uses a variational autoencoder to generate global image descriptors, achieving efficient high-level scene recognition.

Contribution

The paper presents a novel approach using VAEs to create interpretable, global image descriptors for scene categorization, improving speed and robustness over existing methods.

Findings

01

Descriptors are compact with 128 dimensions.

02

Method is significantly faster to compute.

03

Descriptors are robust to seasonal and lighting changes.

Abstract

Scene categorization is a useful precursor task that provides prior knowledge for many advanced computer vision tasks with a broad range of applications in content-based image indexing and retrieval systems. Despite the success of data driven approaches in the field of computer vision such as object detection, semantic segmentation, etc., their application in learning high-level features for scene recognition has not achieved the same level of success. We propose to generate a fast and efficient intermediate interpretable generalized global descriptor that captures coarse features from the image and use a classification head to map the descriptors to 3 scene categories: Rural, Urban and Suburban. We train a Variational Autoencoder in an unsupervised manner and map images to a constrained multi-dimensional latent space and use the latent vectors as compact embeddings that serve as global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.