Seek and You Will Find: A New Optimized Framework for Efficient Detection of Pedestrian
Sudip Das, Partha Sarathi Mukherjee, Ujjwal Bhattacharya

TL;DR
This paper introduces a new deep learning framework for pedestrian detection that balances speed and accuracy, utilizing a novel dataset and achieving significant performance improvements over existing methods.
Contribution
The paper presents a simple yet effective deep learning strategy and a new annotated dataset to enhance pedestrian detection, especially for small-sized pedestrians.
Findings
Achieved nearly 16% improvement over state-of-the-art methods.
Developed a new dataset with over 80K annotated pedestrians.
Demonstrated improved detection performance on multiple datasets.
Abstract
Studies of object detection and localization, particularly pedestrian detection have received considerable attention in recent times due to its several prospective applications such as surveillance, driving assistance, autonomous cars, etc. Also, a significant trend of latest research studies in related problem areas is the use of sophisticated Deep Learning based approaches to improve the benchmark performance on various standard datasets. A trade-off between the speed (number of video frames processed per second) and detection accuracy has often been reported in the existing literature. In this article, we present a new but simple deep learning based strategy for pedestrian detection that improves this trade-off. Since training of similar models using publicly available sample datasets failed to improve the detection performance to some significant extent, particularly for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Automated Road and Building Extraction
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
