Hybrid Channel Based Pedestrian Detection
Fiseha B. Tesema, Hong Wu, Mingjian Chen, Junpeng Lin, William Zhu,, Kaizhu Huang

TL;DR
This paper introduces a pedestrian detection framework that combines handcrafted features with CNN features, leveraging higher spatial resolution from handcrafted channels to improve detection accuracy, especially for small instances.
Contribution
It extends the RPN+BF framework by integrating handcrafted features with CNN features using RoI-pooling, enhancing detection performance for small pedestrians.
Findings
Handcrafted features outperform CNN features from VGG-16 in detection accuracy.
Combining handcrafted and CNN features yields better results than using either alone.
The proposed method achieves competitive results on the Caltech pedestrian dataset.
Abstract
Pedestrian detection has achieved great improvements with the help of Convolutional Neural Networks (CNNs). CNN can learn high-level features from input images, but the insufficient spatial resolution of CNN feature channels (feature maps) may cause a loss of information, which is harmful especially to small instances. In this paper, we propose a new pedestrian detection framework, which extends the successful RPN+BF framework to combine handcrafted features and CNN features. RoI-pooling is used to extract features from both handcrafted channels (e.g. HOG+LUV, CheckerBoards or RotatedFilters) and CNN channels. Since handcrafted channels always have higher spatial resolution than CNN channels, we apply RoI-pooling with larger output resolution to handcrafted channels to keep more detailed information. Our ablation experiments show that the developed handcrafted features can reach better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network
