Factorial Convolution Neural Networks
Jaemo Sung, Eun-Sung Jung

TL;DR
This paper introduces FactorNet, a lightweight convolutional neural network composed of multiple sub CNNs, which improves object detection accuracy and speed over GoogleNet in real-time systems.
Contribution
The paper proposes FactorNet, a novel CNN architecture with independent sub CNNs that reduces overheads and enhances detection performance compared to existing models.
Findings
FactorNet achieves at least 5% better accuracy than GoogleNet.
FactorNet provides additional speedup in real-time object detection.
FactorNet outperforms GoogleNet on the KITTI dataset.
Abstract
In recent years, GoogleNet has garnered substantial attention as one of the base convolutional neural networks (CNNs) to extract visual features for object detection. However, it experiences challenges of contaminated deep features when concatenating elements with different properties. Also, since GoogleNet is not an entirely lightweight CNN, it still has many execution overheads to apply to a resource-starved application domain. Therefore, a new CNNs, FactorNet, has been proposed to overcome these functional challenges. The FactorNet CNN is composed of multiple independent sub CNNs to encode different aspects of the deep visual features and has far fewer execution overheads in terms of weight parameters and floating-point operations. Incorporating FactorNet into the Faster-RCNN framework proved that FactorNet gives \ignore{a 5\%} better accuracy at a minimum and produces additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
Methods1x1 Convolution · Softmax · Average Pooling · Local Response Normalization · Convolution · Auxiliary Classifier · Dense Connections · Inception Module · Max Pooling · Dropout
