Going Deeper with Convolutions
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed,, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich

TL;DR
The paper introduces the Inception architecture, notably GoogLeNet, a deep convolutional neural network that achieved state-of-the-art results in ImageNet 2014 by efficiently utilizing computational resources through multi-scale processing.
Contribution
It presents a novel deep CNN architecture that increases depth and width without extra computational cost, improving image classification and detection performance.
Findings
Achieved new state-of-the-art in ImageNet classification and detection
Designed a 22-layer deep network called GoogLeNet
Demonstrated efficient resource utilization in deep networks
Abstract
We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. This was achieved by a carefully crafted design that allows for increasing the depth and width of the network while keeping the computational budget constant. To optimize quality, the architectural decisions were based on the Hebbian principle and the intuition of multi-scale processing. One particular incarnation used in our submission for ILSVRC 2014 is called GoogLeNet, a 22 layers deep network, the quality of which is assessed in the context of classification and detection.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗qualcomm/GoogLeNetmodel· 106 dl106 dl
- 🤗Kalray/googlenetmodel· 17 dl17 dl
- 🤗onnxmodelzoo/googlenet-12-int8model
- 🤗onnxmodelzoo/googlenet-12-qdqmodel
- 🤗onnxmodelzoo/googlenet-12model
- 🤗onnxmodelzoo/googlenet-3model
- 🤗onnxmodelzoo/googlenet-6model
- 🤗onnxmodelzoo/googlenet-7model
- 🤗onnxmodelzoo/googlenet-8model
- 🤗onnxmodelzoo/googlenet-9model
Videos
Drago Anguelov — Robustness, Safety, and Scalability at Waymo· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsConvolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax
