# Multilevel Context Representation for Improving Object Recognition

**Authors:** Andreas K\"olsch, Muhammad Zeshan Afzal, Marcus Liwicki

arXiv: 1703.06408 · 2018-03-28

## TL;DR

This paper introduces a multilevel context representation method that combines low- and high-level CNN features to enhance object recognition accuracy without increasing computational cost.

## Contribution

It extends existing CNN architectures with additional top-layer connections, demonstrating improved accuracy on ImageNet by leveraging context near high-level layers.

## Key findings

- Achieved 1-2% reduction in classification error on ImageNet.
- The approach is orthogonal to data augmentation techniques.
- No additional computational cost incurred.

## Abstract

In this work, we propose the combined usage of low- and high-level blocks of convolutional neural networks (CNNs) for improving object recognition. While recent research focused on either propagating the context from all layers, e.g. ResNet, (including the very low-level layers) or having multiple loss layers (e.g. GoogLeNet), the importance of the features close to the higher layers is ignored. This paper postulates that the use of context closer to the high-level layers provides the scale and translation invariance and works better than using the top layer only. In particular, we extend AlexNet and GoogLeNet by additional connections in the top $n$ layers. In order to demonstrate the effectiveness of the proposed approach, we evaluated it on the standard ImageNet task. The relative reduction of the classification error is around 1-2% without affecting the computational cost. Furthermore, we show that this approach is orthogonal to typical test data augmentation techniques, as recently introduced by Szegedy et al. (leading to a runtime reduction of 144 during test time).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.06408/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1703.06408/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1703.06408/full.md

---
Source: https://tomesphere.com/paper/1703.06408