# Distilling Knowledge From a Deep Pose Regressor Network

**Authors:** Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Yasin Almalioglu, Andrew Markham, Niki Trigoni

arXiv: 1908.00858 · 2019-08-05

## TL;DR

This paper introduces a method for distilling knowledge from a deep pose regression network to make visual odometry more efficient. It uses the teacher's own loss as a confidence score so that the student imitates the teacher only where the teacher is reliable, achieving up to 92.95% parameter reduction and 2.12x faster computation.

## Contribution

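To the best of the authors' knowledge, it presents the first successful approach to distilling knowledge from a deep pose regression network. The approach applies a teacher-confidence weighting both to the main training loss (Attentive Imitation Loss, AIL) and to the learning of the teacher's intermediate representation (Attentive Hint Training, AHT).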

## Key findings

- Up to 92.95% fewer parameters in the student than in the teacher
- Up to 2.12x faster computation
- Student predictions kept close to the teacher's on the KITTI and Malaga datasets
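
As a quick reading aid, the two headline figures can be converted into ratios. The symbols $N_T$, $N_S$ (teacher and student parameter counts) and $t_T$, $t_S$ (teacher and student computation times) are notation introduced here for illustration, not taken from the paper:

$$
\frac{N_S}{N_T} = 1 - 0.9295 = 0.0705
\quad\Rightarrow\quad
\frac{N_T}{N_S} = \frac{1}{0.0705} \approx 14.2
$$

$$
\frac{t_S}{t_T} = \frac{1}{2.12} \approx 0.472
$$

Read this way, the smallest student has roughly one fourteenth of the teacher's parameters and needs a little under half of its computation time, assuming "2.12x faster" refers to the ratio of computation times.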

## Abstract

This paper presents a novel method to distill knowledge from a deep pose regressor network for efficient Visual Odometry (VO). Standard distillation relies on "dark knowledge" for successful knowledge transfer. As this knowledge is not available in pose regression and the teacher prediction is not always accurate, we propose to emphasize the knowledge transfer only when we trust the teacher. We achieve this by using the teacher loss as a confidence score, which places variable relative importance on the teacher prediction. We inject this confidence score into the main training task via the Attentive Imitation Loss (AIL), and into the learning of the teacher's intermediate representation via the Attentive Hint Training (AHT) approach. To the best of our knowledge, this is the first work that successfully distills knowledge from a deep pose regression network. Our evaluation on the KITTI and Malaga datasets shows that we can keep the student prediction close to the teacher's with up to 92.95% parameter reduction and 2.12x faster computation.
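
The confidence-weighting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the exact formula, so the squared-error loss, the min-max normalization of the teacher loss, the balancing factor `alpha`, and all function names below are assumptions made for illustration. The sketch only shows the general mechanism, in which a sample where the teacher is far from the ground truth contributes little imitation loss.

```python
from typing import List, Sequence

Pose = Sequence[float]  # e.g. a relative pose vector (translation + rotation)


def squared_error(a: Pose, b: Pose) -> float:
    """Sum of squared differences between two pose vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def confidence_scores(teacher: List[Pose], truth: List[Pose]) -> List[float]:
    """Turn each teacher loss into a weight in [0, 1]; low loss means high trust.

    Assumption: min-max normalization over the batch. The summary does not
    state which normalization the paper uses.
    """
    losses = [squared_error(t, g) for t, g in zip(teacher, truth)]
    lowest, highest = min(losses), max(losses)
    spread = highest - lowest
    if spread == 0.0:
        # Teacher is equally reliable on every sample: trust it fully.
        return [1.0] * len(losses)
    return [1.0 - (e - lowest) / spread for e in losses]


def attentive_imitation_loss(
    student: List[Pose],
    teacher: List[Pose],
    truth: List[Pose],
    alpha: float = 0.5,  # hypothetical balance between ground truth and teacher
) -> float:
    """Mean over the batch of: alpha * student-vs-truth error
    + (1 - alpha) * confidence * student-vs-teacher error."""
    weights = confidence_scores(teacher, truth)
    total = 0.0
    for s, t, g, w in zip(student, teacher, truth, weights):
        hard = squared_error(s, g)       # supervised term
        soft = w * squared_error(s, t)   # imitation term, scaled by trust
        total += alpha * hard + (1.0 - alpha) * soft
    return total / len(student)
```

With a two-sample batch where the teacher is exact on the first sample and far off on the second, the second sample receives a weight of zero, so the student is not pulled toward the unreliable teacher prediction there. The same per-sample weight could in principle scale a loss on intermediate features, which is the role the abstract assigns to Attentive Hint Training.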

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.00858/full.md

## Figures

30 figures with captions in the complete paper: https://tomesphere.com/paper/1908.00858/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1908.00858/full.md

---
Source: https://tomesphere.com/paper/1908.00858