Distillation of neural network models for detection and description of key points of images
A.V. Yashchenko, A.V. Belikov, M.V. Peterson, A.S. Potapov

TL;DR
This paper explores neural network distillation to create more compact, faster models for image key point detection and description, achieving comparable or improved accuracy with fewer parameters.
Contribution
It introduces a novel distillation method and training procedure for neural networks in key point detection, along with a new dataset and quality metric.
Findings
Distilled models outperform original models in key point accuracy.
Smaller models maintain high accuracy close to original models.
Proposed method enhances speed and efficiency of neural key point detection.
Abstract
Image matching and classification methods, as well as synchronous location and mapping, are widely used on embedded and mobile devices. Their most resource-intensive part is the detection and description of the key points of the images. And if the classical methods of detecting and describing key points can be executed in real time on mobile devices, then for modern neural network methods with the best quality, such use is difficult. Thus, it is important to increase the speed of neural network models for the detection and description of key points. The subject of research is distillation as one of the methods for reducing neural network models. The aim of thestudy is to obtain a more compact model of detection and description of key points, as well as a description of the procedure for obtaining this model. A method for the distillation of neural networks for the task of detecting and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computational Techniques in Science and Engineering · Advanced Data Processing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
