TL;DR
EUPE is a compact, versatile vision encoder that distills knowledge from multiple domain experts, achieving high performance on diverse tasks efficiently suitable for edge devices.
Contribution
Introduces EUPE, a novel distillation approach that scales up to a large proxy teacher before scaling down, improving efficiency and versatility over previous methods.
Findings
EUPE matches or exceeds individual domain experts in diverse tasks.
EUPE outperforms previous agglomerative encoders in efficiency and performance.
Full EUPE models and code are publicly released.
Abstract
Running AI models on smart edge devices can unlock versatile user experiences, but presents challenges due to limited compute and the need to handle multiple tasks simultaneously. This requires a vision encoder with small size but powerful and versatile representations. We present our method, Efficient Universal Perception Encoder (EUPE), which offers both inference efficiency and universally good representations for diverse downstream tasks. We achieve this by distilling from multiple domain-expert foundation vision encoders. Unlike previous agglomerative methods that directly scale down from multiple teachers to an efficient encoder, we demonstrate the importance of first scaling up to a large proxy teacher and then scaling down from this single teacher. Experiments show that EUPE achieves on-par or better performance than individual domain experts of the same size on diverse task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗facebook/EUPE-ViT-Bmodel· 41k dl· ♡ 2241k dl♡ 22
- 🤗facebook/EUPE-ViT-Smodel· 639 dl· ♡ 9639 dl♡ 9
- 🤗facebook/EUPE-ViT-Tmodel· 398 dl· ♡ 6398 dl♡ 6
- 🤗facebook/EUPE-ConvNeXt-Bmodel· 167 dl· ♡ 6167 dl♡ 6
- 🤗facebook/EUPE-ConvNeXt-Smodel· 114 dl· ♡ 8114 dl♡ 8
- 🤗facebook/EUPE-ConvNeXt-Tmodel· 120 dl· ♡ 4120 dl♡ 4
- 🤗abdelstark/eupe-vit-b16-onnxmodel
- 🤗BiliSakura/EUPE-ConvNeXt-Tmodel· 3 dl3 dl
- 🤗BiliSakura/EUPE-ConvNeXt-Smodel· 6 dl6 dl
- 🤗BiliSakura/EUPE-ConvNeXt-Bmodel· 5 dl5 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
