Supervision-by-Hallucination-and-Transfer: A Weakly-Supervised Approach for Robust and Precise Facial Landmark Detection

Jun Wan; Yuanzhi Yao; Zhihui Lai; Jie Zhou; Xianxu Hou; Wenwen Min

arXiv:2601.12919·cs.CV·January 21, 2026

Supervision-by-Hallucination-and-Transfer: A Weakly-Supervised Approach for Robust and Precise Facial Landmark Detection

Jun Wan, Yuanzhi Yao, Zhihui Lai, Jie Zhou, Xianxu Hou, Wenwen Min

PDF

Open Access

TL;DR

This paper introduces a weakly-supervised framework combining face hallucination and pose transfer to improve facial landmark detection accuracy, especially on low-resolution or compressed images, surpassing existing methods.

Contribution

It presents the first integration of face hallucination and pose transfer tasks for weakly-supervised facial landmark detection, enhancing robustness and precision.

Findings

01

Outperforms state-of-the-art FLD methods in experiments

02

Effectively recovers facial details from low-resolution images

03

Improves landmark detection accuracy through pose transfer

Abstract

High-precision facial landmark detection (FLD) relies on high-resolution deep feature representations. However, low-resolution face images or the compression (via pooling or strided convolution) of originally high-resolution images hinder the learning of such features, thereby reducing FLD accuracy. Moreover, insufficient training data and imprecise annotations further degrade performance. To address these challenges, we propose a weakly-supervised framework called Supervision-by-Hallucination-and-Transfer (SHT) for more robust and precise FLD. SHT contains two novel mutually enhanced modules: Dual Hallucination Learning Network (DHLN) and Facial Pose Transfer Network (FPTN). By incorporating FLD and face hallucination tasks, DHLN is able to learn high-resolution representations with low-resolution inputs for recovering both facial structures and local details and generating more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception