Hformer: Hybrid CNN-Transformer for Fringe Order Prediction in Phase Unwrapping of Fringe Projection
Xinjun Zhu, Zhiqiang Han, Mengkai Yuan, Qinghua Guo, Hongyi Wang

TL;DR
This paper introduces Hformer, a hybrid CNN-Transformer model for phase unwrapping in fringe projection 3D measurement, demonstrating improved fringe order prediction over CNN-only models.
Contribution
The paper pioneers integrating Transformer architecture into phase unwrapping, creating a hybrid model that outperforms CNN-based methods like U-Net and DCNN.
Findings
Hformer achieves better fringe order prediction accuracy.
Ablation studies confirm the effectiveness of feature pyramid networks and flipping strategies.
The approach offers a new direction for deep learning in phase unwrapping.
Abstract
Recently, deep learning has attracted more and more attention in phase unwrapping of fringe projection three-dimensional (3D) measurement, with the aim to improve the performance leveraging the powerful Convolutional Neural Network (CNN) models. In this paper, for the first time (to the best of our knowledge), we introduce the Transformer into the phase unwrapping which is different from CNN and propose Hformer model dedicated to phase unwrapping via fringe order prediction. The proposed model has a hybrid CNN-Transformer architecture that is mainly composed of backbone, encoder and decoder to take advantage of both CNN and Transformer. Encoder and decoder with cross attention are designed for the fringe order prediction. Experimental results show that the proposed Hformer model achieves better performance in fringe order prediction compared with the CNN models such as U-Net and DCNN.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical measurement and interference techniques · Advanced Measurement and Metrology Techniques · Industrial Vision Systems and Defect Detection
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Concatenated Skip Connection · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · U-Net · Label Smoothing
