OXSeg: Multidimensional Attention UNet-Based Lip Segmentation Using Semi-Supervised Lip Contours
HANIE MOGHADDASI, CHRISTINA CHAMBERS, SARAH N. MATTSON, JEFFREY R. WOZNIAK, CLAIRE D. COLES, RAJA MUKHERJEE, MICHAEL SUTTIE

TL;DR
This paper introduces a new lip segmentation method using attention UNet and multidimensional inputs, achieving high accuracy in identifying lip contours and detecting fetal alcohol syndrome-related anomalies.
Contribution
The novel approach combines multidimensional attention UNet with anatomical landmark-based mask generation to improve lip segmentation accuracy.
Findings
The proposed method achieved a mean dice score of 84.75% and pixel accuracy of 99.77% in upper lip segmentation.
A GAN-based classifier achieved 98.55% accuracy in identifying fetal alcohol syndrome in one population.
Abstract
Lip segmentation plays a crucial role in various domains, such as lip synchronization, lip-reading, and diagnostics. However, the effectiveness of supervised lip segmentation is constrained by the availability of lip contour in the training phase. A further challenge with lip segmentation is its reliance on image quality, lighting, and skin tone, leading to inaccuracies in the detected boundaries. To address these challenges, we propose a sequential lip segmentation method that integrates attention UNet and multidimensional input. We unravel the micro-patterns in facial images using local binary patterns to build multidimensional inputs. Subsequently, the multidimensional inputs are fed into sequential attention UNets, where the lip contour is reconstructed. We introduce a mask generation method that uses a few anatomical landmarks and estimates the complete lip contour to improve…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Face recognition and analysis · Facial Rejuvenation and Surgery Techniques
