OXSeg: Multidimensional Attention UNet-Based Lip Segmentation Using Semi-Supervised Lip Contours

HANIE MOGHADDASI; CHRISTINA CHAMBERS; SARAH N. MATTSON; JEFFREY R. WOZNIAK; CLAIRE D. COLES; RAJA MUKHERJEE; MICHAEL SUTTIE

PMC · DOI:10.1109/access.2025.3577000·August 5, 2025

OXSeg: Multidimensional Attention UNet-Based Lip Segmentation Using Semi-Supervised Lip Contours

HANIE MOGHADDASI, CHRISTINA CHAMBERS, SARAH N. MATTSON, JEFFREY R. WOZNIAK, CLAIRE D. COLES, RAJA MUKHERJEE, MICHAEL SUTTIE

PDF

Open Access

TL;DR

This paper introduces a new lip segmentation method using attention UNet and multidimensional inputs, achieving high accuracy in identifying lip contours and detecting fetal alcohol syndrome-related anomalies.

Contribution

The novel approach combines multidimensional attention UNet with anatomical landmark-based mask generation to improve lip segmentation accuracy.

Findings

01

The proposed method achieved a mean dice score of 84.75% and pixel accuracy of 99.77% in upper lip segmentation.

02

A GAN-based classifier achieved 98.55% accuracy in identifying fetal alcohol syndrome in one population.

Abstract

Lip segmentation plays a crucial role in various domains, such as lip synchronization, lip-reading, and diagnostics. However, the effectiveness of supervised lip segmentation is constrained by the availability of lip contour in the training phase. A further challenge with lip segmentation is its reliance on image quality, lighting, and skin tone, leading to inaccuracies in the detected boundaries. To address these challenges, we propose a sequential lip segmentation method that integrates attention UNet and multidimensional input. We unravel the micro-patterns in facial images using local binary patterns to build multidimensional inputs. Subsequently, the multidimensional inputs are fed into sequential attention UNets, where the lip contour is reconstructed. We introduce a mask generation method that uses a few anatomical landmarks and estimates the complete lip contour to improve…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases3

fetal alcohol syndrome facial anomalies FAS

Figures10

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Face recognition and analysis · Facial Rejuvenation and Surgery Techniques