# Hierarchical attention transformer provides assistant suggestions for orbital rejuvenation surgery

**Authors:** Xiang Lian, Xin Hu, Guannan Li, Siqi Wu, Yihao Liu, Ke Qin, Kai Liu

PMC · DOI: 10.3389/fmed.2025.1532195 · Frontiers in Medicine · 2025-03-06

## TL;DR

A new AI system called HATrans helps detect signs of aging around the eyes at home, potentially reducing the need for in-person plastic surgery consultations.

## Contribution

The Hierarchical Attention Transformer (HATrans) introduces specialized branches for improved periocular image classification and early aging detection.

## Key findings

- HATrans outperformed ResNet and Swin Transformer in accuracy, sensitivity, and specificity for periocular aging detection.
- The hierarchical attention mechanism effectively distinguishes subtle foreground-background differences in smartphone images.
- The model enables home-based early aging identification, supporting clinical decision-making in cosmetic surgery.

## Abstract

Early detection of periocular aging is a common concern in cosmetic surgery. Traditional diagnostic and treatment methods often require hospital visits and consultations with plastic surgeons, which are costly and time-consuming. This study aims to develop and evaluate an AI-based decision-making system for periocular cosmetic surgery, utilizing a Hierarchical Attention Transformer (HATrans) model designed for multi-label classification in periocular conditions, allowing for home-based early aging identification.

This cross-sectional study was conducted at the Department of Plastic and Reconstructive Surgery at Shanghai Jiao Tong University School of Medicine’s Ninth People’s Hospital from September 1, 2010, to April 30, 2024. The study enhanced the Vision Transformer (ViT) by adding two specialized branches: the Region Recognition Branch for foreground area identification, and the Patch Recognition Branch for refined feature representation via contrastive learning. These enhancements allowed for better handling of complex periocular images.

The HATrans model significantly outperformed baseline architectures such as ResNet and Swin Transformer, achieving superior accuracy, sensitivity, and specificity in identifying periocular aging. Ablation studies demonstrated the critical role of the hierarchical attention mechanism in distinguishing subtle foreground-background differences, improving the model’s performance in smartphone-based image analysis.

The HATrans model represents a significant advancement in multi-label classification for facial aesthetics, offering a practical solution for early periocular aging detection at home. The model’s robust performance supports its potential for assisting clinical decision-making in cosmetic surgery, facilitating accessible and timely treatment recommendations.

## Full-text entities

- **Diseases:** hypertrophy of the (MESH:D006984), periocular deformities (MESH:D019557), bleeding (MESH:D006470), epicanthal folds (MESH:D057165), facial trauma (MESH:D020220), headaches (MESH:D006261), oculi muscle (MESH:D019042), facial deformities (MESH:D005153), trichiasis (MESH:D058457), lateral canthal deformities (MESH:D010509), skin laxity (MESH:D007593), palpebral pocket (MESH:D005888), corneal, and conjunctival irritation (MESH:D003229), eyelid laxity (MESH:D005141), infection (MESH:D007239), XL (MESH:D000080345), ptosis (MESH:C564553), vision problems (MESH:D014786), skin damage (MESH:D012871), strabismus (MESH:D013285)
- **Chemicals:** ViT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11922868/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11922868/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC11922868/full.md

---
Source: https://tomesphere.com/paper/PMC11922868