# Using synthetic dataset for semantic segmentation of the human body in the problem of extracting anthropometric data

**Authors:** Azat Absadyk, Olzhas Turar, Darkhan Akhmed-Zaki

PMC · DOI: 10.3389/frai.2024.1336320 · Frontiers in Artificial Intelligence · 2024-08-09

## TL;DR

This paper presents a method to extract accurate body measurements from real images using a model trained on synthetic data, which could improve virtual fitting in e-commerce.

## Contribution

A novel approach using synthetic data for training a semantic segmentation model to extract anthropometric data from real images, accounting for clothing.

## Key findings

- U-Net with EfficientNet backbone achieved 99.83% training accuracy and 0.977 IoU score.
- The model accurately segmented body shape in real images while accounting for clothing.
- Average deviations from actual measurements were minimal, with the highest at 1.15 cm for chest measurements.

## Abstract

The COVID-19 pandemic highlighted the need for accurate virtual sizing in e-commerce to reduce returns and waste. Existing methods for extracting anthropometric data from images have limitations. This study aims to develop a semantic segmentation model trained on synthetic data that can accurately determine body shape from real images, accounting for clothing.

A synthetic dataset of over 22,000 images was created using NVIDIA Omniverse Replicator, featuring human models in various poses, clothing, and environments. Popular CNN architectures (U-Net, SegNet, DeepLabV3, PSPNet) with different backbones were trained on this dataset for semantic segmentation. Models were evaluated on accuracy, precision, recall, and IoU metrics. The best performing model was tested on real human subjects and compared to actual measurements.

U-Net with EfficientNet backbone showed the best performance, with 99.83% training accuracy and 0.977 IoU score. When tested on real images, it accurately segmented body shape while accounting for clothing. Comparison with actual measurements on 9 subjects showed average deviations of −0.24 cm for neck, −0.1 cm for shoulder, 1.15 cm for chest, −0.22 cm for thallium, and 0.17 cm for hip measurements.

The synthetic dataset and trained models enable accurate extraction of anthropometric data from real images while accounting for clothing. This approach has significant potential for improving virtual fitting and reducing returns in e-commerce. Future work will focus on refining the algorithm, particularly for thallium and hip measurements which showed higher variability.

## Linked entities

- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382)
- **Chemicals:** thallium (MESH:D013793)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11342039/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11342039/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC11342039/full.md

---
Source: https://tomesphere.com/paper/PMC11342039