MeFEm: Medical Face Embedding model

Yury Borets; Stepan Botman

arXiv:2602.14672·cs.CV·February 17, 2026

MeFEm: Medical Face Embedding model

Yury Borets, Stepan Botman

PDF

Open Access 1 Models

TL;DR

MeFEm is a novel vision model for biometric and medical facial analysis that employs innovative masking and loss strategies, outperforming existing models on key tasks with less data.

Contribution

Introduces MeFEm, a new facial embedding model with unique modifications like axial stripe masking and probabilistic CLS reassignment, achieving superior performance.

Findings

01

Outperforms FaRL and Franca on anthropometric tasks

02

Effective BMI estimation on a new consolidated dataset

03

Uses less data than comparable models

Abstract

We present MeFEm, a vision model based on a modified Joint Embedding Predictive Architecture (JEPA) for biometric and medical analysis from facial images. Key modifications include an axial stripe masking strategy to focus learning on semantically relevant regions, a circular loss weighting scheme, and the probabilistic reassignment of the CLS token for high quality linear probing. Trained on a consolidated dataset of curated images, MeFEm outperforms strong baselines like FaRL and Franca on core anthropometric tasks despite using significantly less data. It also shows promising results on Body Mass Index (BMI) estimation, evaluated on a novel, consolidated closed-source dataset that addresses the domain bias prevalent in existing data. Model weights are available at https://huggingface.co/boretsyury/MeFEm , offering a strong baseline for future work in this domain.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
boretsyury/MeFEm
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition