FOSS: Multi-Person Age Estimation with Focusing on Objects and Still Seeing Surroundings
Masakazu Yoshimura, Satoshi Ogata

TL;DR
This paper introduces a unified model for multi-person age estimation that detects faces and estimates ages simultaneously, outperforming traditional two-model approaches especially in complex scenes.
Contribution
The paper presents a novel single-model approach for multi-person age estimation that focuses on faces while still perceiving surroundings, trained with single-face images but effective in multi-person scenarios.
Findings
Enhanced accuracy over traditional two-model methods.
Effective on both multi-person and single-person datasets.
Outperforms state-of-the-art in age estimation accuracy.
Abstract
Age estimation from images can be used in many practical scenes. Most of the previous works targeted on the estimation from images in which only one face exists. Also, most of the open datasets for age estimation contain images like that. However, in some situations, age estimation in the wild and for multi-person is needed. Usually, such situations were solved by two separate models; one is a face detector model which crops facial regions and the other is an age estimation model which estimates from cropped images. In this work, we propose a method that can detect and estimate the age of multi-person with a single model which estimates age with focusing on faces and still seeing surroundings. Also, we propose a training method which enables the model to estimate multi-person well despite trained with images in which only one face is photographed. In the experiments, we evaluated our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis
