The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences
Bria Long, Robert Z. Sparks, Violet Xiang, Stefan Stojanov, Zi Yin, Grace E. Keene, Alvin W. M. Tan, Steven Y. Feng, Chengxu Zhuang, Virginia A. Marchman, Daniel L. K. Yamins, and Michael C. Frank

TL;DR
The BabyView dataset provides high-resolution, egocentric videos of infants and young children aged 6 months to 3 years, enabling research on human development and AI models through diverse, annotated, real-world data.
Contribution
This is the first large-scale, high-resolution egocentric infant video dataset with extensive annotations, facilitating development and evaluation of models in developmental and computer vision tasks.
Findings
Models trained on BabyView perform worse than models trained on curated datasets.
Performance improves with dataset size but remains below human levels.
The dataset presents a challenge for developing human-like AI systems.
Abstract
Human children far exceed modern machine learning algorithms in their sample efficiency, achieving high performance in key domains with much less data than current models. This "data gap" is a key challenge both for building intelligent artificial systems and for understanding human development. Egocentric video capturing children's experience (their "training data") is a key ingredient for comparison of humans and models and for the development of algorithmic innovations to bridge this gap. Yet there are few such datasets available, and extant data are low-resolution, have limited metadata, and importantly, represent only a small set of children's experiences. Here, we provide the first release of a large developmental egocentric video dataset, the BabyView dataset, recorded using a high-resolution camera with a large vertical field-of-view and gyroscope/accelerometer data. This…
Taxonomy
Topics: Identity, Memory, and Therapy · Youth Education and Societal Dynamics
Methods: Sparse Evolutionary Training
