Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space
Core Francisco Park, Maya Okawa, Andrew Lee, Hidenori Tanaka, Ekdeep, Singh Lubana

TL;DR
This paper introduces a framework called concept space to analyze how generative models learn and develop hidden capabilities, revealing that latent abilities emerge suddenly during training and are controlled by data properties.
Contribution
It proposes a novel concept space framework to study learning dynamics and hidden capability emergence in generative models, linking these phenomena to data properties.
Findings
Learning speed and order are influenced by concept signal properties.
Sudden turns in learning dynamics correspond to the emergence of hidden capabilities.
Hidden capabilities can exist without being elicitable through naive prompts.
Abstract
Modern generative models demonstrate impressive capabilities, likely stemming from an ability to identify and manipulate abstract concepts underlying their training data. However, fundamental questions remain: what determines the concepts a model learns, the order in which it learns them, and its ability to manipulate those concepts? To address these questions, we propose analyzing a model's learning dynamics via a framework we call the concept space, where each axis represents an independent concept underlying the data generating process. By characterizing learning dynamics in this space, we identify how the speed at which a concept is learned, and hence the order of concept learning, is controlled by properties of the data we term concept signal. Further, we observe moments of sudden turns in the direction of a model's learning dynamics in concept space. Surprisingly, these points…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsInnovative Teaching and Learning Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus
