F\"urElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance
Ruocheng Wang, Pei Xu, Haochen Shi, Elizabeth Schumann, C. Karen Liu

TL;DR
This paper introduces a large-scale dataset of 3D hand motions of pianists, and develops a novel pipeline using imitation learning, reinforcement learning, and diffusion models to synthesize realistic, physically-plausible hand movements for piano performances.
Contribution
It presents the first large-scale dataset of hand motions for piano playing and a new method combining diffusion models and reinforcement learning for realistic motion synthesis.
Findings
Generated motions are natural and dexterous.
The approach generalizes to unseen music pieces.
Enhanced motion accuracy through data augmentation.
Abstract
Piano playing requires agile, precise, and coordinated hand control that stretches the limits of dexterity. Hand motion models with the sophistication to accurately recreate piano playing have a wide range of applications in character animation, embodied AI, biomechanics, and VR/AR. In this paper, we construct a first-of-its-kind large-scale dataset that contains approximately 10 hours of 3D hand motion and audio from 15 elite-level pianists playing 153 pieces of classical music. To capture natural performances, we designed a markerless setup in which motions are reconstructed from multi-view videos using state-of-the-art pose estimation models. The motion data is further refined via inverse kinematics using the high-resolution MIDI key-pressing data obtained from sensors in a specialized Yamaha Disklavier piano. Leveraging the collected dataset, we developed a pipeline that can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Human Motion and Animation · Music and Audio Processing
MethodsDiffusion
