Piano Skills Assessment
Paritosh Parmar, Jaiden Reddy, Brendan Morris

TL;DR
This paper introduces a new multimodal dataset for assessing piano playing skills using visual and audio data, exploring the effectiveness of different assessment methods and providing baselines for future research.
Contribution
It presents the first dataset for multimodal piano skill assessment and investigates the comparative effectiveness of visual versus auditory analysis.
Findings
Dataset enables multimodal skill evaluation
Visual and audio cues both contribute to assessment accuracy
Provides baseline models for future research
Abstract
Can a computer determine a piano player's skill level? Is it preferable to base this assessment on visual analysis of the player's performance, or should we trust our ears over our eyes? Since current CNNs have difficulty processing long videos, how can shorter clips be sampled to best reflect the player's skill level? In this work, we collect and release a first-of-its-kind dataset for multimodal skill assessment focused on assessing a piano player's skill level, answer the questions above, initiate work on automated evaluation of piano playing skills, and provide baselines for future work. The dataset is available from: https://github.com/ParitoshParmar/Piano-Skills-Assessment.
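To make the clip-sampling question concrete, here is a minimal sketch of one possible strategy: taking evenly spaced, fixed-length clips from a long recording. This is only an illustration and is not taken from the paper, which does not specify its sampling scheme in the abstract. The function names `sample_clip_starts` and `clip_frame_indices`, and the parameters `num_frames`, `clip_len`, and `num_clips`, are hypothetical.

```python
def sample_clip_starts(num_frames, clip_len, num_clips):
    """Return evenly spaced start frames for fixed-length clips.

    Illustrative assumption only: uniform spacing is one simple option,
    not necessarily the sampling scheme used by the authors.
    """
    if clip_len <= 0 or num_clips <= 0:
        raise ValueError("clip_len and num_clips must be positive")
    if num_frames < clip_len:
        raise ValueError("video is shorter than a single clip")

    last_start = num_frames - clip_len  # latest valid start frame
    if num_clips == 1:
        # A single clip is centered in the video.
        return [last_start // 2]
    # Spread the starts from frame 0 to last_start (integer arithmetic).
    return [(i * last_start) // (num_clips - 1) for i in range(num_clips)]


def clip_frame_indices(start, clip_len):
    """Return the frame indices covered by a clip beginning at `start`."""
    return list(range(start, start + clip_len))


if __name__ == "__main__":
    # Hypothetical 100-frame video, four 16-frame clips.
    for start in sample_clip_starts(num_frames=100, clip_len=16, num_clips=4):
        frames = clip_frame_indices(start, 16)
        print(start, frames[0], frames[-1])
```

Uniform spacing covers the whole performance with a fixed compute budget. Alternatives, such as sampling contiguous clips from a single passage, trade coverage for temporal continuity.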
Taxonomy
Topics: Hand Gesture Recognition Systems · Music and Audio Processing · Music Technology and Sound Studies
