HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond
Shansong Liu, Xu Li, Dian Li, Ying Shan

TL;DR
HumTrans is the largest publicly available dataset of humming melodies, consisting of 56.22 hours of recordings from diverse compositions, designed to advance humming melody transcription and related music generation tasks.
Contribution
This paper introduces HumTrans, the largest open-source humming dataset, enabling improved research in melody transcription and music generation.
Findings
Largest humming dataset with 56.22 hours of recordings
Covers 500 compositions of different genres and languages, divided into 1000 segments
Provides baseline results and evaluation tools
Abstract
This paper introduces the HumTrans dataset, which is publicly available and primarily designed for humming melody transcription. The dataset can also serve as a foundation for downstream tasks such as humming-melody-based music generation. It consists of 500 musical compositions of different genres and languages, with each composition divided into multiple segments. In total, the dataset comprises 1000 music segments. To collect this humming dataset, we employed 10 college students, all of whom are either music majors or proficient in playing at least one musical instrument. Each of them hummed every segment twice using the web recording interface provided by our designed website. The humming recordings were sampled at 44,100 Hz. During the humming session, the main interface provides a musical score for students to reference, with the melody audio playing simultaneously…
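The two figures quoted above, a 44,100 Hz sampling rate and 56.22 hours of audio, can be checked against a local copy of the recordings. The following is a minimal sketch using only the Python standard library. It assumes the recordings are stored as uncompressed PCM WAV files somewhere under a single directory; the file format, the directory layout, and the function names `wav_duration_seconds` and `summarize` are illustrative assumptions, not details taken from the paper.

```python
import wave
from pathlib import Path

EXPECTED_RATE_HZ = 44_100  # sampling rate stated in the abstract


def wav_duration_seconds(path):
    """Return (duration in seconds, sample rate in Hz) for one PCM WAV file."""
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        return wav.getnframes() / rate, rate


def summarize(root):
    """Sum the durations of all .wav files under root.

    Assumption: recordings are PCM WAV files; the layout under root is
    hypothetical and searched recursively.
    """
    total_seconds = 0.0
    count = 0
    off_rate = []  # files whose sample rate differs from the expected one
    for path in sorted(Path(root).rglob("*.wav")):
        seconds, rate = wav_duration_seconds(path)
        total_seconds += seconds
        count += 1
        if rate != EXPECTED_RATE_HZ:
            off_rate.append(path)
    return {"files": count, "hours": total_seconds / 3600, "off_rate": off_rate}
```

Calling `summarize("path/to/recordings")` (a placeholder path) returns the file count, the total duration in hours, and a list of any files not sampled at 44,100 Hz, which can then be compared with the totals reported for the dataset.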
Taxonomy
Topics: Music and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies
