Development of Large Annotated Music Datasets using HMM-based Forced   Viterbi Alignment

S. Johanan Joysingh; P. Vijayalakshmi; T. Nagarajan

arXiv:2408.14890·eess.AS·August 28, 2024

Development of Large Annotated Music Datasets using HMM-based Forced Viterbi Alignment

S. Johanan Joysingh, P. Vijayalakshmi, T. Nagarajan

PDF

TL;DR

This paper introduces a method using HMM-based forced Viterbi alignment with predefined guitar exercises to efficiently generate annotated music datasets, facilitating automatic music transcription research.

Contribution

It presents a streamlined approach for creating instrument-specific datasets with precise transcriptions, especially for monophonic instruments like guitar.

Findings

01

Generated an acoustic guitar dataset with accurate time-aligned transcriptions.

02

Achieved average transcription accuracy within 5ms of manual verification.

03

Demonstrated the method's potential for efficient dataset creation for AMT systems.

Abstract

Datasets are essential for any machine learning task. Automatic Music Transcription (AMT) is one such task, where considerable amount of data is required depending on the way the solution is achieved. Considering the fact that a music dataset, complete with audio and its time-aligned transcriptions would require the effort of people with musical experience, it could be stated that the task becomes even more challenging. Musical experience is required in playing the musical instrument(s), and in annotating and verifying the transcriptions. We propose a method that would help in streamlining this process, making the task of obtaining a dataset from a particular instrument easy and efficient. We use predefined guitar exercises and hidden Markov model(HMM) based forced viterbi alignment to accomplish this. The guitar exercises are designed to be simple. Since the note sequence are already…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.