A Multimodal Machine Learning Framework for Teacher Vocal Delivery   Evaluation

Hang Li; Yu Kang; Yang Hao; Wenbiao Ding; Zhongqin Wu; Zitao Liu

arXiv:2107.07956·cs.SD·July 19, 2021

A Multimodal Machine Learning Framework for Teacher Vocal Delivery Evaluation

Hang Li, Yu Kang, Yang Hao, Wenbiao Ding, Zhongqin Wu, Zitao Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multimodal machine learning framework that objectively evaluates teacher vocal delivery by analyzing fluency and passion, addressing subjectivity and inefficiency in manual assessments.

Contribution

The study presents a novel pairwise comparison method with a multimodal orthogonal fusion algorithm for large-scale, objective vocal delivery evaluation.

Findings

01

Effective evaluation of vocal delivery achieved

02

Datasets collected from real-world education scenarios

03

Code made publicly available

Abstract

The quality of vocal delivery is one of the key indicators for evaluating teacher enthusiasm, which has been widely accepted to be connected to the overall course qualities. However, existing evaluation for vocal delivery is mainly conducted with manual ratings, which faces two core challenges: subjectivity and time-consuming. In this paper, we present a novel machine learning approach that utilizes pairwise comparisons and a multimodal orthogonal fusing algorithm to generate large-scale objective evaluation results of the teacher vocal delivery in terms of fluency and passion. We collect two datasets from real-world education scenarios and the experiment results demonstrate the effectiveness of our algorithm. To encourage reproducible results, we make our code public available at \url{https://github.com/tal-ai/ML4VocalDelivery.git}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tal-ai/ML4VocalDelivery
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Natural Language Processing Techniques · Topic Modeling