A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance Applications

Prateek Verma; Alessandro Ilic Mezza; Chris Chafe; Cristina Rottondi

arXiv:2007.07132 · cs.SD · July 15, 2020

TL;DR

This paper presents a deep learning-based method for real-time prediction and concealment of lost audio packets in networked music performance applications, aiming to improve audio quality under strict latency constraints.

Contribution

It introduces a novel deep learning approach for low-latency packet loss concealment specifically tailored for real-time networked music performance scenarios.

Findings

1. Effective real-time packet loss prediction demonstrated
2. Significant reduction in audio glitches observed
3. Improved perceived audio quality in tests

Abstract

Networked Music Performance (NMP) is envisioned as a potential game changer among Internet applications: it aims to revolutionize the traditional concept of musical interaction by enabling remote musicians to interact and perform together over a telecommunication network. Ensuring realistic conditions for music performance, however, is a significant engineering challenge because of extremely strict requirements on audio quality and, most importantly, network delay. To minimize the end-to-end delay experienced by the musicians, typical NMP implementations use uncompressed, bidirectional audio streams and rely on UDP as the transport protocol. Because UDP is connectionless and unreliable, audio packets lost in transit are not retransmitted and therefore cause glitches in the receiver's audio playout. This article describes a technique for…
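To make the problem in the abstract concrete, the following minimal sketch simulates an uncompressed audio stream sent over a lossy channel with no retransmission. The receiver simply plays silence for any packet that never arrived. This zero-fill receiver is only an illustrative baseline, not the concealment method of the paper, and the sample rate, packet size, loss rate, and all function names are assumptions chosen for the example.

```python
import math
import random

SAMPLE_RATE = 48_000  # Hz (assumed for illustration)
PACKET_SIZE = 64      # samples per packet (assumed for illustration)


def packetize(signal, packet_size=PACKET_SIZE):
    """Split a list of samples into (sequence_number, payload) packets."""
    return [
        (seq, signal[start:start + packet_size])
        for seq, start in enumerate(range(0, len(signal), packet_size))
    ]


def lossy_channel(packets, loss_rate, seed=0):
    """Drop each packet independently with probability loss_rate.

    Mimics a UDP-like channel: dropped packets are never retransmitted.
    """
    rng = random.Random(seed)
    return [p for p in packets if rng.random() >= loss_rate]


def playout_with_silence(received, total_packets, packet_size=PACKET_SIZE):
    """Rebuild the stream, writing zeros wherever a packet is missing."""
    by_seq = dict(received)
    out = []
    for seq in range(total_packets):
        payload = by_seq.get(seq)
        if payload is None:
            payload = [0.0] * packet_size  # audible gap (glitch)
        out.extend(payload)
    return out


# A 440 Hz tone exactly 100 packets long.
tone = [
    math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
    for n in range(PACKET_SIZE * 100)
]
packets = packetize(tone)
received = lossy_channel(packets, loss_rate=0.1)
output = playout_with_silence(received, total_packets=len(packets))

lost = len(packets) - len(received)
print(f"{lost} of {len(packets)} packets lost; "
      f"each leaves a {PACKET_SIZE}-sample gap")
```

Each missing packet becomes an abrupt block of silence in `output`, which is the kind of glitch a concealment technique would aim to replace with a plausible prediction of the lost samples.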
