TL;DR
This paper explores integrating IP broadcasting with live audio tagging by containerising an audio tagging model into a microservice, enhancing workflow flexibility and addressing latency challenges.
Contribution
It introduces a modular microservice for audio tagging in IP broadcasting workflows, emphasizing containerisation for flexible deployment across various production scales.
Findings
Containerised audio tagging model enables flexible integration.
Latency impacts the real-time usefulness of audio tagging.
Workflow improvements for diverse broadcasting environments.
Abstract
The broadcasting industry has adopted IP technologies, revolutionising both live and pre-recorded content production, from news gathering to live music events. IP broadcasting allows for the transport of audio and video signals in an easily configurable way, aligning with modern networking techniques. This shift towards an IP workflow allows for much greater flexibility, not only in routing signals but with the integration of tools using standard web development techniques. One possible tool could include the use of live audio tagging, which has a number of uses in the production of content. These could include adding sound effects to automated closed captioning or identifying unwanted sound events within a scene. In this paper, we describe the process of containerising an audio tagging model into a microservice, a small segregated code module that can be integrated into a multitude of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
