Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution
Akshat Dewan, Michal Ziemski, Henri Meylan, Lorenzo Concina, Bruno, Pouliquen

TL;DR
This paper introduces an end-to-end automated system for transcribing and translating international multilingual conference meetings, enhancing efficiency and replacing traditional workflows at WIPO.
Contribution
It presents a fully integrated, customized speech-to-text and machine translation system for multilingual meetings, demonstrating its architecture, development, and impact.
Findings
High accuracy in automated transcripts
Successful integration into conference workflows
Significant time and cost savings
Abstract
This paper presents an end-to-end solution for the creation of fully automated conference meeting transcripts and their machine translations into various languages. This tool has been developed at the World Intellectual Property Organization (WIPO) using in-house developed speech-to-text (S2T) and machine translation (MT) components. Beyond describing data collection and fine-tuning, resulting in a highly customized and robust system, this paper describes the architecture and evolution of the technical components as well as highlights the business impact and benefits from the user side. We also point out particular challenges in the evolution and adoption of the system and how the new approach created a new product and replaced existing established workflows in conference management documentation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques
