MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models
Yingxu He, Zhuohan Liu, Shuo Sun, Bin Wang, Wenyu Zhang, Xunlong Zou, Nancy F. Chen, Ai Ti Aw

TL;DR
MERaLiON-AudioLLM is a pioneering multilingual speech-text model designed for Singapore's diverse linguistic landscape, improving speech recognition and understanding in complex, multicultural environments.
Contribution
It is the first speech-text model tailored for Singapore's multilingual context, integrating advanced speech and text processing for localized AI applications.
Findings
- Enhanced speech recognition accuracy in multilingual settings
- Improved task-specific understanding for regional dialects
- Demonstrated effectiveness in complex, multicultural environments
Abstract
We introduce MERaLiON-AudioLLM (Multimodal Empathetic Reasoning and Learning in One Network), the first speech-text model tailored for Singapore's multilingual and multicultural landscape. Developed under the National Large Language Models Funding Initiative, Singapore, MERaLiON-AudioLLM integrates advanced speech and text processing to address the diverse linguistic nuances of local accents and dialects, enhancing accessibility and usability in complex, multilingual environments. Our results demonstrate improvements in both speech recognition and task-specific understanding, positioning MERaLiON-AudioLLM as a pioneering solution for region-specific AI applications. We envision this release setting a precedent for future models designed to address localised linguistic and cultural contexts in a global framework.
Code & Models
- 🤗 MERaLiON/MERaLiON-3-10B-preview (model): 322 downloads, 1 like
- 🤗 MERaLiON/MERaLiON-AudioLLM-Whisper-SEA-LION (model): 205 downloads, 29 likes
- 🤗 MERaLiON/MERaLiON-2-10B (model): 711 downloads, 11 likes
- 🤗 MERaLiON/MERaLiON-2-3B (model): 2.6k downloads, 5 likes
- 🤗 MERaLiON/MERaLiON-2-10B-ASR (model): 1.4k downloads, 10 likes
- 🤗 lewiswoncy/m_test_9 (model): 42 downloads
- 🤗 lewiswoncy/m_test_9_11 (model): 2 downloads
- 🤗 MERaLiON/MERaLiON-2-3B-MLX (model): 8 downloads
- 🤗 MERaLiON/MERaLiON-2-10B-MLX (model): 12 downloads
Taxonomy
Topics: Geophysical Methods and Applications
Methods: Sparse Evolutionary Training
