Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition

Yasaman Torabi; Shahram Shirani; James P. Reilly

arXiv:2507.09161·eess.AS·July 15, 2025

Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition

Yasaman Torabi, Shahram Shirani, James P. Reilly

PDF

Open Access

TL;DR

This paper introduces a hybrid bioacoustic signal analysis method combining matrix factorization and large language models to interpret overlapping clinical sounds without prior labels, aiding medical diagnosis.

Contribution

It presents a novel framework that integrates large language models with matrix factorization for unsupervised bioacoustic signal decomposition and interpretation.

Findings

01

Effective separation of overlapping bioacoustic signals

02

Association of acoustic patterns with medical conditions

03

No need for labeled data or prior source knowledge

Abstract

Large language models have shown a remarkable ability to extract meaning from unstructured data, offering new ways to interpret biomedical signals beyond traditional numerical methods. In this study, we present a matrix factorization framework for bioacoustic signal analysis which is enhanced by large language models. The focus is on separating bioacoustic signals that commonly overlap in clinical recordings, using matrix factorization to decompose the mixture into interpretable components. A large language model is then applied to the separated signals to associate distinct acoustic patterns with potential medical conditions such as cardiac rhythm disturbances or respiratory abnormalities. Recordings were obtained from a digital stethoscope applied to a clinical manikin to ensure a controlled and high-fidelity acquisition environment. This hybrid approach does not require labeled data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Fractal and DNA sequence analysis · Machine Learning in Bioinformatics