Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition
Yasaman Torabi, Shahram Shirani, James P. Reilly

TL;DR
This paper introduces a hybrid bioacoustic signal analysis method combining matrix factorization and large language models to interpret overlapping clinical sounds without prior labels, aiding medical diagnosis.
Contribution
It presents a novel framework that integrates large language models with matrix factorization for unsupervised bioacoustic signal decomposition and interpretation.
Findings
Effective separation of overlapping bioacoustic signals
Association of acoustic patterns with medical conditions
No need for labeled data or prior source knowledge
Abstract
Large language models have shown a remarkable ability to extract meaning from unstructured data, offering new ways to interpret biomedical signals beyond traditional numerical methods. In this study, we present a matrix factorization framework for bioacoustic signal analysis which is enhanced by large language models. The focus is on separating bioacoustic signals that commonly overlap in clinical recordings, using matrix factorization to decompose the mixture into interpretable components. A large language model is then applied to the separated signals to associate distinct acoustic patterns with potential medical conditions such as cardiac rhythm disturbances or respiratory abnormalities. Recordings were obtained from a digital stethoscope applied to a clinical manikin to ensure a controlled and high-fidelity acquisition environment. This hybrid approach does not require labeled data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Fractal and DNA sequence analysis · Machine Learning in Bioinformatics
