BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks
Ruize Yang, Ann Kennedy, R. James Cotton

TL;DR
BiomechGPT is a multimodal foundation model that integrates biomechanical movement data and language to perform clinically relevant movement analysis, enabling versatile and detailed assessments in healthcare settings.
Contribution
This work introduces BiomechGPT, the first multimodal biomechanics-language model trained on extensive movement data for diverse clinical movement tasks.
Findings
High accuracy in activity recognition
Effective identification of movement impairments
Reliable scoring of clinical outcomes
Abstract
Advances in markerless motion capture are expanding access to biomechanical movement analysis, making it feasible to obtain high-quality movement data from outpatient clinics, inpatient hospitals, therapy, and even home. Expanding access to movement data in these diverse contexts makes the challenge of performing downstream analytics all the more acute. Creating separate bespoke analysis code for all the tasks end users might want is both intractable and does not take advantage of the common features of human movement underlying them all. Recent studies have shown that fine-tuning language models to accept tokenized movement as an additional modality enables successful descriptive captioning of movement. Here, we explore whether such a multimodal motion-language model can answer detailed, clinically meaningful questions about movement. We collected over 30 hours of biomechanics from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusculoskeletal pain and rehabilitation
