A Framework for Multimodal Medical Image Interaction
Laura Sch\"utz, Sasan Matinfar, Gideon Schafroth, Navid Navab, Merle, Fairhurst, Arthur Wagner, Benedikt Wiestler, Ulrich Eck, Nassir Navab

TL;DR
This paper introduces a multimodal interaction framework using audiovisual feedback in virtual reality to improve spatial perception and accuracy in medical image analysis, especially for surgical procedures.
Contribution
It presents a novel VR-based multimodal framework with model-based sonification for enhanced medical image interaction, validated through user studies.
Findings
Improved spatial perception and localization accuracy.
Significant learning curve in audiovisual association.
Enhanced surgical decision-making support.
Abstract
Medical doctors rely on images of the human anatomy, such as magnetic resonance imaging (MRI), to localize regions of interest in the patient during diagnosis and treatment. Despite advances in medical imaging technology, the information conveyance remains unimodal. This visual representation fails to capture the complexity of the real, multisensory interaction with human tissue. However, perceiving multimodal information about the patient's anatomy and disease in real-time is critical for the success of medical procedures and patient outcome. We introduce a Multimodal Medical Image Interaction (MMII) framework to allow medical experts a dynamic, audiovisual interaction with human tissue in three-dimensional space. In a virtual reality environment, the user receives physically informed audiovisual feedback to improve the spatial perception of anatomical structures. MMII uses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques
