Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities
Rohit Mohan, Jos\'e Arce, Sassan Mokhtar, Daniele Cattaneo, Abhinav, Valada

TL;DR
Syn-Mediverse is a large, hyper-realistic synthetic multimodal dataset designed to improve scene understanding in healthcare facilities, supporting various computer vision tasks for autonomous healthcare robots.
Contribution
It introduces the first comprehensive synthetic dataset for healthcare environments, enabling training and benchmarking of models across multiple scene understanding tasks.
Findings
Dataset contains over 48,000 images and 1.5 million annotations.
Evaluations show the dataset's complexity and challenge for state-of-the-art models.
Provides an online benchmark for further research.
Abstract
Safety and efficiency are paramount in healthcare facilities where the lives of patients are at stake. Despite the adoption of robots to assist medical staff in challenging tasks such as complex surgeries, human expertise is still indispensable. The next generation of autonomous healthcare robots hinges on their capacity to perceive and understand their complex and frenetic environments. While deep learning models are increasingly used for this purpose, they require extensive annotated training data which is impractical to obtain in real-world healthcare settings. To bridge this gap, we present Syn-Mediverse, the first hyper-realistic multimodal synthetic dataset of diverse healthcare facilities. Syn-Mediverse contains over \num{48000} images from a simulated industry-standard optical tracking camera and provides more than 1.5M annotations spanning five different scene understanding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Advanced Neural Network Applications · Retinal Imaging and Analysis
