EMUSE: Evolutionary Map of the Universe Search Engine
Nikhel Gupta, Zeeshan Hayder, Minh Huynh, Ray P. Norris, Lars Petersson, Andrew M. Hopkins, Simone Riggi, B\"arbel S. Koribalski, and Miroslav D. Filipovi\'c

TL;DR
EMUSE is a multimodal search engine for radio sources in large astronomical datasets, utilizing fine-tuned foundation models to improve classification and retrieval of diverse radio galaxy morphologies.
Contribution
This work introduces EMUSE, a novel search engine that fine-tunes foundation models for multimodal radio source classification and retrieval in large-scale astronomical surveys.
Findings
Effective retrieval of radio sources with distinct morphological features
Demonstrated the model's ability to recognize various radio galaxy classes
Identified challenges in detecting rare or unseen sources
Abstract
We present EMUSE (Evolutionary Map of the Universe Search Engine), a tool designed for searching specific radio sources within the extensive datasets of the EMU (Evolutionary Map of the Universe) survey, with potential applications to other Big Data challenges in astronomy. Built on a multimodal approach to radio source classification and retrieval, EMUSE fine-tunes the OpenCLIP model on curated radio galaxy datasets. Leveraging the power of foundation models, our work integrates visual and textual embeddings to enable efficient and flexible searches within large radio astronomical datasets. We fine-tune OpenCLIP using a dataset of 2,900 radio galaxies, encompassing various morphological classes, including FR-I, FR-II, FR-x, R-type, and other rare and peculiar sources. The model is optimized using adapter-based fine-tuning, ensuring computational efficiency while capturing the unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Scientific Computing and Data Management · Distributed and Parallel Computing Systems
