A Geolocation-Aware Multimodal Approach for Ecological Prediction
Valerie Zermatten, Chiara Vanalli, Gencer Sumbul, Diego Marcos, Devis Tuia

TL;DR
This paper introduces GAMMA, a transformer-based model that effectively integrates heterogeneous ecological data sources with explicit spatial context, improving environmental prediction accuracy.
Contribution
GAMMA is a novel geolocation-aware multimodal fusion approach that combines diverse ecological data without interpolation, enhancing environmental modeling capabilities.
Findings
Multimodal fusion improves prediction accuracy over single modalities.
Explicit spatial context enhances model performance.
GAMMA effectively integrates remote sensing, biodiversity, and textual data.
Abstract
While integrating multiple modalities has the potential to improve environmental monitoring, current approaches struggle to combine data sources with heterogeneous formats or contents. A central difficulty arises when combining continuous gridded data (e.g., remote sensing) with sparse and irregular point observations such as species records. Existing geostatistical and deep-learning-based approaches typically operate on a single modality or focus on spatially aligned inputs, and thus cannot seamlessly overcome this difficulty. We propose a Geolocation-Aware MultiModal Approach (GAMMA), a transformer-based fusion approach designed to integrate heterogeneous ecological data using explicit spatial context. Instead of interpolating observations into a common grid, GAMMA first represents all inputs as location-aware embeddings that preserve spatial relationships between samples. GAMMA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Data-Driven Disease Surveillance · Multimodal Machine Learning Applications
