Clustering multivariate functional data using the epigraph and hypograph indices: a case study on Madrid air quality
Bel\'en Pulido, Alba M. Franco-Pereira, Rosa E. Lillo

TL;DR
This paper presents a new method for clustering multivariate functional data using epigraph and hypograph indices, effectively capturing variable interrelationships, with demonstrated success on simulated and environmental datasets including Madrid air quality.
Contribution
Introduces generalized epigraph and hypograph indices for multivariate functional data, enabling improved clustering by considering variable interdependencies.
Findings
Strong performance on simulated datasets
Effective clustering of environmental data
Useful for climate and environmental research
Abstract
With the rapid growth of data generation, advancements in functional data analysis (FDA) have become essential, especially for approaches that handle multiple variables at the same time. This paper introduces a novel formulation of the epigraph and hypograph indices, along with their generalized expressions, specifically designed for multivariate functional data (MFD). These new definitions account for interrelationships between variables, enabling effective clustering of MFD based on the original data curves and their first two derivatives. The methodology developed here has been tested on simulated datasets, demonstrating strong performance compared to state-of-the-art methods. Its practical utility is further illustrated with two environmental datasets: the Canadian weather dataset and a 2023 air quality study in Madrid. These applications highlight the potential of the method as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Gene expression and cancer classification · Complex Network Analysis Techniques
