RobPy: a Python Package for Robust Statistical Methods
Sarah Leyder, Jakob Raymaekers, Peter J. Rousseeuw, Thomas Servotte,, Tim Verdonck

TL;DR
RobPy is a comprehensive Python package that implements robust statistical methods, enabling effective outlier detection and analysis in data, filling a gap for Python users compared to existing R packages.
Contribution
This paper introduces RobPy, the first extensive Python package for robust statistical methods, integrating tools for outlier detection, robust estimation, and diagnostic visualization.
Findings
RobPy offers a wide range of robust methods compatible with NumPy, SciPy, and scikit-learn.
The package includes diagnostic tools for visualizing outliers and assessing robustness.
RobPy demonstrates comparable or superior performance to existing R packages in robust data analysis.
Abstract
Robust estimation provides essential tools for analyzing data that contain outliers, ensuring that statistical models remain reliable even in the presence of some anomalous data. While robust methods have long been available in R, users of Python have lacked a comprehensive package that offers these methods in a cohesive framework. RobPy addresses this gap by offering a wide range of robust methods in Python, built upon established libraries including NumPy, SciPy, and scikit-learn. This package includes tools for robust preprocessing, univariate estimation, covariance matrices, regression, and principal component analysis, which are able to detect outliers and to mitigate their effect. In addition, RobPy provides specialized diagnostic plots for visualizing casewise and cellwise outliers. This paper presents the structure of the RobPy package, demonstrates its functionality through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultidisciplinary Science and Engineering Research · Advanced Statistical Methods and Models
