Active Learning Methods for Efficient Hybrid Biophysical Variable Retrieval
ochem Verrelst, Sara Dethier, Juan Pablo Rivera, Jordi Mu\~noz-Mar\'i,, Gustau Camps-Valls, Jos\'e Moreno

TL;DR
This paper introduces six active learning methods integrated into a MATLAB toolbox to efficiently select training samples, significantly improving the accuracy of biophysical variable retrieval from remote sensing data with fewer samples.
Contribution
The paper presents novel active learning techniques tailored for biophysical variable retrieval, enhancing kernel-based machine learning regression efficiency with large datasets.
Findings
AL methods outperform random sampling in accuracy
Improved retrieval of leaf area index and chlorophyll content
Lower sampling rates achieve comparable or better results
Abstract
Kernel-based machine learning regression algorithms (MLRAs) are potentially powerful methods for being implemented into operational biophysical variable retrieval schemes. However, they face difficulties in coping with large training datasets. With the increasing amount of optical remote sensing data made available for analysis and the possibility of using a large amount of simulated data from radiative transfer models (RTMs) to train kernel MLRAs, efficient data reduction techniques will need to be implemented. Active learning (AL) methods enable to select the most informative samples in a dataset. This letter introduces six AL methods for achieving optimized biophysical variable estimation with a manageable training dataset, and their implementation into a Matlab-based MLRA toolbox for semi-automatic use. The AL methods were analyzed on their efficiency of improving the estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
