FSKD: Monocular Forest Structure Inference via LiDAR-to-RGBI Knowledge Distillation

Taimur Khan; Hannes Feilhauer; Muhammad Jazib Zafar

arXiv:2604.01766·cs.CV·April 3, 2026

FSKD: Monocular Forest Structure Inference via LiDAR-to-RGBI Knowledge Distillation

Taimur Khan, Hannes Feilhauer, Muhammad Jazib Zafar

PDF

TL;DR

This paper introduces FSKD, a knowledge distillation framework that enables monocular RGBI imagery to accurately infer detailed forest structure metrics traditionally obtained from costly LiDAR data, supporting scalable environmental monitoring.

Contribution

The novel FSKD method combines multi-modal fusion and asymmetric distillation to produce state-of-the-art monocular forest structure predictions, including CHM, PAI, and FHD, with improved accuracy and flexibility.

Findings

01

Student model achieves state-of-the-art zero-shot CHM performance.

02

Multi-modal fusion improves accuracy by 10-26% over RGBI-only training.

03

Method remains effective under temporal mismatch, enabling scalable monitoring.

Abstract

Very High Resolution (VHR) forest structure data at individual-tree scale is essential for carbon, biodiversity, and ecosystem monitoring. Still, airborne LiDAR remains costly and infrequent despite being the reference for forest structure metrics like Canopy Height Model (CHM), Plant Area Index (PAI), and Foliage Height Diversity (FHD). We propose FSKD: a LiDAR-to-RGB-Infrared (RGBI) knowledge distillation (KD) framework in which a multi-modal teacher fuses RGBI imagery with LiDAR-derived planar metrics and vertical profiles via cross-attention, and an RGBI-only SegFormer student learns to reproduce these outputs. Trained on 384 $k m^{2}$ of forests in Saxony, Germany (20 cm ground sampling distance (GSD)) and evaluated on eight geographically distinct test tiles, the student achieves state-of-the-art (SOTA) zero-shot CHM performance (MedAE 4.17 m, $R^{2}$ =0.51, IoU 0.87), outperforming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.