Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media
Martin Hohmann

TL;DR
This paper introduces BiLT-Autoencoder, a shift-invariant neural network architecture that accurately unmix spectral data in turbid media without calibration, using a novel attention-based encoder and physics-guided decoder.
Contribution
The work proposes a shift-invariant autoencoder with a cross-attention encoder and physics-constrained decoder, improving spectral unmixing robustness to calibration drift and hardware changes.
Findings
Achieves high R^2 (>0.97) for optical property recovery on benchmark data.
Maintains accuracy under spectral shifts of ±10 bands and broader instrument line shapes.
Reveals interpretable attention maps with physically meaningful probe strategies.
Abstract
The accurate recovery of constituent-level optical properties from integrating sphere measurements is a central analytical challenge in pharmaceutical analysis, food science, and biomedical diagnostics. Neural network autoencoders can extract spectrally resolved absorption and scattering coefficients for each constituent without prior knowledge, but their fully connected encoders bind learned features to absolute wavelength indices, causing accuracy loss under spectrometer calibration drift or hardware exchange. This work introduces the Bin Latent Transformer (BiLT)-Autoencoder, in which the dense encoder is replaced by a cross-attention scanner: 16 learnable probe vectors query a convolutional feature map, aggregating morphological spectral information independently of absolute wavelength position. A physics-constrained linear decoder with enforced absorption/scattering separation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
