TL;DR
AXON is a diffusion-based framework that reconstructs high-quality 3D CT volumes from real 2D X-ray images, improving diagnostic resolution and generalizability over existing methods.
Contribution
The paper introduces AXON, a novel multi-stage diffusion model that directly reconstructs 3D CT from real X-rays, incorporating a coarse-to-fine strategy and bi-planar input handling.
Findings
Outperforms state-of-the-art methods with 11.9% higher PSNR.
Achieves 11.0% better SSIM, indicating improved image quality.
Demonstrates strong generalization across different clinical datasets.
Abstract
Computed tomography (CT) provides rich 3D anatomical details but is often constrained by high radiation exposure, substantial costs, and limited availability. While standard chest X-rays are cost-effective and widely accessible, they only provide 2D projections with limited pathological information. Reconstructing 3D CT volumes from 2D X-rays offers a transformative solution to increase diagnostic accessibility, yet existing methods predominantly rely on synthetic X-ray projections, limiting clinical generalization. In this work, we propose AXON, a multi-stage diffusion-based framework that reconstructs high-fidelity 3D CT volumes directly from real X-rays. AXON employs a coarse-to-fine strategy, with a Brownian Bridge diffusion model-based initial stage for global structural synthesis, followed by a ControlNet-based refinement stage for local intensity optimization. It also supports…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
