MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement
Xiwei Deng, Xianchun He, Jianfeng Bao, Yudan Zhou, Shuhui Cai, Congbo Cai, Zhong Chen

TL;DR
This paper introduces MvKeTR, a novel transformer-based framework that mimics radiologists' multi-view analysis and knowledge retrieval to improve automatic chest CT report generation, outperforming existing models.
Contribution
The paper proposes a multi-view perception and knowledge enhancement approach using Kolmogorov-Arnold Networks, advancing CT report generation with better integration of diagnostic views and clinical knowledge.
Findings
Outperforms state-of-the-art models on CTRG-Chest-548 dataset
Effectively integrates multi-view diagnostic information
Utilizes KANs for improved parameter efficiency and high-frequency feature capture
Abstract
CT report generation (CTRG) aims to automatically generate diagnostic reports for 3D volumes, relieving clinicians' workload and improving patient care. Despite clinical value, existing works fail to effectively incorporate diagnostic information from multiple anatomical views and lack related clinical expertise essential for accurate and reliable diagnosis. To resolve these limitations, we propose a novel Multi-view perception Knowledge-enhanced TansfoRmer (MvKeTR) to mimic the diagnostic workflow of clinicians. Just as radiologists first examine CT scans from multiple planes, a Multi-View Perception Aggregator (MVPA) with view-aware attention is proposed to synthesize diagnostic information from multiple anatomical views effectively. Then, inspired by how radiologists further refer to relevant clinical records to guide diagnostic decision-making, a Cross-Modal Knowledge Enhancer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Lung Cancer Diagnosis and Treatment
MethodsAttention Is All You Need · Byte Pair Encoding · Dense Connections · Absolute Position Encodings · Dropout · Linear Layer · Softmax · Adam · Residual Connection · Multi-Head Attention
