Hybrid attention structure preserving network for reconstruction of under-sampled OCT images
Zezhao Guo, Zhanfang Zhao

TL;DR
This paper introduces HASPN, a novel neural network that enhances under-sampled OCT images by combining attention mechanisms and high-frequency detail reconstruction, improving image quality and speed.
Contribution
The paper proposes a hybrid attention network with a high-frequency branch for OCT image super-resolution, addressing limitations of CNNs in reconstructing fine details.
Findings
Outperforms mainstream methods in qualitative and quantitative metrics.
Demonstrates good generalization on diabetic macular edema dataset.
Effectively reconstructs retinal structures with enhanced detail.
Abstract
Optical coherence tomography (OCT) is a non-invasive, high-resolution imaging technology that provides cross-sectional images of tissues. Dense acquisition of A-scans along the fast axis is required to obtain high digital resolution images. However, the dense acquisition will increase the acquisition time, causing the discomfort of patients. In addition, the longer acquisition time may lead to motion artifacts, thereby reducing imaging quality. In this work, we proposed a hybrid attention structure preserving network (HASPN) to achieve super-resolution of under-sampled OCT images to speed up the acquisition. It utilized adaptive dilated convolution-based channel attention (ADCCA) and enhanced spatial attention (ESA) to better capture the channel and spatial information of the feature. Moreover, convolutional neural networks (CNNs) exhibit a higher sensitivity of low-frequency than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotoacoustic and Ultrasonic Imaging · Optical Coherence Tomography Applications · Advanced Image Fusion Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
