Frequency-domain Learning for Volumetric-based 3D Data Perception

Zifan Yu; Suya You; Fengbo Ren

arXiv:2302.08595·cs.CV·February 22, 2023

Frequency-domain Learning for Volumetric-based 3D Data Perception

Zifan Yu, Suya You, Fengbo Ren

PDF

Open Access

TL;DR

This paper explores frequency-domain learning for 3D volumetric data perception, demonstrating significant data size reduction with minimal accuracy loss and revealing spectral bias in 3D CNNs.

Contribution

It is the first study to analyze frequency-domain learning in 3D CNNs, showing its effectiveness in reducing input data size while maintaining accuracy.

Findings

01

Reduces 3D input data size by 98% with less than 2% accuracy drop.

02

Improves segmentation accuracy by 1.48% with 98% data reduction.

03

Enhances accuracy and IoU by over 3% when using higher-resolution data.

Abstract

Frequency-domain learning draws attention due to its superior tradeoff between inference accuracy and input data size. Frequency-domain learning in 2D computer vision tasks has shown that 2D convolutional neural networks (CNN) have a stationary spectral bias towards low-frequency channels so that high-frequency channels can be pruned with no or little accuracy degradation. However, frequency-domain learning has not been studied in the context of 3D CNNs with 3D volumetric data. In this paper, we study frequency-domain learning for volumetric-based 3D data perception to reveal the spectral bias and the accuracy-input-data-size tradeoff of 3D CNNs. Our study finds that 3D CNNs are sensitive to a limited number of critical frequency channels, especially low-frequency channels. Experiment results show that frequency-domain learning can significantly reduce the size of volumetric-based 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Optical measurement and interference techniques