Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework   on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous   Modalities

Runwei Guan; Haocheng Zhao; Shanliang Yao; Ka Lok Man; Xiaohui Zhu,; Limin Yu; Yong Yue; Jeremy Smith; Eng Gee Lim; Weiping Ding; Yutao Yue

arXiv:2312.08851·cs.CV·December 15, 2023·1 cites

Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities

Runwei Guan, Haocheng Zhao, Shanliang Yao, Ka Lok Man, Xiaohui Zhu,, Limin Yu, Yong Yue, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

PDF

Open Access 1 Repo

TL;DR

Achelous++ is a low-power, multi-task perception framework for water-surface environments that fuses vision and radar data, enabling efficient, real-time aquatic monitoring on edge devices with novel pruning strategies.

Contribution

The paper introduces Achelous++, a framework for multi-task water-surface perception combining vision and radar, with a novel multi-modal pruning method for low-power edge deployment.

Findings

01

Achieves state-of-the-art accuracy on WaterScenes benchmark.

02

Demonstrates high speed and low power consumption for multiple perception tasks.

03

Supports customizable pruning strategies for real-time inference on low-performance devices.

Abstract

Urban water-surface robust perception serves as the foundation for intelligent monitoring of aquatic environments and the autonomous navigation and operation of unmanned vessels, especially in the context of waterway safety. It is worth noting that current multi-sensor fusion and multi-task learning models consume substantial power and heavily rely on high-power GPUs for inference. This contributes to increased carbon emissions, a concern that runs counter to the prevailing emphasis on environmental preservation and the pursuit of sustainable, low-carbon urban environments. In light of these concerns, this paper concentrates on low-power, lightweight, multi-task panoptic perception through the fusion of visual and 4D radar data, which is seen as a promising low-cost perception method. We propose a framework named Achelous++ that facilitates the development and comprehensive evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GuanRunwei/Achelous
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUnderwater Acoustics Research · Underwater Vehicles and Communication Systems · Maritime Navigation and Safety

MethodsPruning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings