Semantic Segmentation of Fruits on Multi-sensor Fused Data in Natural Orchards
Hanwen Kang, Xing Wang

TL;DR
This paper presents a deep learning method for accurate semantic segmentation of fruits in natural orchards by fusing LiDAR and camera data, addressing challenges of data imbalance and unstructured environments.
Contribution
It introduces a novel approach for multi-sensor data fusion and training in unstructured orchard environments, achieving high segmentation accuracy.
Findings
Achieved 86.2% mIoU on fruit segmentation
Effectively fused texture and geometrical features from LiDAR and camera data
Demonstrated robustness in noisy, unstructured orchard environments
Abstract
Semantic segmentation is a fundamental task for agricultural robots to understand the surrounding environments in natural orchards. The recent development of the LiDAR techniques enables the robot to acquire accurate range measurements of the view in the unstructured orchards. Compared to RGB images, 3D point clouds have geometrical properties. By combining the LiDAR and camera, rich information on geometries and textures can be obtained. In this work, we propose a deep-learning-based segmentation method to perform accurate semantic segmentation on fused data from a LiDAR-Camera visual sensor. Two critical problems are explored and solved in this work. The first one is how to efficiently fused the texture and geometrical features from multi-sensor data. The second one is how to efficiently train the 3D segmentation network under severely imbalance class conditions. Moreover, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Remote Sensing and LiDAR Applications · Horticultural and Viticultural Research
