HitoMi-Cam: A Shape-Agnostic Person Detection Method Using the Spectral Characteristics of Clothing
Shuji Ono

TL;DR
HitoMi-Cam is a real-time, shape-agnostic person detection method leveraging spectral clothing properties, demonstrating high accuracy and efficiency on resource-limited devices, especially useful in unpredictable environments like disaster rescue.
Contribution
This paper introduces HitoMi-Cam, a novel spectral-based person detection system that operates efficiently on edge devices and performs well in shape-agnostic scenarios, complementing CNN methods.
Findings
Achieves 23.2 fps on resource-constrained hardware.
Surpasses CNNs with 93.5% average precision in rescue scenarios.
Maintains minimal false positives across evaluations.
Abstract
While convolutional neural network (CNN)-based object detection is widely used, it exhibits a shape dependency that degrades performance for postures not included in the training data. Building upon our previous simulation study published in this journal, this study implements and evaluates the spectral-based approach on physical hardware to address this limitation. Specifically, this paper introduces HitoMi-Cam, a lightweight and shape-agnostic person detection method that uses the spectral reflectance properties of clothing. The author implemented the system on a resource-constrained edge device without a GPU to assess its practical viability. The results indicate that a processing speed of 23.2 frames per second (fps) (253x190 pixels) is achievable, suggesting that the method can be used for real-time applications. In a simulated search and rescue scenario where the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Gait Recognition and Analysis · Human Pose and Action Recognition
