XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling
Tony Tran, Richie R. Suganda, Bin Hu

TL;DR
XiYOLO introduces an energy-aware neural architecture search framework that optimizes object detection models for edge devices by balancing accuracy and energy consumption, with strong results on standard datasets.
Contribution
The paper proposes a novel energy-adaptive search framework combining a new search space, a two-stage energy estimator, and iterative search for efficient architecture design.
Findings
XiYOLO achieves 20-54% energy reduction over YOLO baselines.
The medium XiYOLO model reaches 86.15 mAP50 on PascalVOC.
The two-stage estimator improves sample efficiency in device-specific adaptation.
Abstract
Object detection on heterogeneous edge devices must satisfy strict energy, latency, and memory constraints while still providing reliable perception for downstream autonomy. Existing energy-aware NAS methods often target limited deployment settings, while real energy remains difficult to optimize because it is highly device-dependent and costly to measure. We address these challenges with an energy-adaptive framework that combines an energy-aware XiResOFA search space, a two-stage energy estimator, and iterative search to identify a single energy-efficient base architecture. We then apply compound scaling to transform this base design into the XiYOLO family across deployment budgets, enabling interpretable accuracy-energy tradeoffs under sparse hardware measurements. Experiments on PascalVOC, COCO, and real-device deployment show that XiYOLO achieves a stronger energy-accuracy tradeoff…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
