NNStreamer: Efficient and Agile Development of On-Device AI Systems
MyungJoo Ham, Jijoong Moon, Geunsik Lim, Jaeyun Jung, Hyoungjoo Ahn,, Wook Song, Sangjung Woo, Parichay Kapoor, Dongju Chae, Gichan Jang, Yongjoo, Ahn, Jihoon Lee

TL;DR
NNStreamer is an open-source framework that streamlines on-device AI development by integrating neural networks into stream pipelines, improving performance, reducing costs, and simplifying implementation across diverse devices.
Contribution
It introduces a novel stream processing approach for neural networks, enabling efficient, flexible, and reusable on-device AI system development.
Findings
Significant performance improvements in on-device neural network processing
Reduction in development costs for AI applications
Successful deployment across multiple consumer devices
Abstract
We propose NNStreamer, a software system that handles neural networks as filters of stream pipelines, applying the stream processing paradigm to deep neural network applications. A new trend with the wide-spread of deep neural network applications is on-device AI. It is to process neural networks on mobile devices or edge/IoT devices instead of cloud servers. Emerging privacy issues, data transmission costs, and operational costs signify the need for on-device AI, especially if we deploy a massive number of devices. NNStreamer efficiently handles neural networks with complex data stream pipelines on devices, significantly improving the overall performance with minimal efforts. Besides, NNStreamer simplifies implementations and allows reusing off-the-shelf media filters directly, which reduces developmental costs significantly. We are already deploying NNStreamer for a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · IoT and Edge/Fog Computing
