SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis
Naili Xing, Sai Ho Yeung, Chenghao Cai, Teck Khim Ng, Wei Wang,, Kaiyuan Yang, Nan Yang, Meihui Zhang, Gang Chen, Beng Chin Ooi

TL;DR
SINGA-Easy is a user-friendly, adaptable deep learning framework designed for multi-modal data analysis, offering distributed hyper-parameter tuning, dynamic inference cost control, and intuitive multimedia interactions.
Contribution
It introduces a novel framework that enhances usability and adaptability for multimedia applications through distributed tuning, cost control, and model explanation features.
Findings
Framework is usable for multimedia applications.
Supports dynamic workload adaptation.
Effective in training and deployment of multi-modal data analysis.
Abstract
Deep learning has achieved great success in a wide spectrum of multimedia applications such as image classification, natural language processing and multimodal data analysis. Recent years have seen the development of many deep learning frameworks that provide a high-level programming interface for users to design models, conduct training and deploy inference. However, it remains challenging to build an efficient end-to-end multimedia application with most existing frameworks. Specifically, in terms of usability, it is demanding for non-experts to implement deep learning models, obtain the right settings for the entire machine learning pipeline, manage models and datasets, and exploit external data sources all together. Further, in terms of adaptability, elastic computation solutions are much needed as the actual serving workload fluctuates constantly, and scaling the hardware resources…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
