A comprehensive study of on-device NLP applications -- VQA, automated Form filling, Smart Replies for Linguistic Codeswitching
Naman Goyal

TL;DR
This paper introduces new on-device NLP applications leveraging large language models, including visual question answering, automated form filling, and multilingual smart replies for code-switching, aiming to bridge research and real-world impact.
Contribution
It proposes novel on-device NLP experiences for visual understanding and multilingual communication, addressing gaps in current research and practical deployment.
Findings
First work to address on-device VQA and form filling.
Introduces smart replies supporting multilingual code-switching.
Bridges research with real-world on-device NLP applications.
Abstract
Recent improvement in large language models, open doors for certain new experiences for on-device applications which were not possible before. In this work, we propose 3 such new experiences in 2 categories. First we discuss experiences which can be powered in screen understanding i.e. understanding whats on user screen namely - (1) visual question answering, and (2) automated form filling based on previous screen. The second category of experience which can be extended are smart replies to support for multilingual speakers with code-switching. Code-switching occurs when a speaker alternates between two or more languages. To the best of our knowledge, this is first such work to propose these tasks and solutions to each of them, to bridge the gap between latest research and real world impact of the research in on-device applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Natural Language Processing Techniques · Software Engineering Research
