Just ASK: Building an Architecture for Extensible Self-Service Spoken   Language Understanding

Anjishnu Kumar; Arpit Gupta; Julian Chan; Sam Tucker; Bjorn; Hoffmeister; Markus Dreyer; Stanislav Peshterliev; Ankur Gandhe; Denis; Filiminov; Ariya Rastrow; Christian Monson; Agnika Kumar

arXiv:1711.00549·cs.CL·March 5, 2018·56 cites

Just ASK: Building an Architecture for Extensible Self-Service Spoken Language Understanding

Anjishnu Kumar, Arpit Gupta, Julian Chan, Sam Tucker, Bjorn, Hoffmeister, Markus Dreyer, Stanislav Peshterliev, Ankur Gandhe, Denis, Filiminov, Ariya Rastrow, Christian Monson, Agnika Kumar

PDF

Open Access

TL;DR

This paper details the architecture of the Alexa Skills Kit, a scalable and flexible SLU SDK that enables rapid development of voice skills for Alexa, supporting thousands of skills and small datasets.

Contribution

It introduces a machine learning architecture that facilitates extensible, robust SLU models from minimal data, enhancing developer accessibility and rapid iteration.

Findings

01

Supports over 25,000 skills deployed on Alexa

02

Learns robust SLU models from small, sparse datasets

03

Enables rapid development and iteration for third-party developers

Abstract

This paper presents the design of the machine learning architecture that underlies the Alexa Skills Kit (ASK) a large scale Spoken Language Understanding (SLU) Software Development Kit (SDK) that enables developers to extend the capabilities of Amazon's virtual assistant, Alexa. At Amazon, the infrastructure powers over 25,000 skills deployed through the ASK, as well as AWS's Amazon Lex SLU Service. The ASK emphasizes flexibility, predictability and a rapid iteration cycle for third party developers. It imposes inductive biases that allow it to learn robust SLU models from extremely small and sparse datasets and, in doing so, removes significant barriers to entry for software developers and dialogue systems researchers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems