# Towards a Knowledge Graph based Speech Interface

**Authors:** Ashwini Jaya Kumar, S\"oren Auer, Christoph Schmidt, Joachim k\"ohler

arXiv: 1705.09222 · 2017-05-26

## TL;DR

This paper proposes a methodology to connect speech recognition outputs to knowledge graphs, enhancing semantic understanding for applications like question answering and dialogue systems, and analyzes the impact of recognition errors.

## Contribution

It introduces a novel approach for linking speech input to knowledge graphs using DBpedia Spotlight, and studies how recognition errors affect this process.

## Key findings

- Higher word error rate reduces entity linking accuracy
- Using DBpedia Spotlight improves semantic annotation of speech
- Knowledge graph linking enhances speech interface capabilities

## Abstract

Applications which use human speech as an input require a speech interface with high recognition accuracy. The words or phrases in the recognised text are annotated with a machine-understandable meaning and linked to knowledge graphs for further processing by the target application. These semantic annotations of recognised words can be represented as a subject-predicate-object triples which collectively form a graph often referred to as a knowledge graph. This type of knowledge representation facilitates to use speech interfaces with any spoken input application, since the information is represented in logical, semantic form, retrieving and storing can be followed using any web standard query languages. In this work, we develop a methodology for linking speech input to knowledge graphs and study the impact of recognition errors in the overall process. We show that for a corpus with lower WER, the annotation and linking of entities to the DBpedia knowledge graph is considerable. DBpedia Spotlight, a tool to interlink text documents with the linked open data is used to link the speech recognition output to the DBpedia knowledge graph. Such a knowledge-based speech recognition interface is useful for applications such as question answering or spoken dialog systems.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.09222/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1705.09222/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/1705.09222/full.md

---
Source: https://tomesphere.com/paper/1705.09222