# Dolphin: A Spoken Language Proficiency Assessment System for Elementary   Education

**Authors:** Wenbiao Ding, Guowei Xu, Tianqiao Liu, Weiping Fu, Yujia Song, Chaoyou, Guo, Cong Kong, Songfan Yang, Gale Yan Huang, Zitao Liu

arXiv: 1908.00358 · 2020-01-30

## TL;DR

Dolphin is an automated spoken language proficiency assessment system designed for elementary education in China, improving grading efficiency and accuracy for verbal fluency tasks through advanced evaluation techniques.

## Contribution

The paper introduces Dolphin, a novel system that automatically assesses phonological fluency and semantic relevance in students' spoken answers, reducing teachers' grading burden.

## Key findings

- Dolphin outperforms state-of-the-art baselines in evaluation accuracy.
- Online experiments show a 22% increase in grading coverage.
- Dolphin effectively evaluates spoken language proficiency in real-world settings.

## Abstract

Spoken language proficiency is critically important for children's growth and personal development. Due to the limited and imbalanced educational resources in China, elementary students barely have chances to improve their oral language skills in classes. Verbal fluency tasks (VFTs) were invented to let the students practice their spoken language proficiency after school. VFTs are simple but concrete math related questions that ask students to not only report answers but speak out the entire thinking process. In spite of the great success of VFTs, they bring a heavy grading burden to elementary teachers. To alleviate this problem, we develop Dolphin, a spoken language proficiency assessment system for Chinese elementary education. Dolphin is able to automatically evaluate both phonological fluency and semantic relevance of students' VFT answers. We conduct a wide range of offline and online experiments to demonstrate the effectiveness of Dolphin. In our offline experiments, we show that Dolphin improves both phonological fluency and semantic relevance evaluation performance when compared to state-of-the-art baselines on real-world educational data sets. In our online A/B experiments, we test Dolphin with 183 teachers from 2 major cities (Hangzhou and Xi'an) in China for 10 weeks and the results show that VFT assignments grading coverage is improved by 22\%.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.00358/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1908.00358/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1908.00358/full.md

---
Source: https://tomesphere.com/paper/1908.00358