Building a Swedish Open-Domain Conversational Language Model

Tobias Norlund; Agnes Stenbom

arXiv:2104.05277·cs.CL·April 13, 2021

Building a Swedish Open-Domain Conversational Language Model

Tobias Norlund, Agnes Stenbom

PDF

Open Access 1 Repo

TL;DR

This paper introduces a large Swedish conversational language model trained on online forum data, demonstrating promising human-like and informative responses, while also discussing ethical considerations and safety measures.

Contribution

The paper presents the first large Swedish conversational model trained on online forum data and evaluates its conversational abilities through human assessment.

Findings

01

Model responds in a human-like manner

02

Model covers diverse topics effectively

03

Highlights ethical considerations in deployment

Abstract

We present on-going work of evaluating the, to our knowledge, first large generative language model trained to converse in Swedish, using data from the online discussion forum Flashback. We conduct a human evaluation pilot study that indicates the model is often able to respond to conversations in both a human-like and informative manner, on a diverse set of topics. While data from online forums can be useful to build conversational systems, we reflect on the negative consequences that incautious application might have, and the need for taking active measures to safeguard against them.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TobiasNorlund/flashback-gpt
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems