Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk

Benyou Wang; Xiangbo Wu; Xiaokang Liu; Jianquan Li; Prayag Tiwari; Qianqian Xie

arXiv:2207.00735·cs.CL·July 5, 2022

TL;DR

This paper investigates the capability of pre-trained language models to generate Chinese comedic crosstalk scripts, highlighting improvements with large-scale models but also emphasizing the current limitations in humor quality.

Contribution

It introduces a new Chinese crosstalk dataset and benchmarks various language models for humor generation, providing insights into current capabilities and challenges.

Findings

01 Large-scale pretraining improves crosstalk quality.

02 Generated scripts reach 65% of human quality.

03 Humor generation remains at an early stage of development.

Abstract

Language is the principal tool for human communication, and humor is one of its most attractive parts. Producing human-like natural language with computers, a.k.a. Natural Language Generation (NLG), has been widely used for dialogue systems, chatbots, and machine translation, as well as computer-aided creation, e.g., idea generation and scriptwriting. However, the humor aspect of natural language is relatively under-investigated, especially in the age of pre-trained language models. In this work, we aim to preliminarily test whether NLG can generate humor as humans do. We build a new dataset consisting of numerous digitized Chinese Comical Crosstalk scripts (called $C^{3}$ for short), drawn from a Chinese performing art called 'Xiangsheng' that has been popular since the 1800s. (For the convenience of non-Chinese speakers, we refer to 'Xiangsheng' as 'crosstalk' in this paper.) We benchmark various generation…

Code & Models

Repositories

anonno2/crosstalk-generation (PyTorch, official)

Taxonomy

Topics: Multimodal Machine Learning Applications

Methods: Test