Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
Benyou Wang, Xiangbo Wu, Xiaokang Liu, Jianquan Li, Prayag Tiwari,, Qianqian Xie

TL;DR
This paper investigates the capability of pre-trained language models to generate Chinese comedic crosstalk scripts, highlighting improvements with large-scale models but also emphasizing the current limitations in humor quality.
Contribution
It introduces a new Chinese crosstalk dataset and benchmarks various language models for humor generation, providing insights into current capabilities and challenges.
Findings
Large-scale pretraining improves crosstalk quality.
Generated scripts reach 65% of human quality.
Humor generation remains in early development stage.
Abstract
Language is the principal tool for human communication, in which humor is one of the most attractive parts. Producing natural language like humans using computers, a.k.a, Natural Language Generation (NLG), has been widely used for dialogue systems, chatbots, machine translation, as well as computer-aid creation e.g., idea generations, scriptwriting. However, the humor aspect of natural language is relatively under-investigated, especially in the age of pre-trained language models. In this work, we aim to preliminarily test whether NLG can generate humor as humans do. We build a new dataset consisting of numerous digitized Chinese Comical Crosstalk scripts (called C in short), which is for a popular Chinese performing art called `Xiangsheng' since 1800s. (For convenience for non-Chinese speakers, we called `crosstalk' for `Xiangsheng' in this paper.) We benchmark various generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications
MethodsTest
