Is ChatGPT a Good Multi-Party Conversation Solver?
Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling

TL;DR
This paper evaluates the capabilities of ChatGPT and GPT-4 in multi-party conversations, highlighting current limitations and potential improvements through structural enhancements, with GPT-4 showing promising results.
Contribution
It provides an empirical assessment of LLMs in multi-party conversations and explores structural modifications to improve their performance.
Findings
ChatGPT performs poorly on MPC tasks
GPT-4 shows promising results in MPC scenarios
Structural enhancements can improve LLMs' MPC handling
Abstract
Large Language Models (LLMs) have emerged as influential instruments within the realm of natural language processing; nevertheless, their capacity to handle multi-party conversations (MPCs) -- a scenario marked by the presence of multiple interlocutors involved in intricate information exchanges -- remains uncharted. In this paper, we delve into the potential of generative LLMs such as ChatGPT and GPT-4 within the context of MPCs. An empirical analysis is conducted to assess the zero-shot learning capabilities of ChatGPT and GPT-4 by subjecting them to evaluation across three MPC datasets that encompass five representative tasks. The findings reveal that ChatGPT's performance on a number of evaluated MPC tasks leaves much to be desired, whilst GPT-4's results portend a promising future. Additionally, we endeavor to bolster performance through the incorporation of MPC structures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Byte Pair Encoding · Dropout · Layer Normalization
