There is no rose without a thorn: Finding weaknesses on BlenderBot 2.0 in terms of Model, Data and User-Centric Approach
Jungseob Lee, Midan Shim, Suhyune Son, Chanjun Park, Yujin Kim,, Heuiseok Lim

TL;DR
This paper critically analyzes BlenderBot 2.0's limitations across model, data, and user perspectives, highlighting areas for improvement and proposing practical solutions and future research directions.
Contribution
It provides a comprehensive critique of BlenderBot 2.0's weaknesses and offers targeted improvement strategies from multiple viewpoints.
Findings
Data collection issues, including unclear guidelines and lack of hate speech refinement.
Identification of nine user-related limitations and their causes.
Discussion of potential future research directions.
Abstract
BlenderBot 2.0 is a dialogue model that represents open-domain chatbots by reflecting real-time information and remembering user information for an extended period using an internet search module and multi-session. Nonetheless, the model still has room for improvement. To this end, we examine BlenderBot 2.0 limitations and errors from three perspectives: model, data, and user. From the data point of view, we highlight the unclear guidelines provided to workers during the crowdsourcing process, as well as a lack of a process for refining hate speech in the collected data and verifying the accuracy of internet-based information. From a user perspective, we identify nine types of limitations of BlenderBot 2.0, and their causes are thoroughly investigated. Furthermore, for each point of view, we propose practical improvement methods and discuss several potential future research directions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · AI in Service Interactions · Spam and Phishing Detection
