Performance Comparison of Large Language Models on VNHSGE English   Dataset: OpenAI ChatGPT, Microsoft Bing Chat, and Google Bard

Xuan-Quy Dao

arXiv:2307.02288·cs.CL·July 21, 2023·26 cites

Performance Comparison of Large Language Models on VNHSGE English Dataset: OpenAI ChatGPT, Microsoft Bing Chat, and Google Bard

Xuan-Quy Dao

PDF

Open Access

TL;DR

This study compares the performance of OpenAI ChatGPT, Microsoft Bing Chat, and Google Bard on the VNHSGE English dataset, showing BingChat's superior results and highlighting their potential in English education.

Contribution

It provides a comparative analysis of major LLMs on a Vietnamese English dataset, demonstrating their effectiveness in educational contexts.

Findings

01

BingChat outperforms ChatGPT and Bard in English proficiency tests.

02

All three LLMs outperform Vietnamese students in English.

03

BingChat and Bard can replace ChatGPT in Vietnam.

Abstract

This paper presents a performance comparison of three large language models (LLMs), namely OpenAI ChatGPT, Microsoft Bing Chat (BingChat), and Google Bard, on the VNHSGE English dataset. The performance of BingChat, Bard, and ChatGPT (GPT-3.5) is 92.4\%, 86\%, and 79.2\%, respectively. The results show that BingChat is better than ChatGPT and Bard. Therefore, BingChat and Bard can replace ChatGPT while ChatGPT is not yet officially available in Vietnam. The results also indicate that BingChat, Bard and ChatGPT outperform Vietnamese students in English language proficiency. The findings of this study contribute to the understanding of the potential of LLMs in English language education. The remarkable performance of ChatGPT, BingChat, and Bard demonstrates their potential as effective tools for teaching and learning English at the high school level.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Online Learning and Analytics