Critical Insights into Leading Conversational AI Models
Urja Kohli (1), Aditi Singh (2), Arun Sharma (3) ((1) Department of Mechanical, Automation Engineering, Indira Gandhi Delhi Technical University for Women, Delhi, India, (2) Department of Electronics, Communication Engineering, Indira Gandhi Delhi Technical University for Women

TL;DR
This paper compares five leading LLMs across performance, ethics, and usability, highlighting their unique strengths and suggesting tailored applications to maximize their benefits.
Contribution
It provides a comprehensive analysis of top LLMs, detailing their differences in performance, ethical behavior, and usability, guiding better model selection and deployment.
Findings
Claude excels in moral reasoning
Gemini has superior multimodal capabilities
DeepSeek is strong in factual reasoning
Abstract
Big Language Models (LLMs) are changing the way businesses use software, the way people live their lives and the way industries work. Companies like Google, High-Flyer, Anthropic, OpenAI and Meta are making better LLMs. So, it's crucial to look at how each model is different in terms of performance, moral behaviour and usability, as these differences are based on the different ideas that built them. This study compares five top LLMs: Google's Gemini, High-Flyer's DeepSeek, Anthropic's Claude, OpenAI's GPT models and Meta's LLaMA. It performs this by analysing three important factors: Performance and Accuracy, Ethics and Bias Mitigation and Usability and Integration. It was found that Claude has good moral reasoning, Gemini is better at multimodal capabilities and has strong ethical frameworks. DeepSeek is great at reasoning based on facts, LLaMA is good for open applications and ChatGPT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
