From Data to Commonsense Reasoning: The Use of Large Language Models for Explainable AI
Stefanie Krause, Frieder Stolzenburg

TL;DR
This paper evaluates large language models like GPT-3.5, Gemma, and Llama 3 on commonsense reasoning and explainability in question answering tasks, showing LLMs can outperform humans and provide good explanations.
Contribution
It demonstrates the reasoning and explainability capabilities of LLMs on QA benchmarks, highlighting Llama 3's superior accuracy and GPT-3.5's effective explanations.
Findings
Llama 3 outperforms humans on all datasets with 21% higher accuracy.
GPT-3.5 achieves 56-93% accuracy on QA benchmarks.
66% of participants rated GPT-3.5's explanations as good or excellent.
Abstract
Commonsense reasoning is a difficult task for a computer, but a critical skill for an artificial intelligence (AI). It can enhance the explainability of AI models by enabling them to provide intuitive and human-like explanations for their decisions. This is necessary in many areas especially in question answering (QA), which is one of the most important tasks of natural language processing (NLP). Over time, a multitude of methods have emerged for solving commonsense reasoning problems such as knowledge-based approaches using formal logic or linguistic analysis. In this paper, we investigate the effectiveness of large language models (LLMs) on different QA tasks with a focus on their abilities in reasoning and explainability. We study three LLMs: GPT-3.5, Gemma and Llama 3. We further evaluate the LLM results by means of a questionnaire. We demonstrate the ability of LLMs to reason with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Adam · Dropout · Dense Connections · Weight Decay · Multi-Head Attention
