Large Language Models Are Also Good Prototypical Commonsense Reasoners
Chenin Li, Qianglong Chen, Yin Zhang, Yifei Zhang, Hongxiang Yao

TL;DR
This paper demonstrates that carefully designed prompts can enable large language models to excel at commonsense reasoning tasks, achieving new state-of-the-art results without fine-tuning.
Contribution
The authors introduce novel prompt engineering techniques, including task relevance and supportive evidence generation, to enhance large language models' reasoning capabilities.
Findings
Achieved new SOTA on ProtoQA with an 8% improvement in Max Answer@1
Surpassed previous models on StrategyQA and CommonsenseQA2.0 by 3% and 1%
Enhanced interpretability through Chain-of-Thought and knowledge prompts
Abstract
Commonsense reasoning is a pivotal skill for large language models, yet it presents persistent challenges in specific tasks requiring this competence. Traditional fine-tuning approaches can be resource-intensive and potentially compromise a model's generalization capacity. Furthermore, state-of-the-art language models like GPT-3.5 and Claude are primarily accessible through API calls, which makes fine-tuning models challenging. To address these challenges, we draw inspiration from the outputs of large models for tailored tasks and semi-automatically developed a set of novel prompts from several perspectives, including task-relevance, supportive evidence generation (e.g. chain-of-thought and knowledge), diverse path decoding to aid the model. Experimental results on ProtoQA dataset demonstrate that with better designed prompts we can achieve the new state-of-art(SOTA) on the ProtoQA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Residual Connection · Attention Dropout · Adam · Layer Normalization · Dropout
