Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning
Duc Hieu Ho, Chenglin Fan

TL;DR
This paper introduces a self-critique-guided prompting method that enhances honesty and helpfulness in large language models without additional training, demonstrated through comprehensive benchmarking and significant quality improvements.
Contribution
It proposes a novel, training-free prompting strategy that enables LLMs to self-critique and refine responses, improving output trustworthiness across multiple models.
Findings
Consistent improvements in honesty and helpfulness scores across models.
Reduction in poor-quality responses and increase in high-quality outputs.
Relative gains in H2 scores ranging from 1.4% to 4.3%.
Abstract
Large language models (LLMs) have demonstrated robust capabilities across various natural language tasks. However, producing outputs that are consistently honest and helpful remains an open challenge. To overcome this challenge, this paper tackles the problem through two complementary directions. It conducts a comprehensive benchmark evaluation of ten widely used large language models, including both proprietary and open-weight models from OpenAI, Meta, and Google. In parallel, it proposes a novel prompting strategy, self-critique-guided curiosity refinement prompting. The key idea behind this strategy is enabling models to self-critique and refine their responses without additional training. The proposed method extends the curiosity-driven prompting strategy by incorporating two lightweight in-context steps including self-critique step and refinement step. The experiment results on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Mobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI
