Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning

Duc Hieu Ho; Chenglin Fan

arXiv:2506.16064·cs.CL·June 23, 2025

Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning

Duc Hieu Ho, Chenglin Fan

PDF

Open Access

TL;DR

This paper introduces a self-critique-guided prompting method that enhances honesty and helpfulness in large language models without additional training, demonstrated through comprehensive benchmarking and significant quality improvements.

Contribution

It proposes a novel, training-free prompting strategy that enables LLMs to self-critique and refine responses, improving output trustworthiness across multiple models.

Findings

01

Consistent improvements in honesty and helpfulness scores across models.

02

Reduction in poor-quality responses and increase in high-quality outputs.

03

Relative gains in H2 scores ranging from 1.4% to 4.3%.

Abstract

Large language models (LLMs) have demonstrated robust capabilities across various natural language tasks. However, producing outputs that are consistently honest and helpful remains an open challenge. To overcome this challenge, this paper tackles the problem through two complementary directions. It conducts a comprehensive benchmark evaluation of ten widely used large language models, including both proprietary and open-weight models from OpenAI, Meta, and Google. In parallel, it proposes a novel prompting strategy, self-critique-guided curiosity refinement prompting. The key idea behind this strategy is enabling models to self-critique and refine their responses without additional training. The proposed method extends the curiosity-driven prompting strategy by incorporating two lightweight in-context steps including self-critique step and refinement step. The experiment results on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Mobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI