Reward Engineering for Generating Semi-structured Explanation

Jiuzhou Han; Wray Buntine; Ehsan Shareghi

arXiv:2309.08347·cs.CL·January 25, 2024

Reward Engineering for Generating Semi-structured Explanation

Jiuzhou Han, Wray Buntine, Ehsan Shareghi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reward engineering approach using reinforcement learning to improve the generation of semi-structured explanations in language models, surpassing previous methods on key benchmarks.

Contribution

It presents a novel reward engineering technique in reinforcement learning that enhances semi-structured explanation generation, addressing limitations of supervised fine-tuning.

Findings

01

Achieves state-of-the-art results on ExplaGraph and COPA-SSE benchmarks.

02

Demonstrates the effectiveness of reward aggregation methods in RL for explanation tasks.

03

Highlights potential of RL for future semi-structured explanation generation research.

Abstract

Semi-structured explanation depicts the implicit process of a reasoner with an explicit representation. This explanation highlights how available information in a specific query is utilised and supplemented with information a reasoner produces from its internal weights towards generating an answer. Despite the recent improvements in generative capabilities of language models, producing structured explanations to verify a model's true reasoning capabilities remains a challenge. This issue is particularly pronounced for not-so-large LMs (e.g., FLAN-T5-XXL). In this work, we first underscore the limitations of supervised fine-tuning (SFT) in tackling this challenge, and then introduce a carefully crafted reward engineering method in reinforcement learning (RL) to better address this problem. We investigate multiple reward aggregation methods and provide a detailed discussion which sheds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiuzhouh/reward-engineering-for-generating-seg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare