Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing   Reinforcement Learning Programs

Shiyu Zhang; Haoyang Song; Qixin Wang; Yu Pei

arXiv:2406.19812·cs.SE·July 1, 2024·1 cites

Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs

Shiyu Zhang, Haoyang Song, Qixin Wang, Yu Pei

PDF

Open Access 1 Repo

TL;DR

This paper introduces a fuzzy logic-based automated oracle for testing reinforcement learning programs, effectively addressing the oracle problem by analyzing behavioral compliance trends and outperforming human oracles in complex scenarios.

Contribution

It presents a novel fuzzy logic-guided approach to automate RL program testing, improving reliability and scalability in complex environments.

Findings

01

Fuzzy oracle outperforms human oracles in complex RL testing scenarios.

02

The approach effectively detects buggy RL programs based on compliance trend violations.

03

Automated testing improves efficiency and scalability over manual methods.

Abstract

Reinforcement Learning (RL) has gained significant attention across various domains. However, the increasing complexity of RL programs presents testing challenges, particularly the oracle problem: defining the correctness of the RL program. Conventional human oracles struggle to cope with the complexity, leading to inefficiencies and potential unreliability in RL testing. To alleviate this problem, we propose an automated oracle approach that leverages RL properties using fuzzy logic. Our oracle quantifies an agent's behavioral compliance with reward policies and analyzes its trend over training episodes. It labels an RL program as "Buggy" if the compliance trend violates expectations derived from RL characteristics. We evaluate our oracle on RL programs with varying complexities and compare it with human oracles. Results show that while human oracles perform well in simpler testing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qixinwangcpslab/rl-testing-new
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Software Engineering Methodologies · Software Engineering Research

MethodsSoftmax · Attention Is All You Need