LinguaFluid: Language Guided Fluid Control via Semantic Rewards in Reinforcement Learning

Aoming Liang; Chi Cheng; Dashuai Chen; Boai Sun; Dixia Fan

arXiv:2508.05977·cs.LG·August 15, 2025

LinguaFluid: Language Guided Fluid Control via Semantic Rewards in Reinforcement Learning

Aoming Liang, Chi Cheng, Dashuai Chen, Boai Sun, Dixia Fan

PDF

TL;DR

This paper presents LinguaFluid, a reinforcement learning approach that uses semantic similarity between language instructions and states to guide control tasks, reducing reliance on manual reward engineering.

Contribution

It introduces a novel semantic reward mechanism using SBERT for aligning language instructions with agent states in RL environments.

Findings

01

Semantic rewards guide learning effectively without hand-crafted rewards.

02

Language embedding space correlates with Euclidean space in control tasks.

03

Framework enables natural language goal alignment in fluid control.

Abstract

In the domain of scientific machine learning, designing effective reward functions remains a challenge in reinforcement learning (RL), particularly in environments where task goals are difficult to specify numerically. Reward functions in existing work are predominantly based on heuristics, manual engineering, or task-specific tuning. In this work, we introduce a semantically aligned reinforcement learning method where rewards are computed by aligning the current state with a target semantic instruction using a Sentence-Bidirectional Encoder Representations from Transformers (SBERT). Instead of relying on manually defined reward functions, the policy receives feedback based on the reward, which is a cosine similarity between the goal textual description and the statement description in the episode. We evaluated our approach in several environments and showed that semantic reward can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.