Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Mohamad Chehade; Soumya Suvra Ghosal; Souradip Chakraborty; Avinash Reddy; Dinesh Manocha; Hao Zhu; Amrit Singh Bedi

arXiv:2505.23729·cs.CL·June 3, 2025

Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Mohamad Chehade, Soumya Suvra Ghosal, Souradip Chakraborty, Avinash Reddy, Dinesh Manocha, Hao Zhu, Amrit Singh Bedi

PDF

Open Access

TL;DR

This paper introduces SITAlign, a novel inference-time framework for aligning large language models with human preferences by satisficing, which balances primary objectives with secondary thresholds, backed by theoretical bounds and empirical validation.

Contribution

It proposes a new satisficing-based inference method for LLM alignment, providing theoretical bounds and demonstrating superior performance over existing multi-objective strategies.

Findings

01

SITAlign outperforms state-of-the-art methods by 22.3% in helpfulness GPT-4 win-tie rate.

02

The framework effectively balances helpfulness and harmlessness constraints.

03

Theoretical bounds on sub-optimality are derived for the satisficing approach.

Abstract

Aligning large language models with humans is challenging due to the inherently multifaceted nature of preference feedback. While existing approaches typically frame this as a multi-objective optimization problem, they often overlook how humans actually make decisions. Research on bounded rationality suggests that human decision making follows satisficing strategies-optimizing primary objectives while ensuring others meet acceptable thresholds. To bridge this gap and operationalize the notion of satisficing alignment, we propose SITAlign: an inference time framework that addresses the multifaceted nature of alignment by maximizing a primary objective while satisfying threshold-based constraints on secondary criteria. We provide theoretical insights by deriving sub-optimality bounds of our satisficing based inference alignment approach. We empirically validate SITAlign's performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Adam · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Multi-Head Attention · Byte Pair Encoding