Advising with Threshold Tests: Complexity, Signaling, and Effort
Georgy Lukyanov, Mark Izgarshev

TL;DR
This paper analyzes how different informational regimes in advising with threshold tests affect effort, signaling, and outcomes, revealing how complexity influences optimal thresholds and equilibrium types.
Contribution
It introduces a model comparing naive and sophisticated informational regimes, characterizes equilibrium policies, and provides explicit conditions for different equilibrium outcomes.
Findings
Optimal threshold increases with complexity.
Explicit parameter regions determine equilibrium types.
Refinement reduces pooling outcomes.
Abstract
A benevolent advisor observes a project's complexity and posts a pass - fail threshold before the agent chooses effort. The project succeeds only if ability and effort together clear complexity. We compare two informational regimes. In the naive regime, the threshold is treated as non-informative; in the sophisticated regime, the threshold is a signal and the agent updates beliefs. We characterize equilibrium threshold policies and show that the optimal threshold rises with complexity under mild regularity. We then give primitives-based sufficient conditions that guarantee separating, pooling, or semi-separating outcomes. In a benchmark with uniform ability, exponential complexity, and power costs, we provide explicit parameter regions that partition the space by equilibrium type; a standard refinement eliminates most pooling. The results yield transparent comparative statics and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Behavioral Economics Studies · Game Theory and Applications · Auction Theory and Applications
Advising with Threshold Tests: Complexity, Signaling, and Effort††thanks: We are grateful to Emin Ablyatifov, Jacob Gershman, Arthur Izgarshev, David Li, Anastasia Makhmudova, and Yuliia Tukmakova for helpful comments and suggestions. All remaining errors are our own.
Georgy Lukyanov
Toulouse School of Economics, 1, Esplanade de l’Université, Toulouse, France
Mark Izgarshev
School number 1239, Vspolny per., 6, Moscow, Russia
Abstract
A benevolent advisor observes a project’s complexity and posts a pass–fail threshold before the agent chooses effort. The project succeeds only if ability and effort together clear complexity. We compare two informational regimes. In the naive regime, the threshold is treated as non-informative; in the sophisticated regime, the threshold is a signal and the agent updates beliefs. We characterize equilibrium threshold policies and show that the optimal threshold rises with complexity under mild regularity. We then give primitives-based sufficient conditions that guarantee separating, pooling, or semi-separating outcomes. In a benchmark with uniform ability, exponential complexity, and power costs, we provide explicit parameter regions that partition the space by equilibrium type; a standard refinement eliminates most pooling. The results yield transparent comparative statics and welfare comparisons across regimes.
Keywords: threshold tests; signaling; information design; monotone comparative statics; pooling vs. separation.
1 Introduction
Most institutions run on thresholds. A medical graduate either clears a licensing bar or does not; an airline first officer either meets a minimum flight-hours rule or sits in the right seat later; a firm’s promotion “bar” is raised or lowered with consequences for who advances and how hard people push. In each case a single cutoff both filters who proceeds and shapes what others learn about the task at hand. Recent policy shifts underscore how consequential such decisions are: the USMLE moved Step 1 to pass/fail111The stated motivations included reducing “score obsession,” rebalancing curricula, and mitigating wellness concerns. The reform also altered residency selection, where Step 1 had functioned as a single‐index screen; see the discussion around (USMLE Program,, 2020). to redirect student effort and alter residency-selection incentives (USMLE Program,, 2020); in commercial aviation, the post-2010 “1,500-hour rule”222Industry debates emphasize both safety benefits and pipeline costs. For our purposes, once the rule is codified the bar itself becomes an object of inference and incentive; see (Federal Aviation Administration,, 2013) for the policy background. codified in the FAA’s pilot-qualification reforms made the hour threshold itself the pivotal gate into airline cockpits (Federal Aviation Administration,, 2013). Thresholds are simple to communicate and enforce—but they also broadcast information about underlying difficulty. When a bar is set high (or lowered), careful observers may infer something about the environment, not just the candidates.
This paper proposes a model to study that dual role. An advisor privately observes project complexity and, before any costly action, commits to a pass–fail threshold that only reveals whether the agent’s latent ability lies above the cutoff. Success requires that ability and effort together clear true complexity, so the test both partitions beliefs and shifts the marginal return to effort.333This dual role echoes education signaling: actions both sort types and affect payoffs. In our case the cutoff simultaneously filters and motivates.
We analyze two informational regimes that bracket many applications. In the naive regime, the agent treats the posted threshold as non-informative444“naive” need not mean irrational. Institutions sometimes explicitly instruct candidates not to infer difficulty from posted rules (“don’t read into the bar”), or norms make such inference taboo. We model this as a benchmark that brackets the fully Bayesian case. about complexity and conditions only on pass/fail. In the sophisticated regime, the threshold is itself a signal: the agent understands the advisor’s policy and updates beliefs about complexity from the cutoff. Our object is the advisor’s threshold policy—the mapping from complexity to cutoffs—chosen to maximize the agent’s expected utility (a benevolent benchmark555Equivalently, the designer is utilitarian over the agent’s ex‐ante payoff. Replacing this with an institutional objective (e.g., weighted success minus effort externalities) preserves our monotonicity results under mild alignment. that isolates information-and-incentives effects).
We show that optimal thresholds are (weakly) increasing in complexity in both regimes. Formally, we establish increasing differences and apply monotone comparative statics to prove that any optimal selection is nondecreasing in complexity; for interior solutions the derivative is nonnegative (Theorems 1 and 2). Beyond monotonicity, we classify equilibrium threshold policies by informativeness—separating, pooling, or semi-separating—and give primitives-based sufficient conditions for each case that are stated directly in terms of the thickness of the ability distribution near the cutoff, the dispersion of complexity, the curvature of effort costs, and the value of success (Definition 1 and Theorem 3).
A key novelty is to make the threshold itself an equilibrium signal about the state of the world (complexity) in a setting where the receiver still chooses costly post-signal effort. This tight coupling of endogenous disclosure (via the bar) and subsequent incentives is distinct from standard disclosure or persuasion environments where the receiver’s action is the terminal choice, and from standard threshold/standard-setting models where the bar screens agents but does not reveal the state that determines the productivity of effort. Contemporaneously, Bertola, (2025) studies equilibrium failure rates when institutions endogenize pass thresholds. His analysis focuses on how standards map into observed failure frequencies; by contrast, our advisor observes complexity and the threshold signals that state, which then feeds into the agent’s continuation effort. The two perspectives are complementary: our separating/pooling taxonomy explains when bar movements should be informative about the environment, while his failure-rate lens speaks to observable pass/fail frequencies.
In a benchmark with uniform ability, exponential complexity, and power costs,666Uniform keeps ability tails uncluttered; exponential gives memorylessness and tractable posteriors; power costs include quadratic and provide interpretable elasticities of effort. we derive explicit, graphable inequalities that compare a feasible separating policy to a conservative pooling benchmark. These yield ready-to-plot separating/pooling regions; with quadratic costs they are fully explicit via the Lambert function, and for general power costs they collapse to a one-dimensional root. We then show how the boundary curves shift with the project’s value and cost curvature. Finally, imposing a standard refinement (the Intuitive Criterion) removes most pooling:777The Intuitive Criterion assigns off‐path beliefs to the type that would strictly gain from deviating; here, a small threshold tweak that benefits exactly one forces posteriors that unravel pooling. See Cho and Kreps, (1987) and the related D1/Divinity idea in Banks and Sobel, (1987). a small threshold deviation can be profitable only for the type that benefits from separation, which forces beliefs that unravel pooling (Theorem 4).
The empirical upshot is straightforward: our sufficient conditions produce transparent diagnostics for when institutions should separate (let bars track complexity closely) versus pool (hold the line at a constant bar), and our refinement results explain why, in practice, pooling often unravels once stakeholders can infer complexity from observed bar moves (e.g., grading standards, safety minima, or promotion bars that “creep” with market conditions).
Related literature
We connect to (i) signaling/communication, (ii) information design and evidence disclosure, and (iii) monotone comparative statics.
Classic models show how monotone message partitions arise endogenously in education and communication (Spence,, 1973; Crawford and Sobel,, 1982). Our advisor’s threshold is a one-dimensional signal whose informativeness is determined in equilibrium. Because pooling can often be sustained by beliefs, we rely on the Intuitive Criterion (Cho and Kreps,, 1987) and related D1/Divinity logic (Banks and Sobel,, 1987) to discipline off-path beliefs in the threshold environment.
The pass–fail rule is a coarse scoring device, akin to the optimal coarse disclosure mechanisms in Rayo and Segal, (2010) and to binary experiments in Kamenica and Gentzkow, (2011). A closely related strand studies endogenous grading/standards. Boleslavsky and Cotton, (2015) analyze how a school sets grading standards to trade off selection and incentives; their bar screens students and shapes effort but does not signal the state of the world. Bertola, (2025) examines equilibrium failure rates under endogenous pass thresholds, emphasizing how standards translate into observed fail shares. In contrast, our threshold both sorts and reveals complexity, and the agent responds with post-signal effort—this two-way interaction between disclosure and incentives is where our comparative statics and taxonomy bite. We also relate to evidence design and selective disclosure: Herresthal, (2022) studies hidden testing and what evidence is disclosed ex post. Our advisor instead posts an observable bar ex ante; yet the common theme is how a designer chooses a coarse experiment that strategically shapes downstream behavior. Finally, links to certification and grading with endogenous thresholds (Chan et al.,, 2007; Ostrovsky and Schwarz,, 2010) are direct: our results rationalize when coarser (pooling) certification survives refinements and when standards should track underlying difficulty.
We use single-crossing and lattice methods (Milgrom and Shannon,, 1994; Topkis,, 1998), extended to uncertainty/endogenous information via Athey, (2001, 2002) and strict/interval dominance tools (Edlin and Shannon,, 1998; Quah and Strulovici,, 2009). Log-concavity and related hazard-rate conditions (Bagnoli and Bergstrom,, 2005) ensure that threshold and effort best responses move monotonically with complexity and with posterior informativeness, delivering our separating/pooling regions.
Because policy thresholds in education, labor markets, and regulation have measurable behavioral effects, our model provides structure beneath RD-style estimates (Hahn et al.,, 2001; Imbens and Lemieux,, 2008; Cattaneo et al.,, 2020). It clarifies when endogenously informative cutoffs amplify or attenuate post-threshold behavior and how cost curvature and the distributional shape of complexity govern whether institutions should separate or pool; and it complements the failure-rate perspective emphasized by Bertola, (2025) by highlighting when bar movements themselves should be interpreted as signals of environmental difficulty.
The paper proceeds as follows. Section 2 lays out the environment and timing. Sections 3 and 4 develop the naive and signaling analyses and prove monotonicity. Section 5 derives the sufficient partitions. Section 6 applies the refinement. Section 7 provides extensions showing robustness to additive technologies, correlation, noisy tests, and testing costs, and quantify the value of observing complexity. Section 8 concludes. All the proofs are in A.
2 Model description
Environment
There is a benevolent advisor and a single agent. If the project succeeds, the agent receives a benefit . Success depends on three primitives: the agent’s latent ability , his effort , and the project’s complexity . Technology is multiplicative:
[TABLE]
Effort is costly. The cost function satisfies , , and for .
Uncertainty
Ability is drawn from a commonly known distribution with density on . Complexity is drawn from a distribution with density on . Unless stated otherwise, and are independent.888Positive assignment (hard tasks to strong agents) is common in hospitals and professional services. Our correlation extension shows the results persist under MLR‐type dependence between and . We will impose shape restrictions only when needed for existence and monotone comparative statics (Sections 3 and 4).
Test
Before any effort choice, the advisor can post a pass–fail test999Pass/fail is the coarsest scoring rule. Institutions often choose coarse disclosure to manage incentives; see Rayo and Segal, (2010). Our endogenous cutoff implements such coarse experiments. that compares ability to a threshold . The test is costless and does not require effort. It returns an outcome with
[TABLE]
The threshold may carry information about in equilibrium, since the advisor observes and commits to the test.
Timing
Nature draws from . The advisor privately observes ; the agent observes neither nor . 2. 2.
The advisor commits to a threshold . A policy is a measurable mapping , . 3. 3.
The test is administered and is publicly observed. 4. 4.
The agent chooses continuation effort ; it can depend on the posted and on . 5. 5.
Payoffs realize. The agent’s utility is . The advisor is benevolent and maximizes the agent’s expected utility.
Beliefs from the test
Conditional on , the agent knows whether lies above or below . Let the truncated densities be
[TABLE]
For complexity, the beliefs used at the effort stage depend on the informational regime defined next.
We analyze two cases:101010Think of the naive case as “policy is posted by committee; candidates are told not to infer difficulty,” and the sophisticated case as “everyone knows how the bar moves with conditions.” Both are realistic bracketing assumptions.
- •
naive agent. The agent treats as non‐informative about and uses the prior when choosing . Thus the CDF and density entering the effort problem are and .
- •
Sophisticated agent. The agent is fully Bayesian and understands the advisor’s policy . Upon observing an on‐path threshold , he forms the posterior over ,
[TABLE]
with corresponding and density . Off‐path beliefs are part of equilibrium; when we refine equilibria, we apply the Intuitive Criterion.
Agent’s continuation problem
Given , an outcome , and regime‐specific beliefs about , the agent solves
[TABLE]
where the expectation is with respect to or depending on . When interior, first‐order conditions are
[TABLE]
Under the regularity imposed below, (pass truncation raises expected ) and each effort choice is unique if interior.
Advisor’s problem
Fix (observed by the advisor). Anticipating and , the advisor chooses the threshold to maximize the agent’s expected utility:
[TABLE]
A threshold policy is a measurable , .
Equilibrium
In the naive regime, an equilibrium is a pair such that, for every , given and efforts solving (1)–(2).
In the sophisticated regime, a Perfect Bayesian Equilibrium (PBE) is a triple with:
Agent optimality. For every on‐path , the efforts solve (1)–(2) using from . 2. 2.
Advisor optimality. For every , given those efforts and beliefs. 3. 3.
Belief consistency. For on‐path thresholds, is Bayes’ rule under ; off‐path beliefs are specified, and when we refine equilibria they satisfy the Intuitive Criterion.
Regularity
We will appeal to the following standing assumptions when proving existence and monotonicity.
Assumption 1**.**
(i) and are continuous, strictly positive on compact subsets of , and (weakly) log‐concave. (ii) is , strictly convex for , with continuous and bounded away from [math] on compacts.
Under Assumption 1, the kernels in (1)–(2) are decreasing in , so each effort is well defined and unique when interior; pass truncation implies . These properties, combined with the piecewise form of (3), yield increasing differences in and the monotonicity results stated in Theorems 1 and 2.
Benchmark parameterization
When we specialize, we will use , , and with . This delivers closed‐form expressions for the sufficient inequalities that partition the parameter space by equilibrium type (B).
3 Equilibrium analysis: naive agent
The naive regime isolates the selection effect of pass/fail without the extra feedback loop from threshold–as–signal. This gives a clean “moving kink” structure in the advisor’s problem that we will reuse verbatim in the signaling case: only the distribution over changes. The section therefore builds intuition for all that follows—how pass/fail truncation shifts expected ability, how the marginal benefit of raising behaves, and why monotone comparative statics fall out.
In the naive regime, the agent treats the posted threshold as non-informative about and uses the prior at the effort stage. Hence and in (1)–(2). The only information created by the test is the pass/fail truncation in ability.
Effort choices
Given and , the agent solves
[TABLE]
so the interior first-order conditions specialize to
[TABLE]
Pass/fail does two things to the agent’s problem: it shifts the distribution of (pass FOSD dominates fail) and leaves the kernel decreasing in under log-concavity. The first drives ; the second guarantees uniqueness (when interior) and underpins all comparative statics by the implicit-function theorem.
Lemma 1**.**
Suppose Assumption 1 holds. For each threshold and each outcome , the agent’s continuation problem admits a maximizer . Whenever the solution is interior, it is unique and satisfies . Moreover, is (weakly) increasing in and (weakly) decreasing under first-order stochastic dominance shifts of toward greater complexity.
Proof sketch.
Strict convexity of and continuity of the kernels yield at most one interior solution; coercivity of yields existence. Since first-order stochastically dominates and the right-hand side of (4) is increasing in the distribution of under FOSD, we obtain . Monotonicity in and under FOSD shifts of follows by the implicit function theorem. ∎
Pass/fail “zooms in” on different regions of ability. Conditioning on pass shifts mass to the right of , which raises the marginal return to effort in (4); since the right-hand side is decreasing in , the unique solution must be weakly higher in the pass group. This is the basic selection–incentives complementarity that recurs throughout the paper.
Advisor’s problem
Given and anticipating , , the advisor chooses to maximize in (3) with .
Holding fixed the (best‐response) efforts and , the advisor’s objective can be read as areas cut off by a right‐moving kink at and . As increases, those kinks slide right, which is exactly the source of increasing differences in .
Theorem 1**.**
Under Assumption 1, the advisor’s objective has increasing differences in . Hence for every the set of optimal thresholds is nonempty, and any optimal selection is (weakly) increasing in . In particular, when the optimizer is interior, .
Proof sketch.
Write as a sum of terms of the form and , with depending only on . Each term has a kink that moves right with , generating increasing differences. Topkis then yields monotone optimal selections. ∎
Harder environments mechanically call for (weakly) tougher bars. Empirically, any proxy for that shifts right should be accompanied by nondecreasing if the institution is close to optimal in this benchmark.
The interior MCS result leaves open what happens near extremes of . When is tiny, success saturates and only costs matter near , creating a no‐test corner; when is huge, success is rare unless either the threshold collapses (expanding fail) or pass effort explodes—so the limiting choice is governed by the continuation-cost tradeoff. The next statement formalizes this split and pins down large‐ limits through the cost aggregator .
Proposition 1**.**
Let be the advisor’s objective in the naive regime. There exists such that is optimal for all .
For every compact interval there exists such that, for each , the function attains a maximizer on . Moreover, for any at which is differentiable at the maximizer, the optimizer is interior and satisfies the first–order condition; otherwise the maximizer occurs at a finite kink of .
Define the continuation cost function
[TABLE]
and extend the policy space to by setting and . Then, uniformly on compact sets of ,
[TABLE]
Consequently, any sequence of maximizers has limit points contained in
[TABLE]
If is a singleton, then as . In particular:
- (i)
If , then .
- (ii)
If , then .
- (iii)
If , then .
Remark**.**
When , the pass set is empty and everyone realizes . The induced effort converges to the effort computed under the full prior (since ), so is the limiting value with an “always fail” test. If one restricts with , case (ii) above becomes the boundary choice .
Corollary 1**.**
Let as in Proposition 1(iii). Suppose is (weakly) increasing and is (weakly) decreasing in . Then:
If there exists such that is strictly decreasing on and is nondecreasing there, then is strictly decreasing on and hence . Consequently, as . 2. 2.
If there exists such that is nondecreasing on and is strictly decreasing there, then is strictly decreasing on and hence . Consequently, as .
In the benchmark , , and , these monotonicities hold numerically across the parameter ranges we consider, and the case in (1) (shut–down) is the typical large- limit.
Proof.
By Proposition 1(iii), uniformly on compacts as , so any limit point of maximizers must lie in . In (1), is strictly decreasing on , hence minimized at the endpoint ; similarly in (2) it is strictly decreasing on , hence minimized at [math]. The convergence of follows because a unique minimizer forces all maximizers of to converge to it as . ∎
Case (1) (“shut–down”) says that when the marginal cost of keeping a pass group is eventually dominated by its measure , the optimal policy is to send everyone to the fail branch at very high . In many licensing or safety screens, this corresponds to “no one passes in very bad states”—consistent with observed de facto moratoria in extreme conditions.
In Section 5 we illustrate the large- behavior by showing that, for our benchmark primitives, the shut–down limit in Corollary 1 is the empirically relevant case across wide parameter ranges.
4 Equilibrium analysis: signaling
The only element that changes relative to the naive case is the distribution of entering the agent’s problem: the posted now induces a posterior through . All primitives of the “moving kink” remain; the difference is that informativeness raises the marginal return to effort via the posterior density , which strengthens (rather than weakens) our monotonicity conclusions.
In the signaling (sophisticated) regime, the agent understands the advisor’s policy , so the posted threshold is itself informative about . After observing and the pass/fail outcome , the agent updates beliefs and then chooses effort.
Posteriors induced by the threshold
Given a threshold policy , the on-path posterior over complexity after observing is
[TABLE]
Pass/fail truncation in ability is the same as in Section 2: and .
Effort choices
Given and , the agent solves
[TABLE]
so the interior first-order conditions are exactly (1)–(2) with the posterior density :
[TABLE]
Lemma 2**.**
Under Assumption 1 and on-path Bayes posteriors, the continuation problems admit maximizers and . Whenever interior, solutions are unique and satisfy . For fixed , is (weakly) increasing in and (weakly) increasing in the informativeness of the posterior in the Blackwell order.
A more informative posterior tightens mass around realized . With log-concavity, is decreasing, so the right-hand side of (6)–(7) shifts up: the agent’s best responses (weakly) increase. This is the “better information raises effort” channel behind our separation results.
As in Lemma 1, strict convexity of and the fact that is decreasing deliver at most one interior solution; coercivity gives existence. Pass truncation implies that FOSD-dominates , hence . For informativeness, if becomes more informative in the Blackwell sense and is log-concave, then is decreasing and the right-hand side of (6)–(7) rises; the implicit function theorem yields higher .
Advisor’s problem
Given and anticipating , the advisor chooses to maximize as in (3), but the effort kernels now use induced by .
Theorem 2**.**
Under Assumption 1 with on-path Bayes consistency, the advisor’s value has increasing differences in . Consequently, for every the set of optimal thresholds is nonempty, and any optimal selection is (weakly) increasing in ; interior solutions satisfy .
Proof sketch.
Fix and the induced posterior. As in Section 3, is a sum of terms like and , with depending only on . The “kinks” in these terms shift right with , producing increasing differences pointwise in the posterior. Taking expectation over the on-path posterior preserves increasing differences. Topkis then gives a nondecreasing selection in . ∎
A nondecreasing optimal selection may be strict, flat, or flat on blocks: these correspond exactly to separating, pooling, and semi–separating policies. The next definition and theorem make that partition explicit and connect it to primitive conditions.
We classify equilibria by the informativeness of about .
Definition 1**.**
A PBE is separating if implies ; pooling if is constant on a set of types of positive measure; and semi-separating otherwise (pooling on a subinterval and separating elsewhere).
Theorem 3**.**
Let denote the advisor’s continuation value when threshold is posted and the agent best–responds given the (on–path) posterior induced by the candidate policy . Define the argmax correspondence
[TABLE]
Then:
- (i)
By Theorem 2, has increasing differences in , hence is nonempty and has a nondecreasing measurable selection . With on–path Bayes beliefs, is a PBE. 2. (ii)
Every monotone PBE falls into exactly one of:
- •
Separating*: is strictly increasing on a full–measure set (equivalently: for a.e. the maximizer is unique and is strictly increasing a.e.).*
- •
Pooling*: is (a.e.) constant: there exists with for a.e. .*
- •
Semi–separating*: There exists a nondegenerate closed interval with for all , while is strictly increasing on and on .*
These three cases are mutually exclusive and exhaustive.
If for Lebesgue–a.e. the maximizer is unique, i.e. , then the monotone selection is strictly increasing a.e.; hence a separating PBE exists.
For any constant , there exist off–path beliefs that support a pooling PBE with . (Advisor optimality holds pointwise because off–path beliefs can make all deviations weakly unattractive; on–path Bayes pins the posterior at to the pooled set.)
Suppose there exist and such that for every , while for and for . Then there exists a semi–separating PBE with for and strictly increasing outside .
Remark**.**
Any constant is a PBE given suitable off–path beliefs; on–path Bayes pins the posterior on the pooled set.
If the argmax is a singleton a.e., the nondecreasing selection is strictly increasing a.e., hence separating.
If for all while lies strictly below (above) for (), paste the constant block on and the unique selection outside.
Where the marginal value of informativeness equals the marginal continuation distortion, flat blocks become viable; these equalities define (weakly) monotone frontiers in primitive space by continuity of the envelopes.
Pooling can often be supported by beliefs. A small off-path shift in that helps only a specific triggers the Intuitive Criterion, which assigns posterior mass to and breaks pooling. We formalize this in Section 6 (Theorem 4); the effect is to expand the separating region at the expense of pooling.
Under , , and , the posterior is a (normalized) restriction of the exponential density to the preimage . If is separating and continuous, then is a singleton and the posterior collapses to a point mass; effort then solves a deterministic cutoff problem conditional on pass/fail truncation in . If is piecewise constant (pooling or semi-separating), the posterior is exponential truncated to the pooled interval. In either case, (6)–(7) reduce to one-dimensional equations in with kernels that are either degenerate or truncated exponentials, and the advisor’s objective is computed by the same piecewise logic as in Section 3.
5 Parametric benchmark and equilibrium partition
We now specialize to a benchmark where formulas are explicit and the partition by equilibrium type can be graphed directly. Throughout this section:
[TABLE]
Uniform ability keeps tails uncluttered, exponential complexity delivers posteriors that stay in family under truncation, and power costs give interpretable elasticities while retaining tractability. The point is not knife–edge optimality, but closed‐form inequalities that make the separating/pooling frontier graphable and comparable across .
The naive regime uses the prior at the effort stage; the signaling regime replaces by the posterior induced by the on-path threshold .
Effort FOCs in the benchmark
Let and be the pass/fail truncations of as in Section 2. In the naive regime, . Plugging into (1)–(2) yields:
Lemma 3**.**
Let , , and with . In the naïve regime, interior efforts satisfy
[TABLE]
In the signaling regime, replace by the on-path posterior density .
Proof.
Direct integration of with under the two uniform truncations delivers the closed forms. The signaling case is the same substitution with in place of . Uniqueness (if interior) follows from strict convexity of and the fact that is decreasing. ∎
For each group, the left side is marginal cost ; the right side is a pass/fail–truncated moment of the density of evaluated at . As rises, the pass truncation shifts right and the fail truncation left, widening the gap between and . Increasing steepens the density and dampens both efforts.
Fix a one–parameter family of feasible separating policies indexed by a policy knob : upon observing , the advisor posts
[TABLE]
This design guarantees that every passing agent (including the marginal pass type) succeeds, because .111111We use purely as a tractable tuning parameter to generate a lower bound on the value attainable under separation. Smaller means lower pass effort but a tougher threshold (fewer passers); larger means higher pass effort but a milder threshold (more passers). The per–pass net surplus is because C(e_{s})=\big{(}\alpha V^{1/\gamma}\big{)}^{\gamma}=\alpha^{\gamma}V. We do not claim this family is pointwise optimal among all separating policies; it is a convenient, closed-form benchmark that yields transparent inequalities.
We will compare this feasible separating value to a conservative pooling benchmark. Relative to this family, B derived two primitives–based objects: the feasible separating lower bound in (10) and the pooling upper benchmark (with the corner enforced). Define
[TABLE]
To understand Figure 1, observe that when is small, complexity is typically low, so a revealing threshold lets the advisor secure success in the pass group with modest effort; the separating lower bound rises faster than the pooling bound and dominates. As grows, more mass sits on high complexity: the pooling upper bound approaches (the “best you could ever do” without information), while the separating bound saturates at because the pass group’s effort is capped by the policy. The curves cross once at , which is exactly the comparative-statics boundary for our sufficient conditions.
Corollary 2**.**
For fixed , the crossing is (weakly) increasing in . As a function of , is (weakly) nonincreasing under the sufficient bounds.
Proof.
Differentiate at a root. The implicit function theorem gives
[TABLE]
At the crossing, because scales linearly in with a strictly positive bracket while the envelope derivative of with respect to equals (where is the pooling optimizer). Numerically, and under mild regularity on growth of , at the crossing (the pooling envelope derivative in dominates the separating derivative). Hence ; strict positivity is observed throughout our grids.
For fixed , is strictly decreasing in , and is weakly decreasing if (weakly increasing if ), so the separating term weakly falls in ; the pooling value is the maximum of over , which is weakly decreasing in pointwise in ; hence its supremum is weakly decreasing. Therefore, moving up shifts weakly down for each , and any crossing moves weakly left. ∎
Thus the sufficient separating region expands with project value and (weakly) shrinks with curvature: higher pushes, higher pulls.
Figure 2 illustrates the intuition. A higher raises the return to making thresholds informative (it scales the marginal value of effort and success), so the separating region expands: the crossing moves to larger .
To draw the partition for a given :
For each gridpoint , solve the pooling FOC for (unique root), and compute by the envelope. 2. 2.
Evaluate from (10). 3. 3.
Set . Points with lie in the separating region (sufficient); are pooling-sustainable by the bounds. The boundary is ; find it by bracketing and bisection in for each .
For , the pooling bound admits the closed form in (8) via Lambert .
When effort gets quickly expensive (high ), informative thresholds buy less extra effort; institutions should then either lower the bar’s sensitivity to perceived difficulty or complement thresholds with non-effort instruments (mentoring, team composition).
Consider the intuition of Figure 3. More convex costs dull the incentive effect from information: the pass/fail split raises effort less when marginal costs explode, so the pooling benchmark overtakes at lower .
Small means complexity is typically low; the feasible separating policy guarantees success in the pass group at modest cost and dominates the pooling benchmark. Large puts substantial weight on high complexity; the pooling benchmark then captures most attainable value (approaching ) while the separating policy preserves a wedge , so pooling becomes sustainable by the bound. Higher pushes the crossing outward (more separation), while higher curvature brings it inward (costly effort dulls the benefit of revealing policies).
Together, Figures 1–3 show that higher project value expands the separating region (outward shift in ), while greater cost curvature shrinks it—clean, testable comparative statics.
Numerical implementation
We locate by bracketing and bisection. For each , the pooling FOC
[TABLE]
has a unique solution because the left-hand side is strictly decreasing in while the right-hand side is strictly increasing. We then evaluate and via (10), compute , and refine the bracket to the zero of by bisection. A mixed grid in (log near [math], linear for ) speeds bracketing. For , one may use the Lambert- closed form (8).
6 Refinement and elimination of pooling
In threshold environments, tiny deviations identify the unique type that truly benefits from a slightly different bar. Any refinement that assigns beliefs to “the type with a strict local gain” therefore breaks most pooling arguments. The Intuitive Criterion is the classic way to formalize this logic.
Pooling can often be supported by off-path beliefs. We therefore impose the Intuitive Criterion (IC) of Cho and Kreps, (1987). Intuitively, if some type of advisor (i.e., some complexity ) would strictly benefit from a deviation to a nearby threshold while all other types would not, then observing should make the agent assign probability one to that unique profitable type; in that posterior, the deviation must be optimal for that type, contradicting a pooling best response.
Local profitability and type separation
Fix a pooling candidate with on-path threshold . Let be the advisor’s value from (3) evaluated at the continuation efforts induced by . Write the marginal value of informativeness (MVI) at for type as
[TABLE]
where the derivative is taken holding the agent’s effort responses at their envelope values. By Theorems 1 and 2, has increasing differences, so is nondecreasing in under the regularity in Assumption 1.
Lemma 4**.**
Suppose there exists with , and for all in a neighborhood of . Then there exists arbitrarily close to such that
[TABLE]
By continuity of in and monotonicity of in , a small move in in the direction of raises at while weakly lowering it for nearby types with nonpositive marginal gains. A standard mean-value argument delivers the strict/weak inequalities.
Intuitive Criterion
Under IC, if a deviation is strictly profitable for some types and (weakly) unprofitable for all others, the agent’s posterior upon observing must be supported on the profitable set; if that set is a singleton , the posterior assigns probability one to .
Theorem 4**.**
Consider a pooling PBE with on-path threshold . If there exists a type and a nearby threshold such that while for all , then the pooling equilibrium fails the Intuitive Criterion. Hence such pooling cannot be sustained.
If observed thresholds occasionally jump by tiny amounts in “easy‐to‐rationalize” directions (e.g., slightly higher bars in clearly tougher periods) and behavior responds as our FOCs predict, IC would select separation in those neighborhoods—pooling should not be expected to persist there.
By Lemma 5 there is a type-separating deviation . IC requires the agent to assign posterior probability one to after observing . Given that posterior, strictly increases the advisor’s payoff at , making the deviation profitable and contradicting the optimality of at . Hence the pooling PBE is eliminated.121212A formalizes the IC posterior and the deviation construction.
Implications in the benchmark
The closed-form bounds in B compare a feasible separating policy with a conservative pooling benchmark. They identify a region where separation strictly dominates pooling by the bounds:
[TABLE]
In that region, there exists a neighborhood of complexities for which the marginal gain from informativeness is strictly positive at any pooling threshold.
Proposition 2**.**
If , then no pooling PBE at that survives the Intuitive Criterion. Only separating (or semi-separating with flat zero–marginal-gain blocks131313IC does not rule out flat segments if multiple types are exactly indifferent to small movements in .) can remain.
Proof sketch.
implies that for a positive-measure set of the advisor strictly prefers a separating move over any pooling policy. By continuity, at a pooling candidate there exists with strictly positive local marginal value, while nearby types do not gain. Lemma 5 then produces a type-specific deviation and Theorem 4 eliminates pooling. ∎
Proposition 3**.**
In any semi-separating PBE that survives the Intuitive Criterion, is weakly increasing and piecewise continuous. On any pooled block with pooled level , the marginal gain from informativeness is zero for almost every ; the endpoints satisfy where is the adjacent separating value at .
Observe that if were strictly positive on a subset of the pooled interval with positive measure, Lemma 5 would generate a profitable deviation eliminating pooling via Theorem 4. Thus the marginal gain must be (essentially) zero throughout the pooled block, and matching at the boundaries follows by continuity of and monotonicity of .
In the uniform–exponential–power-cost benchmark, the sufficient boundary defined by is the frontier where pooling begins to be (conservatively) sustainable by the bounds. Proposition 2 shows that for , all pooling equilibria are eliminated by IC; only separating (and possibly semi-separating with flat segments satisfying Proposition 3) can survive.
Remark**.**
The same logic extends to noisy tests and correlated primitives under the MLR and log-concavity conditions used in Section 7: IC eliminates any pooling configuration with strictly positive local marginal gains for a unique subset of types.
7 Extensions
The baseline model isolates a simple force: a single cutoff both allocates information about ability (pass/fail) and, when the agent is sophisticated, signals the environment through the advisor’s choice. The extensions below ask whether this mechanism is an artifact of our most convenient assumptions (multiplicative technology, independence, noiseless tests, zero testing costs, full observability of complexity) or whether it travels to nearby settings that show up in practice. In each case we motivate the departure with a concrete use case (licensing, grading, audits, internal assignment, or regulation), connect to existing literatures on persuasion, certification, and inspection, and explain how our results carry over and what new comparative statics emerge.
7.1 Additive technology
In many applications effort “adds points” rather than scaling ability. Exam prep pushes a test score upward; compliance staff add items to a checklist; a junior analyst’s hours add to a team’s baseline. The USMLE Step 1 reform is a useful mental model: before pass/fail, study time moved the raw score by roughly additive bumps; after pass/fail the institution still sets a threshold that partitions the posterior, but the technology is closer to than (USMLE Program,, 2020). The question is whether our knife-edge conclusions—monotone thresholds in complexity and the separating/pooling taxonomy—hinge on multiplicativity or reflect a more general “moving-kink” structure.
With and log-concave posteriors, our core mechanism is intact: pass groups work (weakly) harder than fail groups, the advisor’s problem has increasing differences in , and optimal thresholds are (weakly) increasing in complexity. Intuitively, the cutoff still creates a rightward-moving kink in the success region as rises, so Topkis-style monotone comparative statics go through. The practical takeaway is that whether coaching “adds” points or “scales” returns, harder projects should rationally induce tougher cutoffs, and pass cohorts should exert more effort ex post.
Proposition 4**.**
With success technology , log-concave on-path posteriors , and strictly convex , we have , the advisor’s value exhibits increasing differences in in both regimes, and the separating/pooling/semi-separating taxonomy carries over.
Proof sketch.
Interior FOCs: . Log-concavity makes decreasing, so the kernel is decreasing in ; strict convexity gives uniqueness (if interior). Pass truncation FOSD-dominates fail, yielding . The advisor’s payoff is again a sum of terms with kinks whose locations shift right with , giving increasing differences. ∎
7.2 Correlation between ability and complexity
Hard tasks rarely arrive at random. Firms route thorny clients to senior teams; teaching hospitals assign complex cases to high-ability residents; selective programs track stronger students into harder courses. In all of these, ability and complexity are positively correlated by design. Conversely, in some organizations difficult, neglected work lands on over-stretched, lower-ability units.141414Internal assignment and “tracking” in education are salient examples: sorting raises the informational bite of pass/fail because “pass” now reflects both ability and task difficulty. Our comparative statics quantify this bite. Because our threshold is both a filter and a signal, sorting can amplify or mute its informational bite: when high sees high , a pass communicates more about a tough environment than when cases are randomly assigned.
Under MLR-type correlation and log-concave posteriors, the pass group still works (weakly) harder, preserves increasing differences, and optimal thresholds remain (weakly) increasing in . Positive sorting expands the region where separating policies are attractive (passes are especially informative about high ), while negative sorting compresses it. For readers, the intuition is simple: if “hard things go to strong people,” a high bar does double duty—screening and credible signaling about difficulty—so institutions have more to gain from letting thresholds move with complexity.
Proposition 5**.**
Suppose has a joint density with satisfying MLR in with respect to , and the relevant conditional/posterior densities are log-concave; let be strictly convex. Then , the advisor’s value has increasing differences in , and the optimal threshold is (weakly) increasing in in both regimes. Positive (negative) correlation enlarges (shrinks) the separating region.
MLR preserves single-crossing after pass/fail truncation and after conditioning on . The decreasing-kernel and convexity arguments for the effort problems go through. Increasing differences follow from the same moving-kink logic, now integrated against the conditional distributions. Comparative statics of the separating region follow from the effect of correlation on the informativeness of the pass/fail split about .
7.3 Imperfect (noisy) tests
Real cutoffs are measured with error. Exams are hand-graded or curved; audits have false positives and negatives; and algorithmic “pass” decisions have ROC trade-offs. In residency selection and professional licensing, committees regularly wrestle with borderline files and noisy assessments; in compliance, sampling misses some violations and flags some innocents (e.g., the inspection models in Mookherjee and Png,, 1989). If pass/fail is noisy, two forces collide: the pass group learns less about (truncation is blurrier) and the observed threshold is a less precise signal about .151515Noise can be statistical (grading error) or strategic (limited audit intensity). In inspection models (Mookherjee and Png,, 1989), similar trade‐offs are summarized by ROC curves; our message is that noisier instruments weaken both selection and signaling.
With MLR noise and strictly convex costs, and increasing differences survive, but the value of informativeness shrinks smoothly with noise. As error rates rise, the sufficient region where separation dominates shrinks toward pooling. Intuitively, noise flattens the marginal benefit curve of raising because both (i) the pass cohort is less selected on ability and (ii) a posted cutoff carries less credible information about complexity. This gives a clean, testable prediction: when instruments are noisy (new exam formats; shallow audits), institutions should temper how aggressively they let thresholds move with perceived difficulty.
Proposition 6**.**
Let the pass/fail test comparing to have false-negative and false-positive rates that satisfy MLR in . With strictly convex , we retain and increasing differences in .
MLR ensures the posterior after MLR-dominates that after , so the same kernel and convexity arguments yield and uniqueness (if interior). The moving-kink argument for increasing differences is unchanged. Noise attenuates truncation and thus the gain from informativeness, shrinking the separating inequality in B continuously in the error rates.
7.4 Costly testing and partial commitment
Setting and moving thresholds is rarely free.161616Cut‐score setting is resource‐intensive (psychometrics, item calibration, legal review). When posting the bar is costly, one should expect inaction bands—thresholds that do not move for small changes in . Our proposition formalizes this. Psychometric work to validate a new cut score, proctoring time, extra audits, or legal review of a policy change all generate costs; in some settings the advisor can only commit to a coarse menu (“low/medium/high bar”) rather than a fine mapping. Think of regulators ratcheting a minimum-hours rule (FAA), a dean’s office tuning a qualifying-exam bar, or a compliance team deciding how intensively to screen vendors. The natural question is whether small complexities are worth “turning on” the threshold at all, and how commitment frictions change the shape of optimal policies.
Adding a convex posting cost creates a no-test region: for low , the marginal information value is dominated by marginal testing cost, so is optimal. Beyond a cutoff , the optimal threshold again rises (weakly) with complexity. Under partial commitment (menus or discretized bars), the same feasible-separation logic applies: our sufficient separating region still certifies when informative thresholds are worthwhile, while the Intuitive Criterion bites harder against pooling. Policy implication: institutions should expect inaction bands—intervals of complexity where it is optimal to keep the bar unchanged—and wider bands when testing is expensive or governance restricts fine tuning.
Proposition 7**.**
If posting incurs a continuous, increasing, convex cost with , then there exists such that for all , and for the optimal threshold is (weakly) increasing in .
Proof sketch.
Subtracting preserves increasing differences in , so monotonicity survives. For small , marginal gains from information are dominated by marginal testing costs, creating a no-test region. The closed-form replacement follows by evaluating the feasible separating policy at and subtracting per pass. ∎
If the advisor can commit only to a correspondence (menus or discretized thresholds), the Intuitive Criterion becomes more powerful, but the conservative separating region from B still applies (it comes from a feasible policy).
7.5 Advisor does not observe complexity
Sometimes the designer must pick one bar for everything. A central board sets a single passing score for multiple cohorts; a regulator chooses a uniform minimum standard without seeing firms’ task difficulty; a firm adopts a global promotion bar despite heterogeneous business lines. This “global threshold” is a natural benchmark for the value of observing complexity: how much is lost when the advisor cannot condition the cutoff on ?
Without observing , the advisor optimizes a constant threshold against the prior, collapsing the signaling margin. The value of observing is weakly positive and becomes strictly positive exactly when our sufficient separating condition holds on a set of complexities of positive measure: the designer would like to raise the bar on tougher tasks and lower it on easy ones. When pooling is optimal everywhere by our bounds, the value of information is (essentially) zero. Empirically, this yields a “price of ignorance” bound: in environments where we diagnose separation (e.g., hard rotations, high-variance lines of business), letting managers or committees tailor thresholds to realized difficulty should produce measurable gains.
Proposition 8**.**
If the advisor cannot observe , she chooses a constant threshold to maximize . The value of observing is weakly nonnegative171717Formally, by pointwise optimization in . This is the classic value‐of‐information logic from Blackwell, (1953)., and strictly positive whenever the sufficient separating inequality holds on a set of complexities of positive measure under some feasible policy; it is zero when the pooling inequality holds everywhere.
Comparing to gives weakly nonnegative value by optimizing pointwise in . Strict positivity follows if a feasible separating policy yields strictly higher payoff than any constant policy on a positive-measure set; otherwise pooling is optimal and the value is zero.
Across additive technologies, MLR correlation, noisy instruments, and posting costs, two primitives drive design: (i) decreasing kernels that keep effort problems well behaved, and (ii) right–moving kinks that preserve increasing differences for the advisor. As these weaken (more noise, steeper costs), the case for separation shrinks smoothly, matching the comparative statics in our benchmark figures.
8 Conclusion
We studied a simple advising problem where a benevolent advisor observes project complexity and posts a pass–fail threshold before the agent chooses costly continuation effort. The threshold both allocates information about ability (pass/fail) and, when the agent is sophisticated, signals the environment through the advisor’s choice.
Two takeaways organize the results. First, under standard shape restrictions on primitives and convex effort costs, the advisor’s optimal threshold is monotone in complexity in both informational regimes: (Sections 3 and 4). This monotonicity delivers a clean equilibrium taxonomy into separating, pooling, and semi-separating policies. Second, we provide explicit, graphable inequalities—comparing a feasible separating policy with a conservative pooling benchmark—that partition the parameter space and allow straight–to–figure diagnostics. In the benchmark with uniform ability, exponential complexity, and power costs, the boundary can be traced in for any given (Section 5).
Refinement sharpens the picture. The Intuitive Criterion eliminates pooling whenever a small threshold deviation is uniquely profitable for a type (Section 6). In the sufficient separating region identified by our closed forms, pooling cannot survive the refinement, so only separating (or semi-separating with flat zero–marginal–gain blocks) remains.
Robustness exercises show the mechanism is not an artifact of the multiplicative technology or independence. The same monotone policy and taxonomy survive under an additive technology, correlation satisfying MLR, noisy tests, and testing costs (Section 7). When the advisor cannot observe , the two informational regimes collapse and the value of observing is weakly positive, becoming strictly positive when separation is feasible on a nontrivial set.
The model yields testable predictions. Thresholds should be (weakly) increasing in objective measures of complexity; pass groups exert weakly more effort than fail groups181818If an institution raises on objectively harder cohorts (or years), we should observe (i) more post‐threshold effort by the pass group and (ii) stronger RD‐style behavior around the bar when the bar is more informative.; and environments with higher project value or lower cost curvature exhibit more separation. When the threshold is endogenously informative, observed changes in should co-move with post-threshold effort in the directions implied by our FOCs. These implications connect to empirical settings that leverage policy cutoffs and RD-style designs.
Two directions seem most useful next. On the theory side, allowing multi-level (or continuous-score) disclosures and endogenous test noise would let us endogenize coarseness and quantify the information–incentive trade-off beyond binary thresholds. On the empirical side, calibrating the benchmark inequalities to data where both thresholds and post-threshold behavior are observed (e.g., qualifying exams, compliance screens, internal promotion bars) would map observed partitions directly into primitives and recover the implied separating region.
Overall, the paper provides a tractable way to reason about when and how a simple threshold both informs and motivates. The closed-form partitions make the knife-edges transparent, while the refinement results clarify which pooling arguments are robust to credible off-path beliefs.
Appendix A Proofs
Lemmas A.1–A.2 collect the monotonicity and decreasing–kernel primitives used throughout. The naive results (Section 3) rely on the moving‐kink representation and Berge/Topkis; the signaling results (Section 4) add only on–path Bayes and the Blackwell–informativeness step. The refinement arguments formalize the one–sided derivative (“MVI”) logic used by the Intuitive Criterion.
We use for the indicator, assume is with , , on compacts, and—where invoked—log-concavity and strict positivity of the relevant densities on compact subsets. When we appeal to monotone comparative statics we use standard increasing-differences (Topkis) and single-crossing (Milgrom–Shannon) arguments.
A.1 Preliminaries
Lemma A.1**.**
Given and density on , define
[TABLE]
Then first-order stochastically dominates .
Proof.
For any nondecreasing , because the support of is to the right of while that of is to the left and both are renormalizations of . Equivalently, the CDF under lies weakly below that under for all arguments. ∎
Lemma A.2**.**
Let be a continuous density that is weakly decreasing ( wherever the derivative exists). Then for any probability measure on with finite first moment, the map
[TABLE]
is continuous and weakly decreasing in . If on a set of positive measure and has support with positive mass away from [math], then is strictly decreasing.
Proof.
Continuity follows by dominated convergence. For ,
[TABLE]
since is weakly decreasing and . Strict inequality holds under the stated strictness conditions. ∎
A.2 Proofs for Section 3
Proof of Lemma 1.
Fix and . The agent solves
[TABLE]
where and . The map is continuous and bounded by , while is coercive; hence attains a maximizer on by Weierstrass.
Whenever the solution is interior, the FOC is
[TABLE]
By Lemma A.2, the right-hand side is weakly decreasing in , while is strictly increasing; there is at most one interior solution. Corner solutions are covered by .
Since first-order stochastically dominates by Lemma A.1 and the right-hand side of the FOC is increasing in the distribution of under FOSD, we obtain (with the same corner convention). Monotonicity in follows from the implicit-function theorem applied to with decreasing in by Lemma A.2.
Finally, suppose changes from to with pointwise (a shift toward greater complexity), and both have weakly decreasing densities on . Then for all and every , , so the FOC’s right-hand side (and thus the marginal benefit curve) shifts weakly downward; the unique interior solution (if any) weakly decreases by the implicit-function theorem. This covers the benchmark families we use (e.g., exponential). ∎
Proof of Theorem 1.
Write, for and ,
[TABLE]
Since depend only on (not on ), we may rewrite the success part as
[TABLE]
Fix and consider the function of given by and (with ). For any and , the difference
[TABLE]
is nonnegative because the “kink” at shifts right as rises. Thus each term has increasing differences in , and so does (cost terms do not depend on ). Topkis’ theorem then implies that the argmax correspondence is nonempty and has a nondecreasing selection. For interior optima, differentiating the first-order condition in yields . ∎
Proof of Proposition 1.
Fix and write . The map is continuous: the indicator is right–continuous; is continuous in by the implicit–function theorem (Lemma A.2 with ); and the integrands are dominated by integrable bounds. Hence, by Berge’s Maximum Theorem, for each there exists and is upper hemicontinuous.
Fix any . For small, and , so the success probability is locally flat in , while the cost term
[TABLE]
is strictly increasing in near [math] (continuity of and strict convexity of with ). Therefore there exists such that is optimal for all .
Fix . By Step 1, for each the maximizer over any compact exists. Since is continuous and (by inspection of (3)) piecewise continuously differentiable with finitely many kinks in on compacts (the kinks come from the points where changes regime), we can pick large enough that any maximizer lies in . If is differentiable at the maximizer, the envelope theorem yields the first–order condition and interiority; otherwise the optimum occurs at a finite kink.
Write the success part of as
[TABLE]
which vanishes uniformly in on compacts as (bounded convergence; the thresholds diverge). Hence uniformly on compacts. Berge’s maximum theorem plus epi-convergence then imply that any limit point of maximizers belongs to ; uniqueness of the minimizer yields convergence of the maximizers. The boundary cases follow from the definition of at [math] and . ∎
A.3 Proofs for Section 4
Proof of Lemma 2.
Fix and . The agent solves
[TABLE]
with as above. Existence of a maximizer follows by Weierstrass (bounded benefit, coercive cost). For interior solutions,
[TABLE]
By Lemma A.2 (applied to ), the right-hand side is weakly decreasing in , so there is at most one interior solution. Pass truncation FOSD-dominates fail (Lemma A.1), hence .
Informativeness. Let two on-path posteriors and at satisfy (Blackwell more informative). Write the objective as
[TABLE]
For each fixed and , the map is affine (hence both convex and concave) in . Therefore, by Blackwell’s theorem, higher informativeness weakly raises the value of the best action whenever the problem has single crossing in (Milgrom–Weber monotonicity). Single crossing holds because, when is log-concave, the marginal condition has the “decreasing differences” property in : more informative tightens the posterior around its realized and—by log-concavity—raises the value of near in the convex order. Thus . In the benchmark exponential family (Section 5), this reduces to a simple FOSD comparison of posteriors and the conclusion follows directly.191919Formally, under exponential the posterior on any pooled interval remains exponential with a larger rate when the interval is shorter (more informative), making uniformly larger and shifting up the FOC right-hand side for all . ∎
Proof of Theorem 2.
Fix and the associated on-path posterior. As in the naïve case,
[TABLE]
with depending only on (not on ). The same “moving-kink” argument as in Theorem 1 shows that for any fixed posterior, has increasing differences in . Taking expectations over the on-path posterior preserves increasing differences. Topkis then yields a nondecreasing optimal selection ; interior optima satisfy by differentiating the FOC. ∎
Proof of Theorem 3.
Increasing differences (Theorem 2) imply has a nondecreasing measurable selection (Topkis). On–path Bayes yields the posterior at any on–path ; the agent’s best–responses are well–defined by Lemma 2; advisor optimality holds by construction. The three classes in are logically disjoint and cover all nondecreasing maps.
If is singleton a.e., the (unique) nondecreasing selection is strictly increasing a.e. (otherwise two distinct types would share the same unique maximizer on a set of positive measure, contradicting increasing differences). This gives a separating PBE.
Fix . Let and specify off–path beliefs that assign zero continuation value to any deviation; then no type profits from deviating. On–path Bayes at is the distribution of truncated to the pooled set, which pins down the agent’s efforts. This is a standard construction.
Define on and pick a nondecreasing selection on with . For , is optimal by hypothesis. For (resp. ) any (resp. ) is strictly dominated by increasing differences and the separation of from ; thus the pasted policy is pointwise optimal. Bayes’ rule on–path and arbitrary off–path beliefs complete the PBE. ∎
A.4 Proofs for Section 6
We first formalize the local deviation used by the Intuitive Criterion.
Lemma 5**.**
Let be a pooling threshold and define the right-hand directional derivative
[TABLE]
whenever the limit exists (it does almost everywhere by Rademacher’s theorem under Assumption 1). Suppose there is with and a neighborhood of on which for all . Then for all sufficiently small ,
[TABLE]
Proof.
By definition of the one-sided derivative, there exists such that for all the above inequalities hold with weak/strict signs as stated. Continuity of in and local boundedness of the derivatives ensure we can pick a common for a small neighborhood . ∎
Proof of Theorem 4.
Consider a pooling PBE with on-path . If there exists and such that while for all in a neighborhood of , then upon observing the off-path the Intuitive Criterion requires the agent to assign posterior probability one to (the unique type that could benefit). At that posterior, the deviation is strictly profitable for , contradicting optimality of at . Lemma 5 guarantees such a deviation whenever and nearby types have nonpositive . Hence the pooling PBE fails the Intuitive Criterion. ∎
A.5 Proofs for Section 7
Proof of Proposition 4.
Under , the interior FOCs become . If is log-concave, then is decreasing on , so Lemma A.2 applies verbatim with in place of ; existence/uniqueness (if interior) and follow. The advisor’s payoff can again be written as a sum of terms of the form and with , so the same “moving-kink” argument yields increasing differences in and the same separating/pooling taxonomy. ∎
Proof of Proposition 5.
Let be the joint density. MLR of in implies that for the likelihood ratio is increasing in . This property is preserved after pass/fail truncation in and after conditioning on , so the posteriors used in the effort FOCs inherit single crossing. The map remains decreasing by Lemma A.2, ensuring uniqueness of interior efforts and . For the advisor, the same “moving-kink” representation as before establishes increasing differences in after integrating over the conditional distributions. Positive (negative) correlation between and makes pass more (less) informative about high and therefore enlarges (shrinks) the set where separation is attractive—this follows from the fact that the marginal value of informativeness is increasing in by Theorem 2. ∎
Proof of Proposition 6.
Let the binary test at produce with false-negative/positive rates that satisfy MLR in . Then the posterior of after MLR-dominates that after , which implies by the same kernel/convexity argument as before.
The advisor’s payoff continues to be a sum of terms with kinks located at and ; as rises the kinks move right, so increasing differences in are preserved. As noise increases, pass/fail truncation becomes less pronounced, shrinking the marginal value of informativeness continuously in and therefore shrinking the sufficient separating region of B; as we recover the baseline. ∎
Proof of Proposition 7.
Subtracting from the advisor’s payoff preserves increasing differences in (the cost term is independent of ). For small , Step 2 in the proof of Proposition 1 shows that marginal gains from raising are dominated by near [math], so there exists with for . For larger , Topkis implies the optimal is weakly increasing. In the closed-form sufficient condition (B), the per-pass rent is reduced by , which yields the stated replacement. ∎
Proof of Proposition 8.
Without observing , the advisor chooses a constant threshold to maximize . With observation, she chooses a measurable policy to maximize . Since by pointwise optimization in , the value of information is weakly nonnegative. It is strictly positive whenever there exists a positive-measure set of complexities for which a feasible separating policy strictly dominates all constant thresholds; conversely, if the pooling benchmark of B dominates for every , then constant policies are optimal and the value is zero. ∎
Appendix B Closed-form inequalities for equilibrium partitions
This appendix develops explicit inequalities that partition the primitive space into regions where separating, pooling, or semi–separating equilibria arise. Throughout we adopt the benchmark primitives
[TABLE]
The separating side is built from a feasible revealing policy (hence a lower bound on the advisor’s payoff). The pooling side is an ability–wise benchmark that (conservatively) upper bounds the payoff attainable under pooling. Therefore, whenever the separating lower bound exceeds the pooling upper bound, separation is guaranteed by our bounds; when it does not, pooling is sustainably better by the bound (the true boundary may be more favorable to separation, especially under refinements).
B.1 Quadratic costs
Pooling upper benchmark
Under pooling (agent does not learn ), an ability-wise upper benchmark replaces by :
[TABLE]
The FOC is . Let so that and z=W\!\big{(}\tfrac{V}{2}\lambda^{2}\big{)}. Using for gives
[TABLE]
with the understanding that if the interior FOC yields a negative value, the maximizer is the corner and .
Separating feasible benchmark
Fix and consider the monotone policy .
In the pass group the agent sets , guaranteeing success whenever and delivering net . Averaging over yields
[TABLE]
Corollary 3**.**
With and fixed , as we have while ; as , and .
Proof.
Let . Using the expansion ,
[TABLE]
Hence
[TABLE]
For pooling, set and use as . Then
[TABLE]
Plugging into (8) gives
[TABLE]
Therefore, as ,
[TABLE]
so for all sufficiently small , yielding separation by the bounds.
As , in (9) we have and , so . For pooling, choose ; then as , and letting shows . Hence and , proving the large- claim. Monotonicity of the zero of in follows from the implicit function theorem since at the crossing and (both terms scale up with but the separating term does so at lower order near the boundary), so the crossing shifts to larger when increases. ∎
B.2 General power costs: ,
Under the same ability-wise benchmark, the FOC is . Let . Then
[TABLE]
with the corner (hence ) enforced whenever it dominates.
With and ,
[TABLE]
Proposition 9**.**
With (), let be as above and . If is positive (zero, negative), then separation holds by these sufficient bounds (boundary; pooling is sustainable by the bounds), respectively.
Proof.
The feasible separating policy with pass effort delivers the value in (10); hence is a lower bound on the advisor’s optimum.
For pooling, replacing by gives success probability for any effort , so any pooling policy is bounded above by
[TABLE]
The interior FOC is . Writing yields and the value
[TABLE]
with the corner (hence value [math]) taken if it dominates. Therefore is an upper bound on the payoff attainable by any pooling policy.
If , every pooling policy yields strictly less than the feasible separating policy; the benevolent advisor thus strictly prefers a separating policy, so no pooling equilibrium can be optimal. If , pooling is sustainable by the benchmark; and defines the boundary by the two-sided bounds. The statements follow. ∎
Corollary 4**.**
For and fixed , as , and ; as , and .
Proof.
Set . As before,
[TABLE]
For pooling, let solve . As , we have and hence , so
[TABLE]
Using and ,
[TABLE]
where . Since , the bracket simplifies to , so
[TABLE]
Thus, as ,
[TABLE]
so for sufficiently small and separation obtains by the bounds.
For , set and note ; taking shows . In (10), , hence the bracket tends to and . This establishes the large- claim. The outward shift of the zero of in follows from the implicit function theorem, using continuity and the fact that the separating side scales linearly in while the pooling side scales sublinearly near the crossing. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Athey, (2001) Athey, S. (2001). Single crossing properties and the existence of pure strategy equilibria in games of incomplete information. Econometrica , 69(4):861–889.
- 2Athey, (2002) Athey, S. (2002). Monotone comparative statics under uncertainty. Quarterly Journal of Economics , 117(1):187–223.
- 3Bagnoli and Bergstrom, (2005) Bagnoli, M. and Bergstrom, T. (2005). Log-concave probability and its applications. Econometric Theory , 26(2):445–469.
- 4Banks and Sobel, (1987) Banks, J. S. and Sobel, J. (1987). Equilibrium selection in signaling games. Econometrica , 55(3):647–661.
- 5Bertola, (2025) Bertola (2025). Equilibrium failure rates in tests of endogenous pass thresholds. SSRN Working Paper.
- 6Blackwell, (1953) Blackwell, D. (1953). Equivalent comparisons of experiments. Annals of Mathematical Statistics , 24(2):265–272.
- 7Boleslavsky and Cotton, (2015) Boleslavsky, R. and Cotton, C. (2015). Grading standards and education quality. American Economic Journal: Microeconomics , 7(2):248–279.
- 8Cattaneo et al., (2020) Cattaneo, M. D., Idrobo, N., and Titiunik, R. (2020). A Practical Introduction to Regression Discontinuity Designs . Cambridge Elements: Quantitative and Computational Methods for Social Science. Cambridge University Press.
