The Best-or-Worst and the Postdoc problems
L. Bayon, P. Fortuny Ayuso, J.M. Grau, A.M. Oller-Marcen, M.M. Ruiz

TL;DR
This paper analyzes two variants of the secretary problem, establishing their optimal strategies and exploring how additional costs or different payoff structures affect these strategies.
Contribution
It proves that the Best-or-Worst and Postdoc problems share the same optimal stopping rule in their standard form and examines how various costs and payoffs alter optimal strategies.
Findings
Both variants share the same optimal stopping rule in standard form.
Additional costs or payoffs significantly change the optimal strategies.
Different payments based on candidate quality influence the strategy choices.
Abstract
We consider two variants of the secretary problem, the\emph{ Best-or-Worst} and the \emph{Postdoc} problems, which are closely related. First, we prove that both variants, in their standard form with binary payoff 1 or 0, share the same optimal stopping rule. We also consider additional cost/perquisites depending on the number of interviewed candidates. In these situations the optimal strategies are very different. Finally, we also focus on the Best-or-Worst variant with different payments depending on whether the selected candidate is the best or the worst.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Auction Theory and Applications · Cryptography and Data Security
The Best-or-Worst and the Postdoc problems
L. Bayón
Departamento de Matemáticas, Universidad de Oviedo
Avda. Calvo Sotelo s/n, 33007 Oviedo, Spain
,
P. Fortuny Ayuso
Departamento de Matemáticas, Universidad de Oviedo
Avda. Calvo Sotelo s/n, 33007 Oviedo, Spain
,
J.M. Grau
Departamento de Matemáticas, Universidad de Oviedo
Avda. Calvo Sotelo s/n, 33007 Oviedo, Spain
,
A. M. Oller-Marcén
Centro Universitario de la Defensa de Zaragoza
Ctra. Huesca s/n, 50090 Zaragoza, Spain
and
M.M. Ruiz
Departamento de Matemáticas, Universidad de Oviedo
Avda. Calvo Sotelo s/n, 33007 Oviedo, Spain
Abstract.
We consider two variants of the secretary problem, the* Best-or-Worst* and the Postdoc problems, which are closely related. First, we prove that both variants, in their standard form with binary payoff 1 or 0, share the same optimal stopping rule. We also consider additional cost/perquisites depending on the number of interviewed candidates. In these situations the optimal strategies are very different. Finally, we also focus on the Best-or-Worst variant with different payments depending on whether the selected candidate is the best or the worst.
Key words and phrases:
Keywords: Secretary problem, Combinatorial Optimization
AMS 2010 Mathematics Subject Classification 60G40, 62L15
1. Introduction
The secretary problem is one of many names for a famous problem of optimal stopping theory. This problem can be stated as follows: an employer is willing to hire the best secretary out of rankable candidates. These candidates are interviewed one by one in random order. A decision about each particular candidate is to be made immediately after the interview. Once rejected, a candidate cannot be called back. During the interview, the employer can rank the candidate among all the preceding ones, but he is unaware of the quality of yet unseen candidates. The goal is then to determine the optimal strategy that maximizes the probability of selecting the best candidate.
This problem has a very elegant solution. Dynkin [4] and Lindley [13] independently proved that the best strategy consists in a so-called threshold strategy. Namely, in rejecting roughly the first (cutoff value) interviewed candidates and then choosing the first one that is better than all the preceding ones. Following this strategy, the probability of selecting the best candidate is at least , this being its approximate value for large values of . This well-known solution was later refined by Gilbert and Mosteller [11] showing that is a better approximation than , although the difference is never greater than 1.
This secretary problem has been addressed by many authors in different fields such as applied probability, statistics or decision theory. In [5], [7] or [17] extensive bibliographies on the topic can be found. On the other hand, different generalizations of this classical problem have been recently considered in the framework of partially ordered objects [8, 9, 10] or matroids [1, 15]. It is also worth mentioning the work of Bearden [2], where the author considers a situation where the employer receives a payoff for selecting a candidate equal to the “score” of the candidate (in the classical problem the payoff is 1 if the candidate is really the best and 0 otherwise). In this situation, the optimal cutoff value is roughly the square root of the number of candidates.
In this paper we focus on two closely related variants of the secretary problem. The so-called Best-or-Worst and Postdoc variants. In the Best-or-Worst variant, the classic secretary problem is modified so that the goal is to select either the best or the worst candidate, indifferent between the two cases. This variant can only be found on [6] as a multicriteria problem in the perfect negative dependence case. Here we present it in greater detail. In the Postdoc variant, instead of selecting the best candidate, the goal is to select the second best candidate. This problem was proposed to Robert J. Vanderbei by Eugene Dynkin in 1980 with the following motivating story that explains the name of the problem: we are trying to hire a postdoc, since the best candidate will receive (and accept) an offer from Harvard, we are interested in hiring the second best candidate. Vanderbei himself solved the problem in 1983 using dynamic programming [18]. However, he never published his work because he learned that Rose had already published his own solution using different techniques [14]. Moreover, Szajowski had already solved the problem of picking the -th better candidate for [16].
In the present paper, for these two variants, we study the standard problem (binary payoff function 1 or 0), showing that both have the same optimal cutoff rule strategy and also the problems considering payoff functions that depend on the number of performed interviews, showing that in this case they have very different optimal strategies.
The paper is organized as follows: in Section 2, we present some technical results, in Section 3, we revisit the classic secretary problem and also solve two new situations with payoff functions that depend on the number of performed interviews. In Section 4 we focus on the Best-or-Worst variant, solving the problem for three different payoff functions and also presenting a variant in which the choice of the best or the worst candidate is no longer indifferent. In Section 5 we solve the three versions of the Postdoc variant and, finally, we compare the obtained results in Section 6.
2. Two technical results
The following result can be widely applied in different optimal stopping problems and it will be extensively used throughout the paper. For a sequence of continuous real functions defined on a closed interval, it determines the asymptotic behavior of the sequence , where is the value for which the function reaches its maximum.
Proposition 1**.**
Let be a sequence of real functions with and let be the value for which the function reaches its maximum. Assume that the sequence of functions given by converges uniformly on to a function and that is the only global maximum of in . Then,
- i)
.
- ii)
.
- iii)
If then .
Proof.
- i)
Let us consider the sequence and assume that is a subsequence that converges to . Then,
[TABLE]
Consequently, since uniformly on , if we take limits we get
[TABLE]
and since is the only global maximum of , it follows that .
Thus, we have proved that every convergent subsequence of converges to the same limit . Since is defined on a compact set this implies that itself must also converge to .
- ii)
It is enough to observe that
[TABLE]
where the last equality holds because uniformly on .
- iii)
If , then it also holds that and we can reason as in the previous point.
∎
Remark**.**
The condition of uniform convergence is required to ensure, for instance, that . In fact, it is easy to give counterexamples to Proposition 1 if convergence is not uniform.
Observe that Proposition 1 implies that that . Moreover, it also implies that . This means that is a good estimate for and that, for large values of , the maximum value of approaches .
Proposition 1 admits the following two-variable version that can be proved in the same way.
Proposition 2**.**
Let be a sequence of two variable real functions with G_{n}\in\mathcal{C}\big{(}\{(x,y)\in[0,n]^{2}:x\leq y\}\big{)} and let be a point for which reaches its maximum. Assume that the sequence given by converges uniformly on to a function and that is the only global maximum of in . Then,
- i)
* for .*
- ii)
**
- iii)
If for , then .
3. A new look at the classic secretary problem
In the classical secretary problem, let be the number of candidates and let us consider a cutoff value . If is an integer, the probability of successfully selecting the best candidate in the -th interview is . Thus, the probability function of succeeding in the classical secretary problem with candidates using as cutoff value, is given by
[TABLE]
The goal is now to determine the value of that maximizes this probability (i.e., to determine the optimal cutoff value) and to compute this maximum probability. This can be done using Proposition 1 in the following way. First, we extend to a real variable function by
[TABLE]
where is the so-called digamma function. Then, it can be seen with little effort that the sequence of functions defined by converges uniformly on to the function and the remaining is just some elementary calculus.
Remark**.**
In [5] the following rather lax reasoning showing that tends to is given. If we let tend to infinity and write as the limit of , then using for and for , the sum becomes a Riemann approximation to an integral
[TABLE]
Proposition 1 provides a more rigorous approach.
We introduce a more general situation. Let be a function (payoff function) and assume that a payoff of is received if the -th candidate is selected. In this setting, the expected payoff is
[TABLE]
Note that in the classical situation
[TABLE]
and the expected payoff coincides with the probability of successfully selecting the best candidate.
Now, let us modify the classical situation considering that performing each interview has a constant cost of . Clearly, in this situation the payoff function is given by
[TABLE]
and the expected payoff is
[TABLE]
The following result provides the optimal cutoff value and the maximum expected payoff in this setting. In what follows, we denote by the main branch of the so-called Lambert- function, defined by .
Proposition 3**.**
Given an integer , let us consider the function
[TABLE]
defined for every integer and let be the value for which the function reaches its maximum. Then,
- i)
.
- ii)
.
Proof.
First, we extend to a real variable function by
[TABLE]
Now, it can be seen that converges uniformly in to . To conclude the proof it is enough to apply Proposition 1 together with some straightforward computations. ∎
This result means that the optimal strategy in this setting consists in rejecting roughly the first interviewed candidates and then accepting the first candidate which is better than all the preceding ones. Following this strategy, the maximum expected payoff is asymptotically equal to .
Remark**.**
The constant (A106533 in OEIS) appears in [7] (erroneously approximated as 0.20388) in the context of the Best-Choice Duration Problem considering a payoff of . Furthermore, as a noteworthy curiosity, it should be pointed out that this constant has appeared in a completely different context from the one addressed here (the Daley-Kendall model) and it is known as the rumour’s constant [3, 12].
Now, let us consider that performing each interview has an perquisite of . Clearly, in this situation the payoff function is given by
[TABLE]
and the expected payoff is
[TABLE]
The following result provides the optimal cutoff value and the maximum expected payoff in this setting.
Proposition 4**.**
Given an integer , let us consider the function
[TABLE]
defined for every integer and let be the value for which the function reaches its maximum. Then,
- i)
.
- ii)
.
Proof.
First, we extend to a real variable function by
[TABLE]
Now it can be seen that converges uniformly in to . To conclude the proof it is enough to apply Proposition 1 together with some straightforward computations. ∎
This result means that the optimal strategy in this setting consists in rejecting roughly the first interviewed candidates and then accepting the first candidate which is better than all the preceding ones. Following this strategy, the maximum expected payoff is asymptotically equal to .
4. The Best-or-Worst variant
In this section we focus on the Best-or-Worst variant, as described in the introduction, in which the goal is to select either the best or the worst candidate, indifferent between the two cases. First of all we prove that, just like in the classic problem, the optimal strategy is a threshold strategy.
Theorem 1**.**
For the Best-or-Worst variant, if is the number of objects, there exists such that the following strategy is optimal:
- (1)
Reject the first interviewed candidates. 2. (2)
After that, accept the first candidate which is either better or worse than all the preceding ones.
Proof.
For the sake of brevity, a candidate which is either better or worse than all the preceding ones will be called a nice candidate.
Since the game under consideration is finite, there must exist an optimal strategy (in the sense that it maximizes the probability of success). Hence, we can define as the probability of success following an optimal strategy when rejecting a candidate in the -th interview (regardless of its being a nice candidate or not). We can also define as the probability of success accepting a nice candidate in the -th interview. Any optimal strategy will reject any non-nice candidate since the probability of being a successful choice will be [math].
Probability is , which increases with . On the other hand, the function is non-increasing because
[TABLE]
Thus, since is increasing and is non-increasing and given that and , there exists a natural number for which:
[TABLE]
[TABLE]
As a consequence of this fact, the following strategy must be optimal: for each -th interview with do the following:
- •
Reject the -th candidate if or if it is not a nice candidate.
- •
Accept the -th candidate if and it is a nice candidate.
Note that the optimality of this strategy follows from the fact that, in each interview, we are choosing the action with greatest probability of success. ∎
Once that we have determined the optimal strategy, we focus on determining the probability of success in the -th interview. To do so, let be the number of candidates and let us consider a cutoff value . If is an integer, the probability of successfully selecting the best or the worst candidate in the -th interview is . Thus, the probability function of succeeding in the Best-or-Worst variant with candidates using as cutoff value, is given by
[TABLE]
where the last equality follows using telescopic sums.
Remark**.**
Note that for , it is straightforward to see that the probability of success is
[TABLE]
The goal is now to determine the value of that maximizes the probability (i.e., to determine the optimal cutoff value) and to compute this maximum probability. We do so in the following result.
Theorem 2**.**
Given a positive integer , let us consider the function
[TABLE]
defined for every integer and let be the value for which the function reaches its maximum. Then,
- i)
.
- ii)
The maximum value of is:
[TABLE]
Proof.
- i)
Since is the equation of a parabola in the variable , it is clear that
[TABLE]
Now,
[TABLE]
so it follows that
[TABLE]
Consequently,
[TABLE]
as claimed.
- ii)
It is enough to apply the previous result.
If is even, then and
[TABLE]
Moreover, in this case
[TABLE]
so it follows that
[TABLE]
as claimed.
Otherwise, if is odd, then and
[TABLE]
In this case
[TABLE]
so we also have that
[TABLE]
and the proof is complete.
∎
This result means that, for , optimal strategy in this setting consists in rejecting roughly the first interviewed candidates and then accepting the first candidate which is either better or worse than all the preceding ones. Following this strategy, the maximum probability of success is . In the cases , it is evident that an optimal cutoff value is , i.e. to accept the first candidate that we consider The probability of success is 1 in both cases according to the fact that .
Remark**.**
Unlike in the classic secretary problem, the probability of success in the Best-or-Worst variant is not strictly increasing in . In fact, we have that for every .
We are now going to consider the Best-or-Worst variant with the payoff function given in (2); i.e., we assume that performing each interview has a constant cost of . Under this assumption it can be proved that the optimal strategy is the same threshold strategy given in Theorem 1. Moreover, in this setting, the expected payoff with candidates and cutoff value is given by
[TABLE]
As usual, the goal is to determine the optimal cutoff value that maximizes the expected payoff and to compute this maximum expected payoff. We do so in the following result.
Theorem 3**.**
Given an integer , let us consider the function defined above for every integer and let be the value for which the function reaches its maximum. Also, let
[TABLE]
be the solution to the equation . Then, the following hold:
- i)
.
- ii)
**
Proof.
First, observe that
[TABLE]
Now, we can extend to a real variable function by
[TABLE]
Furthermore, it can be seen that the sequence of functions converges uniformly in to the function .
To conclude the proof it is enough to apply Proposition 1 together with some straightforward computations. ∎
Remark**.**
The constant also appears related to rumour theory [3, 12] and to Gabriel’s Horn (see A101314 in OEIS).
Now, let us consider the Best-or-Worst variant with the payoff function given in (3); i.e., we assume that performing each interview has an additional payoff of . Under this assumption, since the payoff increases with the number of interviews, it can be proved that the optimal strategy is again the same threshold strategy given in Theorem 1. Moreover, in this setting, the expected payoff with candidates and cutoff value is given by
[TABLE]
The optimal cutoff value that maximizes the expected payoff and this maximum expected payoff are determined in following result.
Theorem 4**.**
Given an integer , let us consider the function defined above for every integer and let be the value for which the function reaches its maximum. Also let
[TABLE]
be the solution to the equation . Then, the following hold:
- i)
.
- ii)
**
Proof.
First, observe that
[TABLE]
Now, we can extend to a real variable function by
[TABLE]
Furthermore, it can be seen that the sequence of functions converges uniformly on to .
To conclude the proof it is enough to apply Proposition 1 together with some straightforward computations. ∎
So far, we have considered the Best-or-Worst variant in which the goal is to select either the best or the worst candidate, indifferent between the two cases. To finish this section we are going to further modify the Best-or-Worst variant. In particular we are going to consider different payoff depending on whether we select the best or the worst candidate. In paticular we are going to consider the following payoff function, with .
[TABLE]
In this new setting the optimal strategy has two thresholds, as stated in the following result, whose proof is analogue to that of Theorem 1.
Theorem 5**.**
For the Best-or-Worst variant, if is the number of candidates and the payments for selecting the worst and the best candidates are, respectively, , there exist such that the following strategy is optimal:
- (1)
Reject the first interviewed candidates. 2. (2)
Accept the first candidate which is better than all the preceding ones until reaching the -th candidate. 3. (3)
After that, accept the first candidate which is either better or worse than all the preceding ones.
Now, let be the number of candidates and let us consider cutoff values . Then, if is an integer, the probability of successfully selecting the best candidate in the -th interview is given by
[TABLE]
On the other hand, if is an integer, the probability of successfully selecting the best or the worst candidate in the -th interview is given by
[TABLE]
Because, according to the optimal strategy we can only select the worst candidate if .
Consequently, the expected payoff with candidates and cutoff values is given by
[TABLE]
The following result determines the cutoff values as well as the corresponding maximum expected payoff.
Theorem 6**.**
Given a positive integer , let us consider the function defined above for every pair of integers in the set and let be the point for which reaches its maximum. Then,
- i)
**
- ii)
**
- iii)
**
Proof.
Let us define the sequence of functions by . Then,
[TABLE]
and the convergence is uniform on .
Hence, we can apply Proposition 2. To do so, observe that is a concave function on the convex set with a negative definite hessian matrix. Since has only one critical point, namely
[TABLE]
and
[TABLE]
the result follows. ∎
This result means that the optimal strategy in this setting consists in rejecting roughly the first interviewed candidates, then accepting the first candidate which is better than all the preceding ones until reaching roughly the candidate and, finally accepting the first candidate which is either better or worse than all the preceding ones. Following this strategy, the maximum expected payoff is asymptotically equal to .
Remark**.**
If the cuttof values obtained in Theorem 6 are, approximately, and . This means that the optimal strategy ignores the objective of obtaining the worst candidate and we recover the original secretary problem. In addition, if , then both cutoff values coincide with and we recover the original Best-or-Worst variant.
5. The Postdoc variant
In this section we focus on the Postdoc variant, as described in the introduction, in which the goal is to select the second best candidate. First of all we have to prove that, just like in classic problem, the optimal strategy is a threshold strategy.
In this variant it is not obvious that the optimal strategy has only one threshold. This is because the candidate considered in a given interview could be selected both if it is better or the second better than all the preceding ones and in both cases it could end up being the second best candidate. However, we are going to see that selecting a candidate which is better than all the preceding ones is never preferable to waiting for a candidate which is the second better than all the preceding ones.
Assume for a moment that we are following a threshold strategy. Let be the number of candidates and let us consider a cutoff value . If is an integer, the probability of successfully selecting the best or the worst candidate in the -th interview is . Thus, the probability function of succeeding in the Postdoc variant with candidates using as cutoff value and provided we are following a threshold strategy for the second best candidate, is given by
[TABLE]
Note that the following holds:
[TABLE]
On the other hand, let us denote by the probability of success after the -th interview provided we have already selected a candidate which is better than all the preceding ones. Then, the probability of finding the second best candidate in the -th interview is and, furthermore, the probability of not finding a better candidate among all the remaining interviews is . On the other hand, the probability of not obtaining the second best candidate in the -th interview is and the probability of success in this case will be . Hence,
[TABLE]
Thus, we have seen that and both satisfy the same recurrence relation in . Moreover, it holds that so, consequently, we obtain that for every .
Note that this means that the optimal strategy can neglect if a given candidate is better than all the preceding ones and focus only on whether the candidate is the second better than all the preceding ones and thus the optimal strategy has only one threshold.
Theorem 7**.**
For the Postdoc variant, if is the number of candidates, there exists such that the following strategy is optimal:
- (1)
Reject the first interviewed candidates. 2. (2)
After that, accept the first candidate which is the second best until then.
Proof.
Just use the same ideas as in Theorem 1. ∎
Thus, the probability function of succeeding in the Postdoc variant with candidates using as cutoff value, is given by
[TABLE]
Observe that we have obtained that . Consequently, if we follow the previous strategy, the optimal cutoff value is the same as in the Best-or-Worst variant; i.e., ) and the maximum probability of success is one half of the maximum probability of success in the Best-or-Worst variant (see Theorem 2).
We are now going to consider the Postdoc variant with the payoff function given in (2); i.e., we assume that performing each interview has a constant cost of . Under this assumption it can be proved that the optimal strategy has two thresholds.
Theorem 8**.**
For the Postdoc variant, if is the number of candidates and if the payoff function is given by (2), there exist , such that the following strategy is optimal:
- (1)
Reject the first interviewed candidates. 2. (2)
Accept the first candidate which is better than all the preceding ones until reaching the -th candidate. 3. (3)
After that, accept the first candidate which is either better or second better than all the preceding ones.
Proof.
Proceed as in Theorem 1 with each threshold separately. ∎
Under this strategy, the probability of successfully selecting the second best candidate in the -th interview is given by the function
[TABLE]
Consequently, the expected payoff with candidates and cutoff values is given by
[TABLE]
In the following result we determine the optimal cutoff values and the maximum expected payoff.
Theorem 9**.**
Given a positive integer let us consider the function defined above for every and let be the point for which reaches its maximum. Then,
- i)
**
- ii)
**
- iii)
**
Proof.
First of all, observe that
[TABLE]
Thus, if we define the sequence of functions by , it follows that
[TABLE]
and the convergence is uniform on .
Using elementary techniques we get that reaches its absolute maximum at the point with is the solution to and is the solution to .
The fact that concludes the proof. ∎
Finally, let us consider the Postdoc variant with the payoff function given in (3); i.e., we assume that performing each interview has an additional payoff of . Under this assumption, it is clear that no optimal strategy will accept a candidate which is better than all the preceding ones because, if the search continues, the probability of success is the same and the payoff will be greater. Hence, we must only consider strategies with one threshold for the second best candidate, as in Theorem 7, ignoring if the interviewed candidate in better than the preceding ones. In this setting, the expected payoff with candidates and cutoff value is given by
[TABLE]
The optimal cutoff value that maximizes the expected payoff and this maximum expected payoff are determined in the following result.
Theorem 10**.**
Given an integer , let us consider the function defined above for every integer and let be the value for which the function reaches its maximum. Then, the following hold:
- i)
**
- ii)
**
Proof.
Since is a degree 3 polynomial, we can explicitly obtain the exact value of by elementary methods. Namely,
[TABLE]
The result follows immediately. ∎
Remark**.**
Note that we can further refine the previous result by noting that . In this case, is the optimal cutoff value for all up to 10000, without any exception.
6. Conclusions
In this paper, we have analyzed two variants of the secretary problem which happen to be closely related: the Postdoc and the Best-or-Worst variants. Both of them have the same optimal threshold strategy and the mean payoff for the first one is twice as for the second one.
We now show a comparative table of the asymptotic optimal cutoff value (ACV) given by and the the asymptotic maximum expected payoff (AMP) in the classical secretary problem, in the Best-or-Worst variant and in the Postdoc variant with payoff functions , and . In the case of the Postdoc variant with payoff function , in the cell corresponding to we show the two thresholds related to the optimal strategy in that setting.
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Babaioff, N. Immorlica and R. Kleinberg. Matroids, secretary problems, and online mechanisms. Proc. SODA . 434-443, 2007.
- 2[2] J.N. Bearden. A new secretary problem with rank-based selection and cardinal payoffs. Journal of Mathematical Psychology , 50: 58-59. 2006.
- 3[3] D.J. Daley and D.G. Kendall. Stochastic rumours. Journal of the Institute of Mathematics and Its Applications , 1:42-55. 1965.
- 4[4] E.B. Dynkin. The optimum choice of the instant for stopping a markov process. Soviet Mathematics - Doklady , 4:627-629, 1963.
- 5[5] T.S. Ferguson. Who solved the secretary problem? Statistical Science , 4(3):282-296. 1989.
- 6[6] T.S. Ferguson. The Best-Choice Problems with Dependent Criteria. Contemporary Mathematics . 25:135-151. 1992.
- 7[7] T. S. Ferguson, J. P. Hardwick and M. Tamaki. Maximizing the duration of owning a relatively best object. Contemporary Mathematics: Strategies for Sequential Search and Selection in Real Time, American Mathematics Association (T. Ferguson and S. Samuels, eds) , 125:37-58. 1991.
- 8[8] R. Freij and J. Wastlund. Partially ordered secretaries, Electron. Comm. Probab. , 15:504-507. 2010.
