Urn sampling distributions giving alternate correspondences between two optimal stopping problems

2016 ◽  
Vol 48 (3) ◽  
pp. 726-743 ◽  
Author(s):  
Mitsushi Tamaki

Abstract The best-choice problem and the duration problem, known as versions of the secretary problem, are concerned with choosing an object from those that appear sequentially. Let (B,p) denote the best-choice problem and (D,p) the duration problem when the total number N of objects is a bounded random variable with prior p=(p1, p2,...,pn) for a known upper bound n. Gnedin (2005) discovered the correspondence relation between these two quite different optimal stopping problems. That is, for any given prior p, there exists another prior q such that (D,p) is equivalent to (B,q). In this paper, motivated by his discovery, we attempt to find the alternate correspondence {p(m),m≥0}, i.e. an infinite sequence of priors such that (D,p(m-1)) is equivalent to (B,p(m)) for all m≥1, starting with p(0)=(0,...,0,1). To be more precise, the duration problem is distinguished into (D1,p) or (D2,p), referred to as model 1 or model 2, depending on whether the planning horizon is N or n. The aforementioned problem is model 1. For model 2 as well, we can find the similar alternate correspondence {p[m],m≥ 0}. We treat both the no-information model and the full-information model and examine the limiting behaviors of their optimal rules and optimal values related to the alternate correspondences as n→∞. A generalization of the no-information model is given. It is worth mentioning that the alternate correspondences for model 1 and model 2 are respectively related to the urn sampling models without replacement and with replacement.

1983 ◽  
Vol 20 (1) ◽  
pp. 165-171 ◽  
Author(s):  
Joseph D. Petruccelli

We consider the problem of maximizing the probability of choosing the largest from a sequence of N observations when N is a bounded random variable. The present paper gives a necessary and sufficient condition, based on the distribution of N, for the optimal stopping rule to have a particularly simple form: what Rasmussen and Robbins (1975) call an s(r) rule. A second result indicates that optimal stopping rules for this problem can, with one restriction, take virtually any form.


1984 ◽  
Vol 16 (01) ◽  
pp. 111-130
Author(s):  
Joseph D. Petruccelli

From one point of view this paper adds to a previous formulation of the best-choice problem (Petruccelli (1981)) the possibility that the number of available observations, rather than being known, is a bounded random variable N with known distribution. From another perspective, it expands the formulations of Presman and Sonin (1972) and Rasmussen and Robbins (1975) to include recall and uncertainty of selection of observations. The behaviour of optimal stopping rules is examined under various assumptions on the general model. For optimal stopping rules and their probabilities of best choice relations are obtained between the bounded and unbounded N cases. Two particular classes of stopping rules which generalize the s(r) rules of Rasmussen and Robbins (1975) are considered in detail.


1984 ◽  
Vol 16 (1) ◽  
pp. 111-130 ◽  
Author(s):  
Joseph D. Petruccelli

From one point of view this paper adds to a previous formulation of the best-choice problem (Petruccelli (1981)) the possibility that the number of available observations, rather than being known, is a bounded random variable N with known distribution. From another perspective, it expands the formulations of Presman and Sonin (1972) and Rasmussen and Robbins (1975) to include recall and uncertainty of selection of observations. The behaviour of optimal stopping rules is examined under various assumptions on the general model. For optimal stopping rules and their probabilities of best choice relations are obtained between the bounded and unbounded N cases. Two particular classes of stopping rules which generalize the s(r) rules of Rasmussen and Robbins (1975) are considered in detail.


1983 ◽  
Vol 20 (01) ◽  
pp. 165-171 ◽  
Author(s):  
Joseph D. Petruccelli

We consider the problem of maximizing the probability of choosing the largest from a sequence of N observations when N is a bounded random variable. The present paper gives a necessary and sufficient condition, based on the distribution of N, for the optimal stopping rule to have a particularly simple form: what Rasmussen and Robbins (1975) call an s(r) rule. A second result indicates that optimal stopping rules for this problem can, with one restriction, take virtually any form.


2015 ◽  
Vol 52 (4) ◽  
pp. 926-940 ◽  
Author(s):  
Mitsushi Tamaki

As a class of optimal stopping problems with monotone thresholds, we define the candidate-choice problem (CCP) and derive two formulae for calculating its expected payoff. We apply the first formula to a particular CCP, i.e. the best-choice duration problem treated by Ferguson et al. (1992). The recall case is also examined as a comparison. We also derive the distribution of the stopping time of the CCP and find, as a by-product, that the best-choice problem has a remarkable feature in that the optimal probability of choosing the best is just the expected value of the (proportional) stopping time. The similarity between the best-choice duration problem and the best-choice problem with uniform freeze studied by Samuel-Cahn (1996) is recognized.


2015 ◽  
Vol 52 (04) ◽  
pp. 926-940 ◽  
Author(s):  
Mitsushi Tamaki

As a class of optimal stopping problems with monotone thresholds, we define the candidate-choice problem (CCP) and derive two formulae for calculating its expected payoff. We apply the first formula to a particular CCP, i.e. the best-choice duration problem treated by Ferguson et al. (1992). The recall case is also examined as a comparison. We also derive the distribution of the stopping time of the CCP and find, as a by-product, that the best-choice problem has a remarkable feature in that the optimal probability of choosing the best is just the expected value of the (proportional) stopping time. The similarity between the best-choice duration problem and the best-choice problem with uniform freeze studied by Samuel-Cahn (1996) is recognized.


2013 ◽  
Vol 45 (04) ◽  
pp. 1028-1048 ◽  
Author(s):  
Mitsushi Tamaki

As a version of the secretary problem, Ferguson, Hardwick and Tamaki (1992) considered an optimal stopping problem called the duration problem. The basic duration problem is the classical duration problem, in which the objective is to maximize the time of possession of a relatively best object when a known number of rankable objects appear in random order. In this paper we generalize this classical problem in two directions by allowing the number N (of available objects) to be a random variable with a known upper bound n and also allowing the objects to appear in accordance with Bernoulli trials. Two models can be considered for our random horizon duration problem according to whether the planning horizon is N or n. Since the form of the optimal rule is in general complicated, our main concern is to give to each model a sufficient condition for the optimal rule to be simple. For N having a uniform, generalized uniform, or curtailed geometric distribution, the optimal rule is shown to be simple in the so-called secretary case. The asymptotic results, as n → ∞, will also be given for these priors.


2013 ◽  
Vol 45 (4) ◽  
pp. 1028-1048
Author(s):  
Mitsushi Tamaki

As a version of the secretary problem, Ferguson, Hardwick and Tamaki (1992) considered an optimal stopping problem called the duration problem. The basic duration problem is the classical duration problem, in which the objective is to maximize the time of possession of a relatively best object when a known number of rankable objects appear in random order. In this paper we generalize this classical problem in two directions by allowing the number N (of available objects) to be a random variable with a known upper bound n and also allowing the objects to appear in accordance with Bernoulli trials. Two models can be considered for our random horizon duration problem according to whether the planning horizon is N or n. Since the form of the optimal rule is in general complicated, our main concern is to give to each model a sufficient condition for the optimal rule to be simple. For N having a uniform, generalized uniform, or curtailed geometric distribution, the optimal rule is shown to be simple in the so-called secretary case. The asymptotic results, as n → ∞, will also be given for these priors.


1988 ◽  
Vol 25 (3) ◽  
pp. 544-552 ◽  
Author(s):  
Masami Yasuda

This paper treats stopping problems on Markov chains in which the OLA (one-step look ahead) policy is optimal. Its associated optimal value can be explicitly expressed by a potential for a charge function of the difference between the immediate reward and the one-step-after reward. As an application to the best choice problem, we shall obtain the value of three problems: the classical secretary problem, a problem with a refusal probability and a problem with a random number of objects.


Sign in / Sign up

Export Citation Format

Share Document