A conjecture on the Feldman bandit problem

Maher Nouiehed; Sheldon M. Ross

doi:10.1017/jpr.2018.19

Journal of Applied Probability ◽

10.1017/jpr.2018.19 ◽

2018 ◽

Vol 55 (1) ◽

pp. 318-324

Author(s):

Maher Nouiehed ◽

Sheldon M. Ross

Keyword(s):

Bandit Problem ◽

Expected Number

Abstract We consider the Bernoulli bandit problem where one of the arms has win probability α and the others β, with the identity of the α arm specified by initial probabilities. With u = max(α, β), v = min(α, β), call an arm with win probability u a good arm. Whereas it is known that the strategy of always playing the arm with the largest probability of being a good arm maximizes the expected number of wins in the first n games for all n, we conjecture that it also stochastically maximizes the number of wins. That is, we conjecture that this strategy maximizes the probability of at least k wins in the first n games for all k, n. The conjecture is proven when k = 1, when k = n, and when there are only two arms and k = n - 1.
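
The strategy described in this abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names (`update_posterior`, `greedy_arm`, `simulate`) are hypothetical, and it assumes 0 < α, β < 1 and a prior vector `p` in which `p[i]` is the probability that arm i is the α arm. After each play the prior is updated by Bayes' rule, and the next arm is the one most likely to be a good arm.

```python
import random

def update_posterior(p, arm, win, alpha, beta):
    """Bayes update of P(arm i is the alpha arm) after playing `arm`."""
    like_alpha = alpha if win else 1.0 - alpha  # likelihood if `arm` is the alpha arm
    like_beta = beta if win else 1.0 - beta     # likelihood if `arm` is a beta arm
    weights = [pi * (like_alpha if i == arm else like_beta)
               for i, pi in enumerate(p)]
    total = sum(weights)
    return [w / total for w in weights]

def greedy_arm(p, alpha, beta):
    """Arm with the largest probability of being a good arm.

    If alpha > beta the alpha arm is the good one, so pick the largest p[i];
    otherwise every beta arm is good, so pick the smallest p[i].
    """
    if alpha >= beta:
        return max(range(len(p)), key=lambda i: p[i])
    return min(range(len(p)), key=lambda i: p[i])

def simulate(p, alpha, beta, n, rng):
    """Play n games with the greedy strategy and return the number of wins."""
    p = list(p)
    special = rng.choices(range(len(p)), weights=p)[0]  # hidden alpha arm
    wins = 0
    for _ in range(n):
        arm = greedy_arm(p, alpha, beta)
        win = rng.random() < (alpha if arm == special else beta)
        wins += int(win)
        p = update_posterior(p, arm, win, alpha, beta)
    return wins
```

Repeating `simulate` many times and counting the runs with at least k wins gives a Monte Carlo estimate of the probability that the conjecture concerns; this is an illustration only and proves nothing about optimality.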

Randomised allocation of treatments in sequential trials

Advances in Applied Probability ◽

10.1017/s0001867800033449 ◽

1980 ◽

Vol 12 (01) ◽

pp. 174-182 ◽

Cited By ~ 3

Author(s):

John Bather

Keyword(s):

Finite Number ◽

Finite Sequence ◽

Decision Procedures ◽

Bandit Problem ◽

Expected Number ◽

Long Run ◽

Sequential Trials

Given a finite number of different experiments with unknown probabilities p_1, p_2, ···, p_k of success, the multi-armed bandit problem is concerned with maximising the expected number of successes in a sequence of trials. There are many policies which ensure that the proportion of successes converges to p = max(p_1, p_2, ···, p_k) in the long run. This property is established for a class of decision procedures which rely on randomisation, at each stage, in selecting the experiment for the next trial. Further, it is suggested that some of these procedures might perform well over any finite sequence of trials.

Randomised allocation of treatments in sequential trials

Advances in Applied Probability ◽

10.2307/1426500 ◽

1980 ◽

Vol 12 (1) ◽

pp. 174-182 ◽

Cited By ~ 12

Author(s):

John Bather

Keyword(s):

Finite Number ◽

Finite Sequence ◽

Decision Procedures ◽

Bandit Problem ◽

Expected Number ◽

Long Run ◽

Sequential Trials

Given a finite number of different experiments with unknown probabilities p_1, p_2, ···, p_k of success, the multi-armed bandit problem is concerned with maximising the expected number of successes in a sequence of trials. There are many policies which ensure that the proportion of successes converges to p = max(p_1, p_2, ···, p_k) in the long run. This property is established for a class of decision procedures which rely on randomisation, at each stage, in selecting the experiment for the next trial. Further, it is suggested that some of these procedures might perform well over any finite sequence of trials.

Some results for the two armed bandit problem

Optimization ◽

10.1080/02331937608842354 ◽

1976 ◽

Vol 7 (3) ◽

pp. 471-475 ◽

Cited By ~ 1

Author(s):

P.W. Jones

Keyword(s):

Bandit Problem

Exploration by Pigeons on an N-Armed Bandit Problem

PsycEXTRA Dataset ◽

10.1037/e603982013-050 ◽

2007 ◽

Author(s):

Dennis Garlick ◽

Aaron P. Blaisdell

Keyword(s):

Bandit Problem

Do We Learn from Mistakes of Others? A Test of Observational Learning in the Bandit Problem

SSRN Electronic Journal ◽

10.2139/ssrn.2714315 ◽

2014 ◽

Author(s):

Igor Asanov

Keyword(s):

Observational Learning ◽

Bandit Problem

The effect of change in population size on DNA polymorphism.

Genetics ◽

10.1093/genetics/123.3.597 ◽

1989 ◽

Vol 123 (3) ◽

pp. 597-601 ◽

Cited By ~ 61

Author(s):

F Tajima

Keyword(s):

Population Size ◽

DNA Sequences ◽

DNA Polymorphism ◽

Population Bottleneck ◽

Expected Number ◽

Original Population ◽

Current Population ◽

Segregating Sites

Abstract The expected number of segregating sites and the expectation of the average number of nucleotide differences among DNA sequences randomly sampled from a population, which is not in equilibrium, have been developed. The results obtained indicate that, in the case where the population size has changed drastically, the number of segregating sites is influenced by the size of the current population more strongly than is the average number of nucleotide differences, while the average number of nucleotide differences is affected by the size of the original population more severely than is the number of segregating sites. The results also indicate that the average number of nucleotide differences is affected by a population bottleneck more strongly than is the number of segregating sites.
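
The two statistics compared in this abstract can be computed from a sample of aligned sequences as in the following minimal Python sketch. It is an illustration under stated assumptions rather than the paper's derivation: the function names are hypothetical, the sequences are assumed to be equal-length strings with no gaps, and the code computes only the observed values, not their expectations under a changing population size.

```python
from itertools import combinations

def segregating_sites(seqs):
    """Number of alignment columns in which the sample is not monomorphic."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)

def average_pairwise_differences(seqs):
    """Mean number of nucleotide differences over all pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)

# Toy sample of three aligned sequences (made-up data for illustration).
sample = ["ACGT", "ACGA", "TCGA"]
print(segregating_sites(sample))             # columns 1 and 4 vary
print(average_pairwise_differences(sample))  # (1 + 2 + 1) / 3
```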

Intercepting a Stealthy Network

ACM Transactions on Sensor Networks ◽

10.1145/3431223 ◽

2021 ◽

Vol 17 (2) ◽

pp. 1-39

Author(s):

Mai Ben Adar Bessos ◽

Amir Herzberg

Keyword(s):

Mobile Devices ◽

Short Range ◽

Low Energy ◽

Expected Number ◽

Relay Communication

We investigate an understudied threat: networks of stealthy routers (S-Routers), relaying messages to a hidden destination. The S-Routers relay communication along a path of multiple short-range, low-energy hops, to avoid remote localization by triangulation. Mobile devices called Interceptors can detect communication by an S-Router, but only when the Interceptor is next to the transmitting S-Router. We examine algorithms for a set of mobile Interceptors to find the destination of the communication relayed by the S-Routers. The algorithms are compared according to the number of communicating rounds before the destination is found, i.e., rounds in which data is transmitted from the source to the destination. We evaluate the algorithms analytically and using simulations, including against a parametric, optimized strategy for the S-Routers. Our main result is an Interceptors algorithm that bounds the expected number of communicating rounds by a term quasilinear in the number of S-Routers. For the case where S-Routers transmit at every round (“continuously”), we present an algorithm that improves this bound.

Average Number of Nucleotide Differences in a Sample From a Single Subpopulation: A Test for Population Subdivision

Genetics ◽

10.1093/genetics/117.1.149 ◽

1987 ◽

Vol 117 (1) ◽

pp. 149-153

Author(s):

Curtis Strobeck

Keyword(s):

Monte Carlo Simulation ◽

Population Structure ◽

Random Mating ◽

Population Subdivision ◽

Expected Number ◽

Migration Rates ◽

Mating Population ◽

Unbiased Estimates ◽

Increasing Function ◽

Stepping Stone Model

ABSTRACT Unbiased estimates of θ = 4Nµ in a random mating population can be based on either the number of alleles or the average number of nucleotide differences in a sample. However, if there is population structure and the sample is drawn from a single subpopulation, these two estimates of θ behave differently. The expected number of alleles in a sample is an increasing function of the migration rates, whereas the expected average number of nucleotide differences is shown to be independent of the migration rates and equal to 4N_Tµ for a general model of population structure which includes both the island model and the circular stepping-stone model. This contrast in the behavior of these two estimates of θ is used as the basis of a test for population subdivision. Using a Monte-Carlo simulation developed so that independent samples from a single subpopulation could be obtained quickly, this test is shown to be a useful method to determine if there is population subdivision.

Incorporating Geographic Distribution into the Expected Number of Deaths in a Comparative Study

Statistical Sciences and Data Analysis ◽

10.1515/9783112318867-004 ◽

1993 ◽

pp. 25-32

Keyword(s):

Comparative Study ◽

Geographic Distribution ◽

Expected Number

A Search for a General Phenomenon of Adaptive Mutability

Genetics ◽

10.1093/genetics/143.2.645 ◽

1996 ◽

Vol 143 (2) ◽

pp. 645-659 ◽

Cited By ~ 6

Author(s):

Timothy Galitski ◽

John R Roth

Keyword(s):

Rank Correlation ◽

Selective Medium ◽

Chromosomal Gene ◽

Galactosidase Activity ◽

Bacteriophage Mu ◽

Reversion Rate ◽

General Phenomenon ◽

Expected Number ◽

Leaky Mutants ◽

Residual Growth

Abstract The most prominent systems for the study of adaptive mutability depend on the specialized activities of genetic elements like bacteriophage Mu and the F plasmid. Searching for general adaptive mutability, we have investigated the behavior of Salmonella typhimurium strains with chromosomal lacZ mutations. We have studied 30 revertible nonsense, missense, frameshift, and insertion alleles. One-third of the mutants produced ≥10 late revertant colonies (appearing three to seven days after plating on selective medium). For the prolific mutants, the number of late revertants showed rank correlation with the residual β-galactosidase activity; for the same mutants, revertant number showed no correlation with the nonselective reversion rate (from fluctuation tests). Leaky mutants, which grew slowly on selective medium, produced late revertants whereas tight nongrowing mutants generally did not produce late revertants. However, the number of late revertants was not proportional to residual growth. Using total residual growth and the nonselective reversion rate, the expected number of late revertants was calculated. For several leaky mutants, the observed revertant number exceeded the expected number. We suggest that excess late revertants from these mutants arise from general adaptive mutability available to any chromosomal gene.
