Bounds and good policies in stationary finite–stage Markovian decision problems

Gerhard Hübner

doi:10.2307/1426499

Bounds and good policies in stationary finite–stage Markovian decision problems

Advances in Applied Probability ◽

10.2307/1426499 ◽

1980 ◽

Vol 12 (1) ◽

pp. 154-173 ◽

Cited By ~ 9

Author(s):

Gerhard Hübner

Keyword(s):

Decision Model ◽

Transition Probabilities ◽

Planning Horizon ◽

Decision Problems ◽

Optimal Decisions ◽

Optimal Value ◽

Action Spaces ◽

Stationary Problems ◽

Markovian Decision Problems

A stationary Markovian decision model is considered with general state and action spaces where the transition probabilities are weakened to be bounded transition measures (this is useful for many applications). New and improved bounds are given for the optimal value of stationary problems with a large planning horizon if either only a few steps of iteration are carried out or, in addition, a solution of the infinite-stage problem is known. Similar estimates are obtained for the quality of policies which are composed of nearly optimal decisions from the first few steps or from the infinite-stage solution.

Download Full-text

Bounds and good policies in stationary finite–stage Markovian decision problems

Advances in Applied Probability ◽

10.1017/s0001867800033437 ◽

1980 ◽

Vol 12 (01) ◽

pp. 154-173

Author(s):

Gerhard Hübner

Keyword(s):

Decision Model ◽

Transition Probabilities ◽

Planning Horizon ◽

Decision Problems ◽

Optimal Decisions ◽

Optimal Value ◽

Action Spaces ◽

Stationary Problems ◽

Markovian Decision Problems

Download Full-text

Approximating general Markovian decision-problems by clustering their state- and action-spaces

Mathematische Operationsforschung und Statistik Series Optimization ◽

10.1080/02331938408842915 ◽

1984 ◽

Vol 15 (1) ◽

pp. 135-144 ◽

Cited By ~ 2

Author(s):

Willibald Doeringer

Keyword(s):

Decision Problems ◽

Action Spaces ◽

Markovian Decision Problems

Download Full-text

Feature Extraction and Classification of Citrus Juice by Using an Enhanced L-KSVD on Data Obtained from Electronic Nose

Sensors ◽

10.3390/s19040916 ◽

2019 ◽

Vol 19 (4) ◽

pp. 916 ◽

Cited By ~ 2

Author(s):

Wen Cao ◽

Chunmei Liu ◽

Pengfei Jia

Keyword(s):

Feature Extraction ◽

Kernel Function ◽

Electronic Nose ◽

Classification Accuracy ◽

Extraction Methods ◽

Object Function ◽

Optimal Value ◽

Processed Products

Aroma plays a significant role in the quality of citrus fruits and processed products. The detection and analysis of citrus volatiles can be measured by an electronic nose (E-nose); in this paper, an E-nose is employed to classify the juice which is stored for different days. Feature extraction and classification are two important requirements for an E-nose. During the training process, a classifier can optimize its own parameters to achieve a better classification accuracy but cannot decide its input data which is treated by feature extraction methods, so the classification result is not always ideal. Label consistent KSVD (L-KSVD) is a novel technique which can extract the feature and classify the data at the same time, and such an operation can improve the classification accuracy. We propose an enhanced L-KSVD called E-LCKSVD for E-nose in this paper. During E-LCKSVD, we introduce a kernel function to the traditional L-KSVD and present a new initialization technique of its dictionary; finally, the weighted coefficients of different parts of its object function is studied, and enhanced quantum-behaved particle swarm optimization (EQPSO) is employed to optimize these coefficients. During the experimental section, we firstly find the classification accuracy of KSVD, and L-KSVD is improved with the help of the kernel function; this can prove that their ability of dealing nonlinear data is improved. Then, we compare the results of different dictionary initialization techniques and prove our proposed method is better. Finally, we find the optimal value of the weighted coefficients of the object function of E-LCKSVD that can make E-nose reach a better performance.

Download Full-text

A modified Gauss-Seidel-algorithm with exclusion of suboptimal actions for A class of semi-Markovian decision problems

Optimization ◽

10.1080/02331930008844514 ◽

2000 ◽

Vol 48 (4) ◽

pp. 429-451 ◽

Cited By ~ 1

Author(s):

V Nollau ◽

D Hudak

Keyword(s):

Decision Problems ◽

Markovian Decision Problems

Download Full-text

Assessing the Quality of Data with a Decision Model

Quality Aspects in Spatial Data Mining ◽

10.1201/9781420069273.ch2 ◽

2008 ◽

pp. 15-24

Author(s):

Andrew Frank

Keyword(s):

Decision Model ◽

Quality Of Data

Download Full-text

A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations

SSRN Electronic Journal ◽

10.2139/ssrn.37768 ◽

1997 ◽

Cited By ~ 10

Author(s):

John P. Rust

Keyword(s):

Infinite Horizon ◽

Policy Iteration ◽

Decision Problems ◽

Continuous State ◽

Iteration Methods ◽

Markovian Decision Problems

Download Full-text

Bayesian dynamic programming

Advances in Applied Probability ◽

10.2307/1426080 ◽

1975 ◽

Vol 7 (2) ◽

pp. 330-348 ◽

Cited By ~ 52

Author(s):

Ulrich Rieder

Keyword(s):

Dynamic Programming ◽

Weak Convergence ◽

Decision Model ◽

Transition Probabilities ◽

General State ◽

Parameter Spaces ◽

State Action ◽

Total Rewards ◽

Bayesian Dynamic Programming ◽

Dynamic Decision Model

We consider a non-stationary Bayesian dynamic decision model with general state, action and parameter spaces. It is shown that this model can be reduced to a non-Markovian (resp. Markovian) decision model with completely known transition probabilities. Under rather weak convergence assumptions on the expected total rewards some general results are presented concerning the restriction on deterministic generalized Markov policies, the criteria of optimality and the existence of Bayes policies. These facts are based on the above transformations and on results of Hindererand Schäl.

Download Full-text

A method of clustering for discounted markovian decision problems

Mathematische Operationsforschung und Statistik Series Optimization ◽

10.1080/02331938108842713 ◽

1981 ◽

Vol 12 (1) ◽

pp. 137-147 ◽

Cited By ~ 3

Author(s):

A. Hahnewald-busch ◽

V. Nollau

Keyword(s):

Decision Problems ◽

Markovian Decision Problems

Download Full-text

Type and Cotype Constants and the Linear Stability of Wigner’s Symmetry Theorem

Symmetry ◽

10.3390/sym11091107 ◽

2019 ◽

Vol 11 (9) ◽

pp. 1107

Author(s):

Javier Cuesta

Keyword(s):

Banach Spaces ◽

Linear Stability ◽

Linear Extension ◽

Transition Probabilities ◽

Linear Map ◽

Additive Error ◽

Type And Cotype

We study the relation between almost-symmetries and the geometry of Banach spaces. We show that any almost-linear extension of a transformation that preserves transition probabilities up to an additive error admits an approximation by a linear map, and the quality of the approximation depends on the type and cotype constants of the involved spaces.

Download Full-text

On the equivalence of mixed and behavior strategies in finitely additive decision problems

Journal of Applied Probability ◽

10.1017/jpr.2019.47 ◽

2019 ◽

Vol 56 (3) ◽

pp. 810-829

Author(s):

János Flesch ◽

Dries Vermeulen ◽

Anna Zseleva

Keyword(s):

Mixed Strategy ◽

Action Space ◽

Decision Problems ◽

Probability Measures ◽

Infinite Time ◽

Behavior Strategy ◽

Behavior Strategies ◽

And Behavior ◽

Arbitrary Action ◽

Action Spaces

AbstractWe consider decision problems with arbitrary action spaces, deterministic transitions, and infinite time horizon. In the usual setup when probability measures are countably additive, a general version of Kuhn’s theorem implies under fairly general conditions that for every mixed strategy of the decision maker there exists an equivalent behavior strategy. We examine to what extent this remains valid when probability measures are only assumed to be finitely additive. Under the classical approach of Dubins and Savage (2014), we prove the following statements: (1) If the action space is finite, every mixed strategy has an equivalent behavior strategy. (2) Even if the action space is infinite, at least one optimal mixed strategy has an equivalent behavior strategy. The approach by Dubins and Savage turns out to be essentially maximal: these two statements are no longer valid if we take any extension of their approach that considers all singleton plays.

Download Full-text