A State-Space Representation Model and Learning Algorithm for Real-Time Decision-Making Under Uncertainty

Volume 9: Mechanical Systems and Control, Parts A, B, and C ◽

10.1115/imece2007-41258 ◽

2007 ◽

Cited By ~ 4

Author(s):

Andreas A. Malikopoulos ◽

Panos Y. Papalambros ◽

Dennis N. Assanis

Keyword(s):

Decision Making ◽

Optimal Control ◽

Real Time ◽

Learning Algorithm ◽

Control Policy ◽

Sequential Decision Making ◽

Space Representation ◽

Decision Making Under Uncertainty ◽

Sequential Decision ◽

State Space Representation

Modeling dynamic systems incurring stochastic disturbances for deriving a control policy is a ubiquitous task in engineering. However, in some instances obtaining a model of a system may be impractical or impossible. Alternative approaches have been developed using a simulation-based stochastic framework, in which the system interacts with its environment in real time and obtains information that can be processed to produce an optimal control policy. In this context, the problem of developing a policy for controlling the system’s behavior is formulated as a sequential decision-making problem under uncertainty. This paper considers real-time sequential decision-making under uncertainty modeled as a Markov Decision Process (MDP). A state-space representation model is constructed through a learning mechanism and is used to improve system performance over time. The model allows decision making based on gradually enhanced knowledge of system response as it transitions from one state to another, in conjunction with actions taken at each state. A learning algorithm is implemented realizing in real time the optimal control policy associated with the state transitions. The proposed method is demonstrated on the single cart-pole balancing problem and a vehicle cruise control problem.

Download Full-text

A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty

Journal of Dynamic Systems Measurement and Control ◽

10.1115/1.3117200 ◽

2009 ◽

Vol 131 (4) ◽

Cited By ~ 9

Author(s):

Andreas A. Malikopoulos ◽

Panos Y. Papalambros ◽

Dennis N. Assanis

Keyword(s):

Decision Making ◽

Real Time ◽

Control Policy ◽

Learning Model ◽

Sequential Decision Making ◽

State Transitions ◽

Space Representation ◽

System Response ◽

Sequential Decision ◽

Cruise Control

Modeling dynamic systems incurring stochastic disturbances for deriving a control policy is a ubiquitous task in engineering. However, in some instances obtaining a model of a system may be impractical or impossible. Alternative approaches have been developed using a simulation-based stochastic framework, in which the system interacts with its environment in real time and obtains information that can be processed to produce an optimal control policy. In this context, the problem of developing a policy for controlling the system’s behavior is formulated as a sequential decision-making problem under uncertainty. This paper considers the problem of deriving a control policy for a dynamic system with unknown dynamics in real time, formulated as a sequential decision-making under uncertainty. The evolution of the system is modeled as a controlled Markov chain. A new state-space representation model and a learning mechanism are proposed that can be used to improve system performance over time. The major difference between the existing methods and the proposed learning model is that the latter utilizes an evaluation function, which considers the expected cost that can be achieved by state transitions forward in time. The model allows decision-making based on gradually enhanced knowledge of system response as it transitions from one state to another, in conjunction with actions taken at each state. The proposed model is demonstrated on the single cart-pole balancing problem and a vehicle cruise-control problem.

Download Full-text

Sequential Decision Making Under Uncertainty: Ordinal Uninorms vs. the Hurwicz Criterion

Communications in Computer and Information Science - Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications ◽

10.1007/978-3-319-91479-4_48 ◽

2018 ◽

pp. 578-590 ◽

Cited By ~ 3

Author(s):

Hélène Fargier ◽

Romain Guillaume

Keyword(s):

Decision Making ◽

Sequential Decision Making ◽

Decision Making Under Uncertainty ◽

Sequential Decision ◽

Hurwicz Criterion

Download Full-text

Preventing Disparate Treatment in Sequential Decision Making

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/311 ◽

2018 ◽

Cited By ~ 1

Author(s):

Hoda Heidari ◽

Andreas Krause

Keyword(s):

Decision Making ◽

Learning Algorithm ◽

Feature Space ◽

Sequential Decision Making ◽

Data Sets ◽

Sequential Decision ◽

Real World Data ◽

Time Step ◽

Job Application ◽

Disparate Treatment

We study fairness in sequential decision making environments, where at each time step a learning algorithm receives data corresponding to a new individual (e.g. a new job application) and must make an irrevocable decision about him/her (e.g. whether to hire the applicant) based on observations made so far. In order to prevent cases of disparate treatment, our time-dependent notion of fairness requires algorithmic decisions to be consistent: if two individuals are similar in the feature space and arrive during the same time epoch, the algorithm must assign them to similar outcomes. We propose a general framework for post-processing predictions made by a black-box learning model, that guarantees the resulting sequence of outcomes is consistent. We show theoretically that imposing consistency will not significantly slow down learning. Our experiments on two real-world data sets illustrate and confirm this finding in practice.

Download Full-text