scholarly journals Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams

2012 ◽  
Vol 43 ◽  
pp. 211-255 ◽  
Author(s):  
Y. Zeng ◽  
P. Doshi

We focus on the problem of sequential decision making in partially observable environments shared with other agents of uncertain types having similar or conflicting objectives. This problem has been previously formalized by multiple frameworks one of which is the interactive dynamic influence diagram (I-DID), which generalizes the well-known influence diagram to the multiagent setting. I-DIDs are graphical models and may be used to compute the policy of an agent given its belief over the physical state and others' models, which changes as the agent acts and observes in the multiagent setting. As we may expect, solving I-DIDs is computationally hard. This is predominantly due to the large space of candidate models ascribed to the other agents and its exponential growth over time. We present two methods for reducing the size of the model space and stemming its exponential growth. Both these methods involve aggregating individual models into equivalence classes. Our first method groups together behaviorally equivalent models and selects only those models for updating which will result in predictive behaviors that are distinct from others in the updated model space. The second method further compacts the model space by focusing on portions of the behavioral predictions. Specifically, we cluster actionally equivalent models that prescribe identical actions at a single time step. Exactly identifying the equivalences would require us to solve all models in the initial set. We avoid this by selectively solving some of the models, thereby introducing an approximation. We discuss the error introduced by the approximation, and empirically demonstrate the improved efficiency in solving I-DIDs due to the equivalences.

2018 ◽  
Vol 13 (5) ◽  
pp. 895-907
Author(s):  
Liang Tan

The influence diagram is a probabilistic model for presenting decision problems as a directed graph. In this study, the dynamic influence diagram and the interactive dynamic influence diagram are used to model the three parties to service innovation: customers, suppliers, and service enterprises. The models analyze the decisions of these dierent parties and describe the process by which service enterprises should consider their own innovation conditions as well as those of the other parties, that is, customers and suppliers. Moreover, during the process of service innovation, service enterprises should be in constant communication with customers and suppliers. After the customers and suppliers respond, service enterprises can modify their innovation decision-making, and improve service innovation quality and income.


2020 ◽  
Vol 34 (10) ◽  
pp. 13851-13852
Author(s):  
Junkyu Lee

This paper presents a systematic way of decomposing a limited memory influence diagram (LIMID) to a tree of single-stage decision problems, or submodels and solving it by message passing. The relevance in LIMIDs is formalized by the notion of the partial evaluation of the maximum expected utility, and the graph separation criteria for identifying submodels follow. The submodel decomposition provides a graphical model approach for updating the beliefs and propagating the conditional expected utilities for solving LIMIDs with the worst-case complexity bounded by the maximum treewidth of the individual submodels.


2007 ◽  
Vol 29 ◽  
pp. 421-489 ◽  
Author(s):  
C. Pralet ◽  
G. Verfaillie ◽  
T. Schiex

Numerous formalisms and dedicated algorithms have been designed in the last decades to model and solve decision making problems. Some formalisms, such as constraint networks, can express "simple" decision problems, while others are designed to take into account uncertainties, unfeasible decisions, and utilities. Even in a single formalism, several variants are often proposed to model different types of uncertainty (probability, possibility...) or utility (additive or not). In this article, we introduce an algebraic graphical model that encompasses a large number of such formalisms: (1) we first adapt previous structures from Friedman, Chu and Halpern for representing uncertainty, utility, and expected utility in order to deal with generic forms of sequential decision making; (2) on these structures, we then introduce composite graphical models that express information via variables linked by "local" functions, thanks to conditional independence; (3) on these graphical models, we finally define a simple class of queries which can represent various scenarios in terms of observabilities and controllabilities. A natural decision-tree semantics for such queries is completed by an equivalent operational semantics, which induces generic algorithms. The proposed framework, called the Plausibility-Feasibility-Utility (PFU) framework, not only provides a better understanding of the links between existing formalisms, but it also covers yet unpublished frameworks (such as possibilistic influence diagrams) and unifies formalisms such as quantified boolean formulas and influence diagrams. Our backtrack and variable elimination generic algorithms are a first step towards unified algorithms.


Sign in / Sign up

Export Citation Format

Share Document