scholarly journals DESPOT: Online POMDP Planning with Regularization

2017 ◽  
Vol 58 ◽  
pp. 231-266 ◽  
Author(s):  
Nan Ye ◽  
Adhiraj Somani ◽  
David Hsu ◽  
Wee Sun Lee

The partially observable Markov decision process (POMDP) provides a principled general framework for planning under uncertainty, but solving POMDPs optimally is computationally intractable, due to the "curse of dimensionality" and the "curse of history". To overcome these challenges, we introduce the Determinized Sparse Partially Observable Tree (DESPOT), a sparse approximation of the standard belief tree, for online planning under uncertainty. A DESPOT focuses online planning on a set of randomly sampled scenarios and compactly captures the "execution" of all policies under these scenarios. We show that the best policy obtained from a DESPOT is near-optimal, with a regret bound that depends on the representation size of the optimal policy. Leveraging this result, we give an anytime online planning algorithm, which searches a DESPOT for a policy that optimizes a regularized objective function. Regularization balances the estimated value of a policy under the sampled scenarios and the policy size, thus avoiding overfitting. The algorithm demonstrates strong experimental results, compared with some of the best online POMDP algorithms available. It has also been incorporated into an autonomous driving system for real-time vehicle control. The source code for the algorithm is available online.

2018 ◽  
Vol 38 (2-3) ◽  
pp. 162-181 ◽  
Author(s):  
Yuanfu Luo ◽  
Haoyu Bai ◽  
David Hsu ◽  
Wee Sun Lee

The partially observable Markov decision process (POMDP) provides a principled general framework for robot planning under uncertainty. Leveraging the idea of Monte Carlo sampling, recent POMDP planning algorithms have scaled up to various challenging robotic tasks, including, real-time online planning for autonomous vehicles. To further improve online planning performance, this paper presents IS-DESPOT, which introduces importance sampling to DESPOT, a state-of-the-art sampling-based POMDP algorithm for planning under uncertainty. Importance sampling improves DESPOT’s performance when there are critical, but rare events, which are difficult to sample. We prove that IS-DESPOT retains the theoretical guarantee of DESPOT. We demonstrate empirically that importance sampling significantly improves the performance of online POMDP planning for suitable tasks. We also present a general method for learning the importance sampling distribution.


Author(s):  
Wulf Loh ◽  
Janina Loh

In this chapter, we give a brief overview of the traditional notion of responsibility and introduce a concept of distributed responsibility within a responsibility network of engineers, driver, and autonomous driving system. In order to evaluate this concept, we explore the notion of man–machine hybrid systems with regard to self-driving cars and conclude that the unit comprising the car and the operator/driver consists of such a hybrid system that can assume a shared responsibility different from the responsibility of other actors in the responsibility network. Discussing certain moral dilemma situations that are structured much like trolley cases, we deduce that as long as there is something like a driver in autonomous cars as part of the hybrid system, she will have to bear the responsibility for making the morally relevant decisions that are not covered by traffic rules.


2021 ◽  
Vol 6 (4) ◽  
pp. 7301-7308
Author(s):  
Tianze Wu ◽  
Baofu Wu ◽  
Sa Wang ◽  
Liangkai Liu ◽  
Shaoshan Liu ◽  
...  

2015 ◽  
Vol 16 (4) ◽  
pp. 1999-2013 ◽  
Author(s):  
Inwook Shim ◽  
Jongwon Choi ◽  
Seunghak Shin ◽  
Tae-Hyun Oh ◽  
Unghui Lee ◽  
...  

2021 ◽  
Author(s):  
Jingqin Zhang ◽  
Jun Hou ◽  
Jinwen Hu ◽  
Chunhui Zhao ◽  
Zhao Xu ◽  
...  

2021 ◽  
Author(s):  
Kazunari Takasaki ◽  
Kota Hisafuru ◽  
Ryotaro Negishi ◽  
Kazuki Yamashita ◽  
Keisuke Fukada ◽  
...  

2021 ◽  
Author(s):  
Emanuele Ferrandino ◽  
Antonino Capillo ◽  
Enrico De Santis ◽  
Fabio Mascioli ◽  
Antonello Rizzi

Sign in / Sign up

Export Citation Format

Share Document