HMM-based Temporal Difference Learning with State Transition Updating for Tracking Human Communicational Behaviors

Minh Anh T. Ho;  ; Yoji Yamada; Yoji Umetani

doi:10.20965/jrm.2003.p0271

HMM-based Temporal Difference Learning with State Transition Updating for Tracking Human Communicational Behaviors

Journal of Robotics and Mechatronics ◽

10.20965/jrm.2003.p0271 ◽

2003 ◽

Vol 15 (3) ◽

pp. 271-277 ◽

Cited By ~ 1

Author(s):

Minh Anh T. Ho ◽

◽

Yoji Yamada ◽

Yoji Umetani

Keyword(s):

State Transition ◽

Markov Models ◽

Transition Probability ◽

Original System ◽

Temporal Difference ◽

Value Functions ◽

Constraint Factor ◽

Organization Framework ◽

Sign Sequence ◽

Updating Procedure

In our original system, we used hidden Markov models (HMMs) to model rough gesture patterns. We later utilized temporal difference (TD) learning to adjust the action model of the tracker for its behavior in the tracking task. We integrated the above two methods into an algorithm by assigning state transition probability in HMMs as a reward in TD learning. Identification of the sign gesture context through wavelet analysis autonomously provides a reward value for optimizing the attentive visual attentive tracker's AVAT's action patterns. A bound of state value functions as a constraint factor for the updating procedure in TD models has been determined to recognize whether predictive models need to be updated according with action models. Experimental results of extracting an operator's hand sign sequence during natural walking demonstrates AVAT development in the perceptual organization framework.

Download Full-text

State Transition Probability Based Sensing Duration Optimization Algorithm in Cognitive Radio

IEICE Transactions on Communications ◽

10.1587/transcom.e93.b.3258 ◽

2010 ◽

Vol E93-B (12) ◽

pp. 3258-3265 ◽

Cited By ~ 1

Author(s):

Jin-long WANG ◽

Xiao ZHANG ◽

Qihui WU

Keyword(s):

Cognitive Radio ◽

Optimization Algorithm ◽

State Transition ◽

Transition Probability ◽

State Transition Probability

Download Full-text

Hidden Markov models with duration-dependent state transition probabilities

Electronics Letters ◽

10.1049/el:19910392 ◽

1991 ◽

Vol 27 (8) ◽

pp. 625 ◽

Cited By ~ 7

Author(s):

S.V. Vaseghi

Keyword(s):

Hidden Markov Models ◽

State Transition ◽

Markov Models ◽

Transition Probabilities ◽

Hidden Markov ◽

Dependent State

Download Full-text

Asynchronous quadratic control for constrained hidden markov jump linear systems with incomplete MTPM and MOCPM

IMA Journal of Mathematical Control and Information ◽

10.1093/imamci/dnab012 ◽

2021 ◽

Author(s):

Jin Zhu ◽

Kai Xia ◽

Geir E Dullerud

Keyword(s):

Linear Systems ◽

Controller Design ◽

Hidden Markov ◽

Transition Probability ◽

Original System ◽

Transition Probability Matrix ◽

Markov Jump ◽

Jump Linear Systems ◽

Weighting Matrix ◽

Markov Jump Linear Systems

Abstract This paper investigates the quadratic optimal control problem for constrained Markov jump linear systems with incomplete mode transition probability matrix (MTPM). Considering original system mode is not accessible, observed mode is utilized for asynchronous controller design where mode observation conditional probability matrix (MOCPM), which characterizes the emission between original modes and observed modes is assumed to be partially known. An LMI optimization problem is formulated for such constrained hidden Markov jump linear systems with incomplete MTPM and MOCPM. Based on this, a feasible state-feedback controller can be designed with the application of free-connection weighting matrix method. The desired controller, dependent on observed mode, is an asynchronous one which can minimize the upper bound of quadratic cost and satisfy restrictions on system states and control variables. Furthermore, clustering observation where observed modes recast into several clusters, is explored for simplifying the computational complexity. Numerical examples are provided to illustrate the validity.

Download Full-text

State transition probability for the Markov Model dealing with on/off cooling schedule in dwellings

Energy and Buildings ◽

10.1016/j.enbuild.2004.02.002 ◽

2005 ◽

Vol 37 (3) ◽

pp. 181-187 ◽

Cited By ~ 34

Author(s):

Jun Tanimoto ◽

Aya Hagishima

Keyword(s):

Markov Model ◽

State Transition ◽

Transition Probability ◽

State Transition Probability ◽

Cooling Schedule

Download Full-text

Isolated word recognition using continuous state transition-probability and DP-matching

10.1109/icassp.1989.266418 ◽

2003 ◽

Cited By ~ 1

Author(s):

T. Takara

Keyword(s):

Word Recognition ◽

State Transition ◽

Transition Probability ◽

State Transition Probability ◽

Continuous State ◽

Isolated Word ◽

Isolated Word Recognition

Download Full-text

Grey theory–based BP-NN co-training for dense sequence long-term tendency prediction

Grey Systems Theory and Application ◽

10.1108/gs-02-2020-0024 ◽

2020 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Yuling Hong ◽

Yingjie Yang ◽

Qishan Zhang

Keyword(s):

State Transition ◽

Transition Probability ◽

Training Model ◽

Transition Probability Matrix ◽

Content Type ◽

Fine Grained ◽

State Transition Probability ◽

Dense Sequence ◽

Popularity Prediction

PurposeThe purpose of this paper is to solve the problems existing in topic popularity prediction in online social networks and advance a fine-grained and long-term prediction model for lack of sufficient data.Design/methodology/approachBased on GM(1,1) and neural networks, a co-training model for topic tendency prediction is proposed in this paper. The interpolation based on GM(1,1) is employed to generate fine-grained prediction values of topic popularity time series and two neural network models are considered to achieve convergence by transmitting training parameters via their loss functions.FindingsThe experiment results indicate that the integrated model can effectively predict dense sequence with higher performance than other algorithms, such as NN and RBF_LSSVM. Furthermore, the Markov chain state transition probability matrix model is used to improve the prediction results.Practical implicationsFine-grained and long-term topic popularity prediction, further improvement could be made by predicting any interpolation in the time interval of popularity data points.Originality/valueThe paper succeeds in constructing a co-training model with GM(1,1) and neural networks. Markov chain state transition probability matrix is deployed for further improvement of popularity tendency prediction.

Download Full-text

05/02304 State transition probability for the Markov Model dealing with on/off cooling schedule in dwellings

Fuel and Energy Abstracts ◽

10.1016/s0140-6701(05)82313-6 ◽

2005 ◽

Vol 46 (5) ◽

pp. 336

Keyword(s):

Markov Model ◽

State Transition ◽

Transition Probability ◽

State Transition Probability ◽

Cooling Schedule

Download Full-text

Infrastructure State Transition Probability Computation Using Duration Models

Applications of Advanced Technologies in Transportation (2002) ◽

10.1061/40632(245)64 ◽

2002 ◽

Cited By ~ 1

Author(s):

Rabi G. Mishalani ◽

Samer M. Madanat

Keyword(s):

State Transition ◽

Transition Probability ◽

Duration Models ◽

State Transition Probability

Download Full-text

Stochastic traffic control based on regional state transition probability model

2016 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI) ◽

10.1109/soli.2016.7551667 ◽

2016 ◽

Author(s):

Yunwen Xu ◽

Yugeng Xi ◽

Dewei Li

Keyword(s):

Traffic Control ◽

State Transition ◽

Transition Probability ◽

Probability Model ◽

State Transition Probability ◽

Regional State ◽

Transition Probability Model

Download Full-text

Extracting State Transition Dynamics from Multiple Spike Trains Using Hidden Markov Models with Correlated Poisson Distribution

Neural Computation ◽

10.1162/neco.2010.08-08-838 ◽

2010 ◽

Vol 22 (9) ◽

pp. 2369-2389 ◽

Cited By ~ 3

Author(s):

Kentaro Katahira ◽

Jun Nishikawa ◽

Kazuo Okanoya ◽

Masato Okada

Keyword(s):

Hidden Markov Models ◽

Poisson Distribution ◽

State Transition ◽

Markov Models ◽

Hidden Markov ◽

Synthetic Data ◽

Poisson Model ◽

Spike Trains ◽

Output Distribution ◽

Multivariate Poisson Distribution

Neural activity is nonstationary and varies across time. Hidden Markov models (HMMs) have been used to track the state transition among quasi-stationary discrete neural states. Within this context, an independent Poisson model has been used for the output distribution of HMMs; hence, the model is incapable of tracking the change in correlation without modulating the firing rate. To achieve this, we applied a multivariate Poisson distribution with correlation terms for the output distribution of HMMs. We formulated a variational Bayes (VB) inference for the model. The VB could automatically determine the appropriate number of hidden states and correlation types while avoiding the overlearning problem. We developed an efficient algorithm for computing posteriors using the recursive relationship of a multivariate Poisson distribution. We demonstrated the performance of our method on synthetic data and real spike trains recorded from a songbird.

Download Full-text