scholarly journals A Reinforcement Learning Model Based on Temporal Difference Algorithm

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 121922-121930 ◽  
Author(s):  
Xiali Li ◽  
Zhengyu Lv ◽  
Song Wang ◽  
Zhi Wei ◽  
Licheng Wu
2020 ◽  
Author(s):  
Ben Lonnqvist ◽  
Micha Elsner ◽  
Amelia R. Hunt ◽  
Alasdair D F Clarke

Experiments on the efficiency of human search sometimes reveal large differences between individual participants. We argue that reward-driven task-specific learning may account for some of this variation. In a computational reinforcement learning model of this process, a wide variety of strategies emerge, despite all simulated participants having the same visual acuity. We conduct a visual search experiment, and replicate previous findings that participant preferences about where to search are highly varied, with a distribution comparable to the simulated results. Thus, task-specific learning is an under-explored mechanism by which large inter-participant differences can arise.


Sign in / Sign up

Export Citation Format

Share Document