tile coding Latest Research Papers

Gaussian Based Non-linear Function Approximation for Reinforcement Learning

SN Computer Science ◽

10.1007/s42979-021-00642-4 ◽

2021 ◽

Vol 2 (3) ◽

Author(s):

Abbas Haider ◽

Glenn Hawe ◽

Hui Wang ◽

Bryan Scotney

Keyword(s):

Reinforcement Learning ◽

Linear Function ◽

Function Approximation ◽

World Market ◽

Information Loss ◽

State Spaces ◽

State Information ◽

Linear Function Approximation ◽

Non Linear ◽

Tile Coding

AbstractReinforcement learning (RL) problems with continuous states and discrete actions (CSDA) can be found in classic examples such as Cart Pole and Puck World, as well as real world applications such as Market Making. Solutions to CSDA problems typically involve a function approximation (FA) of the mapping from states to actions and can be linear or nonlinear. Linear FAs such as tile-coding (Sutton and Barto in Reinforcement learning, 2nd ed, 2009) suffer from state information loss due to state discretization, whilst non-linear FAs such as DQN (Mnih et al. in Playing atari with deep reinforcement learning, https://arxiv.org/abs/1312.5602, 2013) are practically infeasible in infinitely large state spaces due to their cubic time complexity ($$O(n^3)$$ O ( n 3 ) ). In this paper, we propose a novel, general solution to CSDA problems, called Gaussian distribution based non-linear function approximation (GBNLFA). Experimentation on three CSDA RL problems (Cart Pole, Puck World, Market Marking) demonstrates the superiority of GBNLFA over state-of-the-art FAs, namely tile-coding and DQN. In particular, GBNLFA resolves the state information loss problem with linear FAs and provides an asymptotically faster algorithm (O(n)) than linear FAs ($$O(n^2)$$ O ( n 2 ) ) and neural network based nonlinear FAs ($$O(n^3)$$ O ( n 3 ) ).

Download Full-text

Autonomous Mobility Management for 5G Ultra-Dense HetNets via Reinforcement Learning With Tile Coding Function Approximation

IEEE Access ◽

10.1109/access.2021.3095555 ◽

2021 ◽

pp. 1-1

Author(s):

Qianyu Liu ◽

Chiew Foong Kwong ◽

Sijia Zhou ◽

Tianhao Ye ◽

Lincan Li ◽

...

Keyword(s):

Reinforcement Learning ◽

Mobility Management ◽

Function Approximation ◽

Tile Coding ◽

Autonomous Mobility

Download Full-text

A Baseline Approach for Goalkeeper Strategy using Sarsa with Tile Coding on the Half Field Offense Environment

2020 19th Brazilian Symposium on Computer Games and Digital Entertainment (SBGames) ◽

10.1109/sbgames51465.2020.00012 ◽

2020 ◽

Author(s):

Victor G. Ferreira Barbosa ◽

Rosalvo Ferreira de Oliveira Neto ◽

Roberto V. L. Gomes Rodrigues

Keyword(s):

Tile Coding ◽

Baseline Approach

Download Full-text

Learning Sparse Representations in Reinforcement Learning with Sparse Coding

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/287 ◽

2017 ◽

Author(s):

Lei Le ◽

Raksha Kumaraswamy ◽

Martha White

Keyword(s):

Reinforcement Learning ◽

Sparse Representation ◽

Policy Evaluation ◽

Sparse Coding ◽

Representation Learning ◽

Sparse Representations ◽

Learning Approaches ◽

Local Minima ◽

Global Minima ◽

Tile Coding

A variety of representation learning approaches have been investigated for reinforcement learning; much less attention, however, has been given to investigating the utility of sparse coding. Outside of reinforcement learning, sparse coding representations have been widely used, with non-convex objectives that result in discriminative representations. In this work, we develop a supervised sparse coding objective for policy evaluation. Despite the non-convexity of this objective, we prove that all local minima are global minima, making the approach amenable to simple optimization strategies. We empirically show that it is key to use a supervised objective, rather than the more straightforward unsupervised sparse coding approach. We then compare the learned representations to a canonical fixed sparse representation, called tile-coding, demonstrating that the sparse coding representation outperforms a wide variety of tile-coding representations.

Download Full-text

Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding

2017 International Conference on Rehabilitation Robotics (ICORR) ◽

10.1109/icorr.2017.8009451 ◽

2017 ◽

Cited By ~ 5

Author(s):

Jaden B. Travnik ◽

Patrick M. Pilarski

Keyword(s):

High Dimensional Data ◽

High Dimensional ◽

Assistive Robots ◽

Tile Coding

Download Full-text

Upper Bounds on the Performance of Discretisation in Reinforcement Learning

South African Computer Journal ◽

10.18489/sacj.v0i57.284 ◽

2015 ◽

Author(s):

Michael Robin Mitchley

Keyword(s):

Reinforcement Learning ◽

Value Function ◽

Value Function Approximation ◽

Learning Framework ◽

A Value ◽

Continuous State Space ◽

Policy Representation ◽

Continuous State ◽

Tile Coding ◽

Policy Mapping

Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation.

Download Full-text

Performance Evaluation of Tile Coding in Reinforcement Learning

Applicative 2015 on - Applicative 2015 ◽

10.1145/2814940.2814975 ◽

2015 ◽

Author(s):

Kenji Ota ◽

Tomoko Ozeki

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Tile Coding

Download Full-text

A new method of reducing boundary artifacts for JPEG2000 multi-tile coding

2015 IEEE International Conference on Imaging Systems and Techniques (IST) ◽

10.1109/ist.2015.7294570 ◽

2015 ◽

Cited By ~ 1

Author(s):

Jianxin Wang ◽

En Zhu

Keyword(s):

New Method ◽

Tile Coding

Download Full-text

XCSF with tile coding in discontinuous action-value landscapes

Evolutionary Intelligence ◽

10.1007/s12065-015-0129-7 ◽

2015 ◽

Vol 8 (2-3) ◽

pp. 117-132 ◽

Cited By ~ 1

Author(s):

Pier Luca Lanzi ◽

Daniele Loiacono

Keyword(s):

Tile Coding ◽

Action Value

Download Full-text

Tile coding based camera modeling for 3D sensing

2014 IEEE 6th International Conference on Awareness Science and Technology (iCAST) ◽

10.1109/icawst.2014.6981828 ◽

2014 ◽

Author(s):

Toshihiko Watanabe ◽

Yuichi Saito ◽

Takeshi Kamai ◽

Tomoki Ishimaru

Keyword(s):

Tile Coding ◽

3D Sensing

Download Full-text

tile coding
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Gaussian Based Non-linear Function Approximation for Reinforcement Learning

Autonomous Mobility Management for 5G Ultra-Dense HetNets via Reinforcement Learning With Tile Coding Function Approximation

A Baseline Approach for Goalkeeper Strategy using Sarsa with Tile Coding on the Half Field Offense Environment

Learning Sparse Representations in Reinforcement Learning with Sparse Coding

Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding

Upper Bounds on the Performance of Discretisation in Reinforcement Learning

Performance Evaluation of Tile Coding in Reinforcement Learning

A new method of reducing boundary artifacts for JPEG2000 multi-tile coding

XCSF with tile coding in discontinuous action-value landscapes

Tile coding based camera modeling for 3D sensing

Export Citation Format

tile codingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Gaussian Based Non-linear Function Approximation for Reinforcement Learning

Autonomous Mobility Management for 5G Ultra-Dense HetNets via Reinforcement Learning With Tile Coding Function Approximation

A Baseline Approach for Goalkeeper Strategy using Sarsa with Tile Coding on the Half Field Offense Environment

Learning Sparse Representations in Reinforcement Learning with Sparse Coding

Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding

Upper Bounds on the Performance of Discretisation in Reinforcement Learning

Performance Evaluation of Tile Coding in Reinforcement Learning

A new method of reducing boundary artifacts for JPEG2000 multi-tile coding

XCSF with tile coding in discontinuous action-value landscapes

Tile coding based camera modeling for 3D sensing

tile coding
Recently Published Documents