A Continuous-State Reinforcement Learning Strategy for Link Adaptation in OFDM Wireless Systems

P.H.P. Carvalho; R.D. Vieira; J.P. Leite

doi:10.14209/jcis.2015.6

Journal of Communication and Information Systems ◽

10.14209/jcis.2015.6 ◽

2015 ◽

Vol 30 (1) ◽

pp. 47-57 ◽

Cited By ~ 1

Author(s):

P.H.P. Carvalho ◽

R.D. Vieira ◽

J.P. Leite

Keyword(s):

Reinforcement Learning ◽

Learning Strategy ◽

Wireless Systems ◽

Link Adaptation ◽

Continuous State

Reinforcement Learning for Link Adaptation in MIMO-OFDM Wireless Systems

2010 IEEE Global Telecommunications Conference GLOBECOM 2010 ◽

10.1109/glocom.2010.5683371 ◽

2010 ◽

Cited By ~ 10

Author(s):

Sungho Yun ◽

Constantine Caramanis

Keyword(s):

Reinforcement Learning ◽

Wireless Systems ◽

Link Adaptation ◽

MIMO-OFDM

Collision-free path planning for welding manipulator via hybrid algorithm of deep reinforcement learning and inverse kinematics

Complex & Intelligent Systems ◽

10.1007/s40747-021-00366-1 ◽

2021 ◽

Author(s):

Jie Zhong ◽

Tao Wang ◽

Lianglun Cheng

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Free Path ◽

Inverse Kinematics ◽

Multiple Dimensions ◽

Continuous State ◽

Planning Algorithm ◽

Convergence Performance ◽

Path Planner ◽

Action Spaces

Abstract: In actual welding scenarios, an effective path planner is needed to find a collision-free path in the configuration space for a welding manipulator surrounded by obstacles. However, the state-of-the-art sampling-based planner satisfies only probabilistic completeness, and its computational complexity is sensitive to the state dimension. In this paper, we propose a path planner for welding manipulators based on deep reinforcement learning for solving path planning problems in high-dimensional continuous state and action spaces. Compared with the sampling-based method, it is more robust and less sensitive to the state dimension. In detail, to improve the learning efficiency, we introduce an inverse kinematics module to provide prior knowledge, while a gain module is designed to avoid locally optimal policies; both are integrated into the training algorithm. To evaluate our proposed planning algorithm in multiple dimensions, we conducted multiple sets of path planning experiments for welding manipulators. The results show that our method not only improves the convergence performance but is also superior in terms of optimality and robustness of planning compared with most other planning algorithms.

Reinforcement learning versus swarm intelligence for autonomous multi-HAPS coordination

SN Applied Sciences ◽

10.1007/s42452-021-04658-6 ◽

2021 ◽

Vol 3 (6) ◽

Author(s):

Ogbonnaya Anicho ◽

Philip B. Charlesworth ◽

Gurvinder S. Baicher ◽

Atulya K. Nagar

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Swarm Intelligence ◽

Performance Indicators ◽

Convergence Rates ◽

Tuning Parameters ◽

Continuous State Space ◽

Continuous State ◽

User Coverage ◽

Better Than

Abstract: This work analyses the performance of Reinforcement Learning (RL) versus Swarm Intelligence (SI) for coordinating multiple unmanned High Altitude Platform Stations (HAPS) for communications area coverage. It builds upon previous work that examined various elements of both algorithms. The main aim of this paper is to address the continuous state-space challenge within this work by using partitioning to manage the high-dimensionality problem. This enabled a comparison of the classical cases of both RL and SI, establishing a baseline for future comparisons of improved versions. In previous work, SI was observed to perform better across various key performance indicators. After tuning parameters and empirically choosing a suitable partitioning ratio for the RL state space, the SI algorithm still maintained superior coordination capability, achieving higher mean overall user coverage (about 20% better than the RL algorithm) in addition to faster convergence rates. Though the RL technique showed better average peak user coverage, the unpredictable coverage dip was a key weakness, making SI a more suitable algorithm within the context of this work.

Hybrid beamforming algorithm using reinforcement learning for millimeter wave wireless systems

2019 XVIII Workshop on Information Processing and Control (RPIC) ◽

10.1109/rpic.2019.8882140 ◽

2019 ◽

Cited By ~ 1

Author(s):

Enrique M. Lizarraga ◽

Gabriel N. Maggio ◽

Alexis A. Dowhuszko

Keyword(s):

Reinforcement Learning ◽

Millimeter Wave ◽

Wireless Systems ◽

Hybrid Beamforming ◽

Beamforming Algorithm

Medical QoS provision based on reinforcement learning in ultrasound streaming over 3.5G wireless systems

IEEE Journal on Selected Areas in Communications ◽

10.1109/jsac.2009.090517 ◽

2009 ◽

Vol 27 (4) ◽

pp. 566-574 ◽

Cited By ~ 54

Author(s):

Robert Istepanian ◽

Nada Philip ◽

Maria Martini

Keyword(s):

Reinforcement Learning ◽

Wireless Systems

Maximum entropy inverse reinforcement learning in continuous state spaces with path integrals

2011 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2011.6048804 ◽

2011 ◽

Cited By ~ 1

Author(s):

N. Aghasadeghi ◽

T. Bretl

Keyword(s):

Reinforcement Learning ◽

Maximum Entropy ◽

Path Integrals ◽

Inverse Reinforcement Learning ◽

State Spaces ◽

Continuous State

Reinforcement Learning Based Link Adaptation in 5G URLLC

10.1109/icscc51209.2021.9528117 ◽

2021 ◽

Author(s):

Praveen S ◽

Jihas Khan ◽

Lillykutty Jacob

Keyword(s):

Reinforcement Learning ◽

Link Adaptation

A New Hybrid Deep Neural Architectural Search based Ensemble Reinforcement Learning Strategy for Wind Power Forecasting

IEEE Transactions on Industry Applications ◽

10.1109/tia.2021.3126272 ◽

2021 ◽

pp. 1-1

Author(s):

Seyed Mohammad Jafar Jalali ◽

Gerardo J. Osorio ◽

Sajad Ahmadian ◽

Mohamed Lotfi ◽

Vasco Campos ◽

...

Keyword(s):

Reinforcement Learning ◽

Wind Power ◽

Learning Strategy ◽

Wind Power Forecasting ◽

Power Forecasting

Reinforcement Learning Strategy for Solving the Resource-Constrained Project Scheduling Problem by a Team of A-Teams

Intelligent Information and Database Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-319-05458-2_21 ◽

2014 ◽

pp. 197-206 ◽

Cited By ~ 2

Author(s):

Piotr Jędrzejowicz ◽

Ewa Ratajczak-Ropel

Keyword(s):

Reinforcement Learning ◽

Project Scheduling ◽

Learning Strategy ◽

Scheduling Problem ◽

Resource Constrained ◽

Resource Constrained Project Scheduling ◽

Project Scheduling Problem

Novel link adaptation algorithm for multichannel wireless systems with datastream repetition

2015 International Siberian Conference on Control and Communications (SIBCON) ◽

10.1109/sibcon.2015.7147100 ◽

2015 ◽

Author(s):

D.E. Chickrin ◽

P.A. Kokunin

Keyword(s):

Wireless Systems ◽

Link Adaptation ◽

Adaptation Algorithm
