RLAM: A Dynamic and Efficient Reinforcement Learning-Based Adaptive Mapping Scheme in Mobile WiMAX Networks

2014 ◽  
Vol 10 (2) ◽  
pp. 173-196 ◽  
Author(s):  
M. Louta ◽  
P. Sarigiannidis ◽  
S. Misra ◽  
P. Nicopolitidis ◽  
G. Papadimitriou

WiMAX (Worldwide Interoperability for Microwave Access) constitutes a candidate networking technology towards the 4G vision realization. By adopting the Orthogonal Frequency Division Multiple Access (OFDMA) technique, the latest IEEE 802.16x amendments manage to provide QoS-aware access services with full mobility support. A number of interesting scheduling and mapping schemes have been proposed in the research literature. However, they neglect a considerable asset of OFDMA-based wireless systems: the dynamic adjustment of the downlink-to-uplink width ratio. In order to fully exploit the supported mobile WiMAX features, we design, develop, and evaluate a rigorous adaptive model, which inherits its main aspects from the reinforcement learning field. The proposed model endeavours to efficiently determine the downlink-to-uplink width ratio, on a frame-by-frame basis, taking into account both the downlink and uplink traffic in the Base Station (BS). Extensive evaluation results indicate that the proposed model succeeds in providing quite accurate estimations, keeping the average error rate below 15% with respect to the optimal sub-frame configurations. Additionally, it presents improved performance compared to other learning methods (e.g., learning automata) and notable improvements compared to static schemes that maintain a fixed predefined ratio, in terms of service ratio and resource utilization.
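The frame-by-frame ratio selection described above can be sketched as a simple value-based learner. Note that the candidate ratios, the traffic figures, and the reward function below are illustrative assumptions, not the RLAM algorithm itself:

```python
import random

# Illustrative epsilon-greedy learner for the per-frame DL:UL split.
# RATIOS, the demand figures, and reward_fn are toy assumptions.

RATIOS = [0.5, 0.6, 0.7, 0.75]   # candidate downlink share of the frame
ALPHA, EPSILON = 0.1, 0.1        # learning rate, exploration probability

def make_learner():
    # one running value estimate per candidate ratio
    return {r: 0.0 for r in RATIOS}

def choose_ratio(values, rng):
    if rng.random() < EPSILON:
        return rng.choice(RATIOS)           # explore
    return max(values, key=values.get)      # exploit the best estimate

def update(values, ratio, reward):
    # exponential moving average toward the observed per-frame reward
    values[ratio] += ALPHA * (reward - values[ratio])

def reward_fn(ratio, dl_demand, ul_demand):
    # fraction of offered traffic served when the frame capacity (1.0)
    # is split `ratio` to downlink and the rest to uplink (toy model)
    served = min(dl_demand, ratio) + min(ul_demand, 1.0 - ratio)
    return served / (dl_demand + ul_demand)
```

For downlink-heavy traffic (say, 0.8 downlink versus 0.2 uplink demand per frame), the learner settles on the largest downlink share after a few thousand frames.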

Author(s):  
Pawan Singh Mehra

With a large number of cheap micro-sensing devices deployed, a wireless sensor network (WSN) gathers information from a region and delivers it to the base station (BS) for further decision-making. The hotspot problem occurs when cluster heads (CHs) nearer to the BS die prematurely due to uneven energy depletion, partitioning the network. To overcome this hotspot, or energy-hole, issue, unequal clustering is used, in which clusters of variable size are formed. Motivated by the above discussion, we propose an enhanced fuzzy unequal clustering and routing protocol (E-FUCA) in which vital parameters are considered during CH candidate selection, and non-CH nodes make an intelligent decision using fuzzy logic (FL) when selecting their CH during cluster formation. To further extend the network lifetime, we use FL for next-hop selection for efficient routing. We conducted simulation experiments for four scenarios and compared the proposed protocol's performance with recent similar protocols. The experimental results validate the improved performance of E-FUCA over its competitors in terms of longer lifetime, protracted stability period, and higher average energy.
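The unequal-clustering idea above (smaller clusters near the BS, so those CHs keep energy for relaying) can be sketched with a toy fuzzy rule. The membership functions, weights, and field dimensions below are assumptions for illustration, not E-FUCA's actual rule base:

```python
# Toy fuzzy rule for an unequal CH competition radius: CHs near the
# base station get a smaller radius (smaller clusters), and low residual
# energy shrinks the radius further. All constants are assumed.

R_MAX = 50.0                 # maximum competition radius (metres, assumed)
D_MIN, D_MAX = 20.0, 200.0   # min/max node-to-BS distance in the field

def tri(x, a, b, c):
    """Triangular fuzzy membership over [a, b, c]."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def competition_radius(dist_to_bs, residual_energy):
    # normalise distance to [0, 1]
    d = (min(max(dist_to_bs, D_MIN), D_MAX) - D_MIN) / (D_MAX - D_MIN)
    # fuzzy sets over normalised distance: near / mid / far
    near = tri(d, -0.5, 0.0, 0.5)
    mid = tri(d, 0.0, 0.5, 1.0)
    far = tri(d, 0.5, 1.0, 1.5)
    # weighted defuzzification: far nodes get the full radius, near nodes
    # a reduced one; residual energy (0..1) scales the result
    base = (0.4 * near + 0.7 * mid + 1.0 * far) / (near + mid + far)
    return R_MAX * base * (0.5 + 0.5 * residual_energy)
```

A node at the minimum BS distance with full energy gets a 20 m radius here, while one at the far edge of the field gets the full 50 m, which is the intended unequal-cluster shape.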


2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Peter Morales ◽  
Rajmonda Sulo Caceres ◽  
Tina Eliassi-Rad

Complex networks are often either too large for full exploration, partially accessible, or partially observed. Downstream learning tasks on these incomplete networks can produce low quality results. In addition, reducing the incompleteness of the network can be costly and nontrivial. As a result, network discovery algorithms optimized for specific downstream learning tasks given resource collection constraints are of great interest. In this paper, we formulate the task-specific network discovery problem as a sequential decision-making problem. Our downstream task is selective harvesting, the optimal collection of vertices with a particular attribute. We propose a framework, called network actor critic (NAC), which learns a policy and notion of future reward in an offline setting via a deep reinforcement learning algorithm. The NAC paradigm utilizes a task-specific network embedding to reduce the state space complexity. A detailed comparative analysis of popular network embeddings is presented with respect to their role in supporting offline planning. Furthermore, a quantitative study is presented on various synthetic and real benchmarks using NAC and several baselines. We show that offline models of reward and network discovery policies lead to significantly improved performance when compared to competitive online discovery algorithms. Finally, we outline learning regimes where planning is critical in addressing sparse and changing reward signals.
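The actor-critic machinery behind NAC can be illustrated in miniature: a softmax policy (actor) chooses which candidate vertex to probe, and a scalar value baseline (critic) reduces the variance of the policy-gradient update. The deep networks, embeddings, and graph environment are omitted; the payoffs below are toy stand-ins for finding "vertices with the target attribute":

```python
import math
import random

# Minimal actor-critic update on a toy probing task; NOT the NAC
# architecture, just the underlying policy-gradient-with-baseline rule.

def softmax(prefs):
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]

def sample(probs, rng):
    r, acc = rng.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1

def train(payoffs, steps=3000, alpha=0.1, beta=0.05, seed=0):
    rng = random.Random(seed)
    prefs = [0.0] * len(payoffs)      # actor parameters
    value = 0.0                       # critic: state-value baseline
    for _ in range(steps):
        probs = softmax(prefs)
        a = sample(probs, rng)
        reward = payoffs[a] + rng.gauss(0.0, 0.1)   # noisy payoff
        adv = reward - value          # advantage estimate
        value += beta * adv           # critic update
        for i in range(len(prefs)):   # policy-gradient actor update
            grad = (1.0 - probs[i]) if i == a else -probs[i]
            prefs[i] += alpha * adv * grad
    return prefs
```

After training, the preference for the highest-payoff candidate dominates, i.e. the policy concentrates its probing budget where the expected harvest is largest.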


2021 ◽  
Vol 10 (1) ◽  
pp. 21
Author(s):  
Omar Nassef ◽  
Toktam Mahmoodi ◽  
Foivos Michelinakis ◽  
Kashif Mahmood ◽  
Ahmed Elmokashfi

This paper presents a data-driven framework for performance optimisation of Narrow-Band IoT user equipment. The proposed framework is an edge micro-service that suggests one-time configurations to user equipment communicating with a base station. Suggested configurations are delivered by a Configuration Advocate to improve energy consumption, delay, throughput, or a combination of those metrics, depending on the user-end device and the application. Reinforcement learning utilising gradient descent and a genetic algorithm is adopted synchronously with machine and deep learning algorithms to predict the environmental states and suggest an optimal configuration. The results highlight the adaptability of the Deep Neural Network in predicting intermediary environmental states; they also show the superior optimisation performance of the genetic reinforcement learning algorithm.
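The genetic-search component of such a configuration advisor can be sketched as follows. The configuration knobs (transmit power, an eDRX-style sleep cycle) and the fitness trade-off are illustrative assumptions, not the paper's Configuration Advocate:

```python
import random

# Toy genetic search over NB-IoT-style device configurations. The knob
# values and the throughput/energy trade-off in fitness() are assumed.

TX_POWERS = [0, 10, 20, 23]       # candidate transmit powers (dBm)
SLEEP_CYCLES = [20, 40, 80, 160]  # candidate sleep cycles (seconds)

def fitness(cfg):
    tx, cyc = cfg
    throughput = tx / 23.0                  # favours high power
    energy = 0.5 * tx / 23.0 + 20.0 / cyc   # radio plus wake-up cost
    return throughput - energy              # combined metric (toy)

def evolve(pop_size=20, gens=30, seed=0):
    rng = random.Random(seed)
    pop = [(rng.choice(TX_POWERS), rng.choice(SLEEP_CYCLES))
           for _ in range(pop_size)]
    for _ in range(gens):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[:pop_size // 2]     # elitist truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            child = (a[0], b[1])            # gene-wise crossover
            if rng.random() < 0.2:          # mutate the power gene
                child = (rng.choice(TX_POWERS), child[1])
            if rng.random() < 0.2:          # mutate the cycle gene
                child = (child[0], rng.choice(SLEEP_CYCLES))
            children.append(child)
        pop = survivors + children
    return max(pop, key=fitness)
```

Under this toy fitness the optimum is maximum power with the longest sleep cycle; the elitist population typically converges to it within a few dozen generations.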


Author(s):  
Akindele Segun Afolabi ◽  
Shehu Ahmed ◽  
Olubunmi Adewale Akinola

Due to the increased demand for scarce wireless bandwidth, it has become insufficient to serve the network user equipment using macrocell base stations only. Network densification through the addition of low power nodes (picocells) to conventional high power nodes addresses the bandwidth dearth issue, but unfortunately introduces unwanted interference into the network, which causes a reduction in throughput. This paper developed a reinforcement learning model that assisted in coordinating interference in a heterogeneous network comprising macrocell and picocell base stations. The learning mechanism was derived from Q-learning and consisted of agent, state, action, and reward. The base station was modeled as the agent, while the state represented the condition of the user equipment in terms of Signal to Interference Plus Noise Ratio. The action was represented by the transmission power level, and the reward was given in terms of throughput. Simulation results showed that the proposed Q-learning scheme improved the average user equipment throughput in the network. In particular, multi-agent systems with a normal learning rate increased the throughput of associated user equipment by a whopping 212.5% compared to a macrocell-only scheme.
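The agent/state/action/reward mapping described above can be sketched as a small tabular Q-learner. The power levels, reward numbers, and SINR transitions below are toy assumptions, not the paper's simulation model:

```python
import random

# Toy Q-learning mirroring the abstract: agent = base station,
# state = discretised SINR, action = transmit power, reward = throughput
# minus an interference penalty. All numbers are assumed.

POWER_LEVELS = [10, 20, 30, 40]          # dBm actions (assumed)
SINR_STATES = ["low", "mid", "high"]
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.2        # learning rate, discount, exploration

# net reward per action: throughput gain minus interference penalty (toy)
NET_REWARD = {10: 0.7, 20: 1.2, 30: 1.3, 40: 1.1}

def greedy(q, s):
    return max(POWER_LEVELS, key=lambda a: q[(s, a)])

def train(steps=20000, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in SINR_STATES for a in POWER_LEVELS}
    s = "low"
    for _ in range(steps):
        a = rng.choice(POWER_LEVELS) if rng.random() < EPS else greedy(q, s)
        r = NET_REWARD[a]
        s_next = rng.choice(SINR_STATES)  # toy SINR transition
        best_next = max(q[(s_next, b)] for b in POWER_LEVELS)
        # standard Q-learning temporal-difference update
        q[(s, a)] += ALPHA * (r + GAMMA * best_next - q[(s, a)])
        s = s_next
    return q
```

With these numbers the learner settles on the middle-high power level (30 dBm), reflecting the abstract's point that maximum power is not optimal once interference is priced into the reward.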


Mathematics ◽  
2020 ◽  
Vol 8 (8) ◽  
pp. 1254 ◽  
Author(s):  
Cheng-Hung Chen ◽  
Shiou-Yun Jeng ◽  
Cheng-Jian Lin

In this study, a fuzzy logic controller with a reinforcement-improved differential search algorithm (FLC_R-IDS) is proposed for solving a mobile robot wall-following control problem. This study uses the reward and punishment mechanisms of reinforcement learning to train the mobile robot's wall-following control. The proposed improved differential search algorithm uses parameter adaptation to adjust the control parameters. To improve the algorithm's exploration, the number of superorganisms is varied at the stopover site. This study uses reinforcement learning to guide the behavior of the robot: when the mobile robot satisfies three reward conditions, it receives a reward of +1. The accumulated reward value is used to evaluate the controller and to select the controller for the next round of training. Experimental results show that, compared with the traditional differential search algorithm and the chaos differential search algorithm, the average error of the proposed FLC_R-IDS in the three experimental environments is reduced by 12.44%, 22.54% and 25.98%, respectively. Finally, the experimental results also show that a real mobile robot using the proposed method can effectively implement wall-following control.
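The reward-accumulation scheme described above can be sketched as follows. The abstract does not state the three reward conditions, so the thresholds below (safe wall distance, stable distance, forward motion) are illustrative assumptions:

```python
# Toy version of the +1-per-step reward used to evaluate wall-following
# controllers. The three conditions and their thresholds are assumed.

D_TARGET, D_TOL = 0.5, 0.1    # desired wall distance and tolerance (m)
V_MIN = 0.05                  # minimum forward speed (m/s)

def step_reward(wall_dist, prev_dist, speed):
    near_wall = abs(wall_dist - D_TARGET) <= D_TOL   # condition 1: safe distance
    stable = abs(wall_dist - prev_dist) <= 0.05      # condition 2: smooth track
    moving = speed >= V_MIN                          # condition 3: making progress
    return 1 if (near_wall and stable and moving) else 0

def evaluate(trajectory):
    """Accumulate reward over (wall_dist, speed) samples; the total ranks
    candidate controllers between training rounds."""
    total, prev = 0, trajectory[0][0]
    for dist, speed in trajectory:
        total += step_reward(dist, prev, speed)
        prev = dist
    return total
```

A run that hugs the wall at roughly 0.5 m while moving forward collects one point per control step, so a higher total directly means better wall-following.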

