An Intelligent TCP Congestion Control Method Based on Deep Q Network

2021 · Vol 13 (10) · pp. 261
Author(s): Yinfeng Wang, Longxiang Wang, Xiaoshe Dong

To optimize data migration performance between different supercomputing centers in China, we present TCP-DQN, an intelligent TCP congestion control method based on DQN (Deep Q Network). The TCP congestion control process is abstracted as a partially observable Markov decision process in which an agent interacts with the network environment: the agent adjusts the size of the congestion window by observing characteristics of the network state, the environment feeds a reward back to the agent, and the agent tries to maximize the expected cumulative reward over an episode. We designed a weighted reward function to balance throughput against delay. Compared with traditional Q-learning, DQN uses two neural networks and experience replay to reduce the oscillation that may occur during gradient descent. We implemented TCP-DQN and compared it with mainstream congestion control algorithms such as CUBIC, HighSpeed, and NewReno. The results show that TCP-DQN achieves more than twice the throughput of the compared methods, while its latency remains close to theirs.
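The abstract describes a weighted throughput/delay reward, a small set of congestion-window actions, and DQN-style experience replay, but gives no concrete values. A minimal sketch of these pieces, with weights, actions, and buffer capacity as purely illustrative assumptions (not the paper's), might look like:

```python
import random
from collections import deque

def weighted_reward(throughput, delay, w_tp=1.0, w_delay=0.5):
    """Hypothetical weighted reward balancing throughput against delay.
    The paper's actual weights are not given; w_tp and w_delay are assumptions."""
    return w_tp * throughput - w_delay * delay

# Assumed action set on the congestion window (cwnd): the paper only says
# the agent "adjusts the size of the congestion window".
ACTIONS = {
    0: lambda cwnd: cwnd + 1,            # additive increase
    1: lambda cwnd: max(1, cwnd // 2),   # multiplicative decrease
    2: lambda cwnd: cwnd,                # hold
}

class ReplayBuffer:
    """Experience replay as in DQN: store transitions and sample random
    minibatches to de-correlate consecutive gradient updates."""
    def __init__(self, capacity=10000):
        self.buf = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state):
        self.buf.append((state, action, reward, next_state))

    def sample(self, k):
        return random.sample(self.buf, min(k, len(self.buf)))
```

In a full implementation these transitions would feed a neural network that estimates Q-values per action; the sketch only fixes the interface the abstract implies.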

2019 · Vol 14
Author(s): Tayyab Khan, Karan Singh, Kamlesh C. Purohit

Background: With the growing popularity of group communication applications such as file transfer, multimedia events, distance learning, email distribution, multiparty video conferencing, and teleconferencing, multicasting is a useful tool for efficient multipoint data distribution. The efficiency of a communication technique depends on parameters such as processing speed, buffer storage, and the amount of data flowing between nodes. If data exceeds the capacity of a link or node, it introduces congestion in the network. A series of multicast congestion control algorithms have been developed, but in heterogeneous network environments these approaches neither respond to nor reduce congestion quickly when network behavior changes.

Objective: Multicasting is a robust and efficient one-to-many (1:M) group communication technique that reduces communication cost, bandwidth consumption, processing time, and delay while offering reliability similar to regular unicast. This patent presents a novel and comprehensive congestion control method, the integrated multicast congestion control approach (ICMA), to reduce packet loss.

Methods: The proposed mechanism combines a leave-join and flow control mechanism with a proportional-integral-derivative (PID) controller that acts according to the congestion status. In the proposed approach, the PID controller computes the expected incoming rate at each router and feeds this rate back to the upstream routers of the multicast network so they can stabilize their local buffer occupancy.

Results: Simulation results in NS-2 show that the proposed approach outperforms existing methods in terms of delay, throughput, bandwidth utilization, and packet loss.

Conclusion: The proposed congestion control scheme provides better bandwidth utilization and throughput than other existing approaches. Moreover, we have discussed existing congestion control schemes and their research gaps. In future work, we plan to explore fairness and quality-of-service issues in multicast communication.
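The Methods section describes a PID controller that computes an expected incoming rate at each router and feeds it back upstream to stabilize buffer occupancy. The patent's gains and set-point are not given; a generic PID rate controller under assumed values might be sketched as:

```python
class PIDRateController:
    """Sketch of a PID controller regulating a router's buffer occupancy.
    The gains (kp, ki, kd) and the buffer set-point are illustrative
    assumptions, not values from the patent."""

    def __init__(self, kp=0.5, ki=0.1, kd=0.05, setpoint=50.0):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint       # target buffer occupancy (packets)
        self.integral = 0.0
        self.prev_error = 0.0

    def expected_rate(self, buffer_occupancy, base_rate, dt=1.0):
        """Rate to advertise to upstream routers so that local buffer
        occupancy moves toward the set-point: above the set-point the
        advertised rate drops below base_rate, below it the rate rises."""
        error = self.setpoint - buffer_occupancy
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        correction = (self.kp * error
                      + self.ki * self.integral
                      + self.kd * derivative)
        return max(0.0, base_rate + correction)
```

Each router would run one such controller and propagate the computed rate to its upstream neighbors, which is the feedback loop the abstract describes.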


2021
Author(s): Stav Belogolovsky, Philip Korsunsky, Shie Mannor, Chen Tessler, Tom Zahavy

We consider the task of inverse reinforcement learning in contextual Markov decision processes (MDPs). In this setting, contexts, which define the reward and transition kernel, are sampled from a distribution. In addition, although the reward is a function of the context, it is not provided to the agent; instead, the agent observes demonstrations from an optimal policy. The goal is to learn the reward mapping so that the agent acts optimally even when encountering previously unseen contexts, a setting also known as zero-shot transfer. We formulate this problem as a non-differentiable convex optimization problem and propose a novel algorithm to compute its subgradients. Based on this scheme, we analyze several methods both theoretically, comparing sample complexity and scalability, and empirically. Most importantly, we show both theoretically and empirically that our algorithms perform zero-shot transfer (generalize to new and unseen contexts). Specifically, we present empirical experiments in a dynamic treatment regime, where the goal is to learn a reward function that explains the behavior of expert physicians based on recorded data of them treating patients diagnosed with sepsis.
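The abstract frames reward learning as a non-differentiable convex problem solved via subgradients. The paper's specific objective and subgradient construction are not reproduced here; a generic projected-subgradient step, demonstrated on a stand-in non-differentiable convex objective (an L1 distance, not the paper's), illustrates the optimization pattern:

```python
import numpy as np

def subgradient_step(w, subgrad, step, radius=1.0):
    """One projected subgradient step: move against a subgradient of a
    convex objective, then project back onto an L2 ball of the given
    radius (a common constraint on reward-weight vectors)."""
    w = w - step * subgrad
    norm = np.linalg.norm(w)
    if norm > radius:
        w = w * (radius / norm)
    return w

# Stand-in objective: f(w) = ||w - target||_1, convex but not
# differentiable at the optimum; a valid subgradient is sign(w - target).
target = np.array([0.3, -0.2])
w = np.zeros(2)
for t in range(1, 201):
    g = np.sign(w - target)
    w = subgradient_step(w, g, step=0.1 / t)
```

With a diminishing step size the iterates approach the minimizer despite the kink at the optimum, which is the behavior subgradient methods are chosen for.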

