Reinforcement Learning for Efficient Network Penetration Testing

Mohamed C. Ghanem; Thomas M. Chen

doi:10.3390/info11010006

Reinforcement Learning for Efficient Network Penetration Testing

Information ◽

10.3390/info11010006 ◽

2019 ◽

Vol 11 (1) ◽

pp. 6 ◽

Cited By ~ 3

Author(s):

Mohamed C. Ghanem ◽

Thomas M. Chen

Keyword(s):

Reinforcement Learning ◽

Computer Network ◽

Complex Problem ◽

Machine Learning Techniques ◽

Testing System ◽

Penetration Testing ◽

Learning Module ◽

Learning Techniques ◽

Partially Observed ◽

Markov Decision

Penetration testing (also known as pentesting or PT) is a common practice for actively assessing the defenses of a computer network by planning and executing all possible attacks to discover and exploit existing vulnerabilities. Current penetration testing methods are increasingly becoming non-standard, composite and resource-consuming despite the use of evolving tools. In this paper, we propose and evaluate an AI-based pentesting system which makes use of machine learning techniques, namely reinforcement learning (RL) to learn and reproduce average and complex pentesting activities. The proposed system is named Intelligent Automated Penetration Testing System (IAPTS) consisting of a module that integrates with industrial PT frameworks to enable them to capture information, learn from experience, and reproduce tests in future similar testing cases. IAPTS aims to save human resources while producing much-enhanced results in terms of time consumption, reliability and frequency of testing. IAPTS takes the approach of modeling PT environments and tasks as a partially observed Markov decision process (POMDP) problem which is solved by POMDP-solver. Although the scope of this paper is limited to network infrastructures PT planning and not the entire practice, the obtained results support the hypothesis that RL can enhance PT beyond the capabilities of any human PT expert in terms of time consumed, covered attacking vectors, accuracy and reliability of the outputs. In addition, this work tackles the complex problem of expertise capturing and re-use by allowing the IAPTS learning module to store and re-use PT policies in the same way that a human PT expert would learn but in a more efficient way.

Download Full-text

Optimal Policies for Quantum Markov Decision Processes

International Journal of Automation and Computing ◽

10.1007/s11633-021-1278-z ◽

2021 ◽

Author(s):

Ming-Sheng Ying ◽

Yuan Feng ◽

Sheng-Gang Ying

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Quantum Systems ◽

Sequential Decision Making ◽

Mathematical Framework ◽

Sequential Decision ◽

Learning Techniques ◽

Optimal Policies ◽

Markov Decision ◽

Programming Algorithms

AbstractMarkov decision process (MDP) offers a general framework for modelling sequential decision making where outcomes are random. In particular, it serves as a mathematical framework for reinforcement learning. This paper introduces an extension of MDP, namely quantum MDP (qMDP), that can serve as a mathematical model of decision making about quantum systems. We develop dynamic programming algorithms for policy evaluation and finding optimal policies for qMDPs in the case of finite-horizon. The results obtained in this paper provide some useful mathematical tools for reinforcement learning techniques applied to the quantum world.

Download Full-text

Predicting land deformation by integrating InSAR data and cone penetration testing through machine learning techniques

Proceedings of the International Association of Hydrological Sciences ◽

10.5194/piahs-382-525-2020 ◽

2020 ◽

Vol 382 ◽

pp. 525-529

Author(s):

Melika Sajadian ◽

Ana Teixeira ◽

Faraz S. Tehrani ◽

Mathias Lemmens

Keyword(s):

Machine Learning ◽

Soil Mechanics ◽

Machine Learning Techniques ◽

Cone Penetration ◽

Penetration Testing ◽

Cone Penetration Testing ◽

Learning Techniques ◽

Land Deformation ◽

Spatio Temporal

Abstract. Built environments developed on compressible soils are susceptible to land deformation. The spatio-temporal monitoring and analysis of these deformations are necessary for sustainable development of cities. Techniques such as Interferometric Synthetic Aperture Radar (InSAR) or predictions based on soil mechanics using in situ characterization, such as Cone Penetration Testing (CPT) can be used for assessing such land deformations. Despite the combined advantages of these two methods, the relationship between them has not yet been investigated. Therefore, the major objective of this study is to reconcile InSAR measurements and CPT measurements using machine learning techniques in an attempt to better predict land deformation.

Download Full-text

Cloud Load Balancing and Reinforcement Learning

Advances in Business Information Systems and Analytics - Cloud Computing Technologies for Green Enterprises ◽

10.4018/978-1-5225-3038-1.ch011 ◽

2018 ◽

pp. 266-291

Author(s):

Abdelghafour Harraz ◽

Mostapha Zbakh

Keyword(s):

Artificial Intelligence ◽

Reinforcement Learning ◽

Load Balancing ◽

Decision Process ◽

Cloud System ◽

Human Intervention ◽

Q Learning ◽

State Action ◽

Learning Techniques ◽

Markov Decision

Artificial Intelligence allows to create engines that are able to explore, learn environments and therefore create policies that permit to control them in real time with no human intervention. It can be applied, through its Reinforcement Learning techniques component, using frameworks such as temporal differences, State-Action-Reward-State-Action (SARSA), Q Learning to name a few, to systems that are be perceived as a Markov Decision Process, this opens door in front of applying Reinforcement Learning to Cloud Load Balancing to be able to dispatch load dynamically to a given Cloud System. The authors will describe different techniques that can used to implement a Reinforcement Learning based engine in a cloud system.

Download Full-text

Machine Learning Techniques for Underwater Wireless Sensor Networks

Energy-Efficient Underwater Wireless Communications and Networking - Advances in Environmental Engineering and Green Technologies ◽

10.4018/978-1-7998-3640-7.ch013 ◽

2021 ◽

pp. 194-211

Author(s):

Deepti Rani ◽

Anju Sangwan ◽

Anupma Sangwan ◽

Tajinder Singh

Keyword(s):

Machine Learning ◽

Wireless Sensor Networks ◽

Sensor Networks ◽

Information Seeking ◽

Complex Problem ◽

Machine Learning Techniques ◽

Wireless Sensor ◽

Learning Techniques ◽

Primary Focus

With the enormous growth of sensor networks, information seeking from such networks has become an invaluable source of knowledge for various organizations to enhance the comprehension of people interests. Not only wireless sensor networks (WSNs) but its various classes also remain the hot topics of research. In this chapter, the primary focus is to understand the concept of sensor network in underwater scenario. Various mechanisms are used to recognize the activities underwater using sensor which examines the real-time events. With these features, a few challenges are also associated with sensor networks, which are addressed here. Machine learning (ML) techniques are the perfect key of success to resolve such issues due to their feasibility and adaption in complex problem environment. Therefore, various ML techniques have been explained to enhance the operational performance of WSNs, especially in underwater WSNs (UWSNs). The main objective of this chapter is to understand the concepts of UWSNs and role of ML to address the performance issues of UWSNs.

Download Full-text

Controlling a Simulated Robot Using Machine Learning Techniques

ASME 2010 World Conference on Innovative Virtual Reality ◽

10.1115/winvr2010-3705 ◽

2010 ◽

Author(s):

Jonathan Becker ◽

Aveek Purohit ◽

Zheng Sun

Keyword(s):

Machine Learning ◽

Reinforcement Learning ◽

Linear Regression ◽

Pid Controller ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Gaming Environment ◽

Using Data

USARSim group at NIST developed a simulated robot that operated in the Unreal Tournament 3 (UT3) gaming environment. They used a software PID controller to control the robot in UT3 worlds. Unfortunately, the PID controller did not work well, so NIST asked us to develop a better controller using machine learning techniques. In the process, we characterized the software PID controller and the robot’s behavior in UT3 worlds. Using data collected from our simulations, we compared different machine learning techniques including linear regression and reinforcement learning (RL). Finally, we implemented a RL based controller in Matlab and ran it in the UT3 environment via a TCP/IP link between Matlab and UT3.

Download Full-text

EFFECTS OF COMMUNICATION ON GROUP LEARNING RATES IN A MULTI-AGENT ENVIRONMENT

Advances in Complex Systems ◽

10.1142/s0219525903000979 ◽

2003 ◽

Vol 06 (03) ◽

pp. 405-426 ◽

Cited By ~ 1

Author(s):

PAUL DARBYSHIRE

Keyword(s):

Reinforcement Learning ◽

Cognitive Abilities ◽

Complex Adaptive System ◽

Machine Learning Techniques ◽

Simulation Techniques ◽

Learning Rates ◽

Learning Techniques ◽

Complex Adaptive ◽

Rate Of Learning ◽

Multi Agent

Distillations utilize multi-agent based modeling and simulation techniques to study warfare as a complex adaptive system at the conceptual level. The focus is placed on the interactions between the agents to facilitate study of cause and effect between individual interactions and overall system behavior. Current distillations do not utilize machine-learning techniques to model the cognitive abilities of individual combatants but employ agent control paradigms to represent agents as highly instinctual entities. For a team of agents implementing a reinforcement-learning paradigm, the rate of learning is not sufficient for agents to adapt to this hostile environment. However, by allowing the agents to communicate their respective rewards for actions performed as the simulation progresses, the rate of learning can be increased sufficiently to significantly increase the teams chances of survival. This paper presents the results of trials to measure the success of a team-based approach to the reinforcement-learning problem in a distillation, using reward communication to increase learning rates.

Download Full-text

Modular Neural Network for Learning Visual Features, Routes, and Operation Through Human Driving Data Toward Automatic Driving System

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2020.p0368 ◽

2020 ◽

Vol 24 (3) ◽

pp. 368-376

Author(s):

Shun Otsubo ◽

Yasutake Takahashi ◽

Masaki Haruna ◽

◽

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Visual Information ◽

Machine Learning Techniques ◽

Driving System ◽

Modular Neural Networks ◽

Automatic Driving ◽

Learning Module ◽

Learning Techniques ◽

Almost All

This paper proposes an automatic driving system based on a combination of modular neural networks processing human driving data. Research on automatic driving vehicles has been actively conducted in recent years. Machine learning techniques are often utilized to realize an automatic driving system capable of imitating human driving operations. Almost all of them adopt a large monolithic learning module, as typified by deep learning. However, it is inefficient to use a monolithic deep learning module to learn human driving operations (accelerating, braking, and steering) using the visual information obtained from a human driving a vehicle. We propose combining a series of modular neural networks that independently learn visual feature quantities, routes, and driving maneuvers from human driving data, thereby imitating human driving operations and efficiently learning a plurality of routes. This paper demonstrates the effectiveness of the proposed method through experiments using a small vehicle.

Download Full-text

Q-Learning based Routing Protocol to Enhance Network Lifetime in WSNs

International journal of Computer Networks & Communications ◽

10.5121/ijcnc.2021.13204 ◽

2021 ◽

Vol 13 (2) ◽

pp. 57-80

Author(s):

Arunita Kundaliya ◽

D.K. Lobiyal

Keyword(s):

Reinforcement Learning ◽

Network Lifetime ◽

Residual Energy ◽

Efficient Solutions ◽

Machine Learning Techniques ◽

Q Learning ◽

Learning Techniques ◽

Aodv Protocol ◽

Optimal Action ◽

Additional Memory

In resource constraint Wireless Sensor Networks (WSNs), enhancement of network lifetime has been one of the significantly challenging issues for the researchers. Researchers have been exploiting machine learning techniques, in particular reinforcement learning, to achieve efficient solutions in the domain of WSN. The objective of this paper is to apply Q-learning, a reinforcement learning technique, to enhance the lifetime of the network, by developing distributed routing protocols. Q-learning is an attractive choice for routing due to its low computational requirements and additional memory demands. To facilitate an agent running at each node to take an optimal action, the approach considers node’s residual energy, hop length to sink and transmission power. The parameters, residual energy and hop length, are used to calculate the Q-value, which in turn is used to decide the optimal next-hop for routing. The proposed protocols’ performance is evaluated through NS3 simulations, and compared with AODV protocol in terms of network lifetime, throughput and end-to-end delay.

Download Full-text

Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques

International Journal of General Systems ◽

10.1080/03081079.2014.883387 ◽

2014 ◽

Vol 43 (6) ◽

pp. 649-669 ◽

Cited By ~ 7

Author(s):

Abhijit Gosavi

Keyword(s):

Dynamic Programming ◽

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Techniques ◽

Markov Decision

Download Full-text

ANALYSIS OF THE APPLICATION OF REINFORCEMENT LEARNING ALGORITHMS ON THE STARCRAFT II VIDEO GAME

Revista Destaques Acadêmicos ◽

10.22410/issn.2176-3070.v11i4a2019.2403 ◽

2020 ◽

Vol 11 (4) ◽

Author(s):

Leandro Vian ◽

Marcelo De Gomensoro Malheiros

Keyword(s):

Reinforcement Learning ◽

Video Game ◽

Cost Effective ◽

Machine Learning Techniques ◽

Specific Training ◽

Board Games ◽

Learning Techniques ◽

Technical Issues ◽

Strategy Game ◽

Real Time Strategy Game

In recent years Machine Learning techniques have become the driving force behind the worldwide emergence of Artificial Intelligence, producing cost-effective and precise tools for pattern recognition and data analysis. A particular approach for the training of neural networks, Reinforcement Learning (RL), achieved prominence creating almost unbeatable artificial opponents in board games like Chess or Go, and also on video games. This paper gives an overview of Reinforcement Learning and tests this approach against a very popular real-time strategy game, Starcraft II. Our goal is to examine the tools and algorithms readily available for RL, also addressing different scenarios where a neural network can be linked to Starcraft II to learn by itself. This work describes both the technical issues involved and the preliminary results obtained by the application of two specific training strategies, A2C and DQN.

Download Full-text