Space-time tradeoff in regular expression matching with semi-deterministic finite automata

A Memory Efficient Regular Expression Matching by Compressing Deterministic Finite Automata

International Journal of Computer Applications ◽

10.5120/21815-5143 ◽

2015 ◽

Vol 122 (20) ◽

pp. 14-17

Author(s):

Utkarsha P.Pisolkar ◽

Shivaji R. Lahane

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Deterministic Finite Automata ◽

Regular Expression Matching ◽

Memory Efficient

Download Full-text

Software Toolchain for Large-Scale RE-NFA Construction on FPGA

International Journal of Reconfigurable Computing ◽

10.1155/2009/301512 ◽

2009 ◽

Vol 2009 ◽

pp. 1-10 ◽

Cited By ~ 3

Author(s):

Yi-Hua E. Yang ◽

Viktor K. Prasanna

Keyword(s):

High Performance ◽

Large Scale ◽

Regular Expression ◽

Finite Automata ◽

Fixed Number ◽

Regular Expressions ◽

Pattern Complexity ◽

Regular Expression Matching ◽

Area Increase ◽

Prototype Software

We present a software toolchain for constructing large-scaleregular expression matching(REM) on FPGA. The software automates the conversion of regular expressions into compact and high-performance nondeterministic finite automata (RE-NFA). Each RE-NFA is described as an RTL regular expression matching engine (REME) in VHDL for FPGA implementation. Assuming a fixed number of fan-out transitions per state, ann-statem-bytes-per-cycle RE-NFA can be constructed inO(n×m)time andO(n×m)memory by our software. A large number of RE-NFAs are placed onto a two-dimensionalstaged pipeline, allowing scalability to thousands of RE-NFAs with linear area increase and little clock rate penalty due to scaling. On a PC with a 2 GHz Athlon64 processor and 2 GB memory, our prototype software constructs hundreds of RE-NFAs used by Snort in less than 10 seconds. We also designed a benchmark generator which can produce RE-NFAs with configurable pattern complexity parameters, including state count, state fan-in, loop-back and feed-forward distances. Several regular expressions with various complexities are used to test the performance of our RE-NFA construction software.

Download Full-text

Simultaneous Finite Automata: An Efficient Data-Parallel Model for Regular Expression Matching

2013 42nd International Conference on Parallel Processing ◽

10.1109/icpp.2013.31 ◽

2013 ◽

Cited By ~ 7

Author(s):

Ryoma Sinya ◽

Kiminori Matsuzaki ◽

Masataka Sassa

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Parallel Model ◽

Data Parallel ◽

Regular Expression Matching ◽

Efficient Data

Download Full-text

Ambiguity in Finite Automata

DAIMI Report Series ◽

10.7146/dpb.v6i82.6498 ◽

1977 ◽

Vol 6 (82) ◽

Author(s):

Erik Meineche Schmidt

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Regular Languages ◽

Deterministic Finite Automata

<p>The gain in succinctness of descriptions of regular languages when nondeterministic (unambiguous) finite automata are used rather than unambiguous (deterministic) finite automata, is not bounded by any polynomium.</p><p>The problem of deciding whether an unambiguous regular expression does not generate all words over its terminal alphabet, is in NP.</p>

Download Full-text

Scalable TCAM-based regular expression matching with compressed finite automata

Architectures for Networking and Communications Systems ◽

10.1109/ancs.2013.6665178 ◽

2013 ◽

Cited By ~ 11

Author(s):

Kun Huang ◽

Linxuan Ding ◽

Gaogang Xie ◽

Dafang Zhang ◽

Alex X. Liu ◽

...

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Regular Expression Matching

Download Full-text

A novel regular expression matching algorithm based on multi-dimensional finite automata

2014 IEEE 15th International Conference on High Performance Switching and Routing (HPSR) ◽

10.1109/hpsr.2014.6900887 ◽

2014 ◽

Author(s):

Yangyang Gong ◽

Qinrang Liu ◽

Xiangyu Shao ◽

Cong Pan ◽

Huijuan Jiao

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Matching Algorithm ◽

Regular Expression Matching

Download Full-text

A Method to Convert Regular Expression into Non-Deterministic Finite Automata

International Journal of Applied Science and Engineering ◽

10.30954/2322-0465.2.2019.3 ◽

2019 ◽

Vol 7 (2) ◽

Author(s):

Abhishek Singh

Keyword(s):

Regular Expression ◽

Finite Automata ◽

Deterministic Finite Automata

Download Full-text

StriFA: Stride Finite Automata for High-Speed Regular Expression Matching in Network Intrusion Detection Systems

IEEE Systems Journal ◽

10.1109/jsyst.2013.2244791 ◽

2013 ◽

Vol 7 (3) ◽

pp. 374-384 ◽

Cited By ~ 6

Author(s):

Xiaofei Wang ◽

Yang Xu ◽

Junchen Jiang ◽

Olga Ormond ◽

Bin Liu ◽

...

Keyword(s):

Intrusion Detection ◽

High Speed ◽

Regular Expression ◽

Finite Automata ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems ◽

Regular Expression Matching

Download Full-text

Efficient Construction of the Equation Automaton

Algorithms ◽

10.3390/a14080238 ◽

2021 ◽

Vol 14 (8) ◽

pp. 238

Author(s):

Faissal Ouardi ◽

Zineb Lotfi ◽

Bilal Elghadyry

Keyword(s):

Computational Complexity ◽

Fast Algorithm ◽

Time Complexity ◽

Regular Expression ◽

Finite Automata ◽

Fast Computation ◽

Minimization Algorithm ◽

Deterministic Finite Automata ◽

Efficient Construction ◽

Time And Space Complexity

This paper describes a fast algorithm for constructing directly the equation automaton from the well-known Thompson automaton associated with a regular expression. Allauzen and Mohri have presented a unified construction of small automata and gave a construction of the equation automaton with time and space complexity in O(mlogm+m2), where m denotes the number of Thompson automaton transitions. It is based on two classical automata operations, namely epsilon-removal and Hopcroft’s algorithm for deterministic Finite Automata (DFA) minimization. Using the notion of c-continuation, Ziadi et al. presented a fast computation of the equation automaton in O(m2) time complexity. In this paper, we design an output-sensitive algorithm combining advantages of the previous algorithms and show that its computational complexity can be reduced to O(m×|Q≡e|), where |Q≡e| denotes the number of states of the equation automaton, by an epsilon-removal and Bubenzer minimization algorithm of an Acyclic Deterministic Finite Automata (ADFA).

Download Full-text

Hybrid Finite Automata-Based Algorithm for Large Scale Regular Expression Matching

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.263-266.3108 ◽

2012 ◽

Vol 263-266 ◽

pp. 3108-3113

Author(s):

Wei He ◽

Yun Fei Guo ◽

Hong Chao Hu

Keyword(s):

Large Scale ◽

Regular Expression ◽

Finite Automata ◽

System Throughput ◽

Hybrid Automata ◽

Deterministic Finite Automaton ◽

State Explosion ◽

Regular Expression Matching ◽

High System ◽

Memory Utilization

Fast data transmission put forward high requirements on network content matching (NCM). Due to the high time complexity, Nondeterministic Finite Automata (NFA) was unable to meet the demand of regular expression matching (REM) which was the core of NCM; Transfer NFA to Deterministic Finite Automaton (DFA) could enhance the throughput, but led to state explosion, which increased demand for memory. To balance memory and throughput, state explosion in the transformation from NFA to DFA has been analyzed and a new method DC-DFA is presented for large scale REM. DC-DFA is based on hybrid automata structure which composed of NFA and DFA. DC-DFA introduces GradeOne classification to cut the memory usage and deep classification to improve throughput. The results show that for serious state explosion, DC-DFA could reduce 75% DFA states and improve memory utilization efficiently while maintain high system throughput.

Download Full-text