approximate pattern matching Latest Research Papers

MONI: A Pangenomics Index for Finding MEMs

10.1101/2021.07.06.451246 ◽

2021 ◽

Author(s):

Massimiliano Rossi ◽

Marco Oliva ◽

Ben Langmead ◽

Travis Gagie ◽

Christina Boucher

Keyword(s):

Pattern Matching ◽

Linear Time ◽

Repetitive Sequences ◽

Major Advance ◽

Human Chromosomes ◽

Time And Space ◽

Index Construction ◽

Approximate Pattern Matching ◽

Human Genomes ◽

Novel Algorithm

Recently, Gagie et al. proposed a version of the FM-index, called the r-index, that can store thousands of human genomes on a commodity computer. Then Kuhnle et al. showed how to build the r-index efficiently via a technique called prefix-free parsing (PFP) and demonstrated its effectiveness for exact pattern matching. Exact pattern matching can be leveraged to support approximate pattern matching but the r-index itself cannot support efficiently popular and important queries such as finding maximal exact matches (MEMs). To address this shortcoming, Bannai et al. introduced the concept of thresholds, and showed that storing them together with the r-index enables efficient MEM finding --- but they did not say how to find those thresholds. We present a novel algorithm that applies PFP to build the r-index and find the thresholds simultaneously and in linear time and space with respect to the size of the prefix-free parse. Our implementation called MONI can rapidly find MEMs between reads and large sequence collections of highly repetitive sequences. Compared to other read aligners -- PuffAligner, Bowtie2, BWA-MEM, and CHIC -- MONI used 2--11 times less memory and was 2--32 times faster for index construction. Moreover, MONI was less than one thousandth the size of competing indexes for large collections of human chromosomes. Thus, MONI represents a major advance in our ability to perform MEM finding against very large collections of related references. Availability: MONI is publicly available at https://github.com/maxrossi91/moni.

Download Full-text

Dynamic Partitioning of Search Patternsfor Approximate Pattern Matching using Search Schemes

iScience ◽

10.1016/j.isci.2021.102687 ◽

2021 ◽

pp. 102687

Author(s):

Luca Renders ◽

Kathleen Marchal ◽

Jan Fostier

Keyword(s):

Pattern Matching ◽

Dynamic Partitioning ◽

Approximate Pattern Matching

Download Full-text

Faster Approximate Pattern Matching: A Unified Approach

2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS) ◽

10.1109/focs46700.2020.00095 ◽

2020 ◽

Author(s):

Panagiotis Charalampopoulos ◽

Tomasz Kociumaka ◽

Philip Wellnitz

Keyword(s):

Pattern Matching ◽

Unified Approach ◽

Approximate Pattern Matching

Download Full-text

Approximate Pattern Matching for On-Chip Interconnect Traffic Prediction

Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques ◽

10.1145/3410463.3414667 ◽

2020 ◽

Author(s):

Vignesh Adhinarayanan ◽

Wu-chun Feng

Keyword(s):

Pattern Matching ◽

Traffic Prediction ◽

Approximate Pattern Matching ◽

On Chip

Download Full-text

Efficient Approximate Pattern Matching Algorithm for Biological Sequences

Journal of Xidian University ◽

10.37896/jxu14.8/128 ◽

2020 ◽

Vol 14 (8) ◽

Keyword(s):

Pattern Matching ◽

Biological Sequences ◽

Matching Algorithm ◽

Approximate Pattern Matching ◽

Pattern Matching Algorithm

Download Full-text

NetDAP: (δ, γ) −approximate pattern matching with length constraints

Applied Intelligence ◽

10.1007/s10489-020-01778-1 ◽

2020 ◽

Vol 50 (11) ◽

pp. 4094-4116

Author(s):

Youxi Wu ◽

Jinquan Fan ◽

Yan Li ◽

Lei Guo ◽

Xindong Wu

Keyword(s):

Pattern Matching ◽

Approximate Pattern Matching

Download Full-text

Approximate Pattern Matching in Massive Graphs with Precision and Recall Guarantees

Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data ◽

10.1145/3318464.3380566 ◽

2020 ◽

Author(s):

Tashin Reza ◽

Matei Ripeanu ◽

Geoffrey Sanders ◽

Roger Pearce

Keyword(s):

Pattern Matching ◽

Massive Graphs ◽

Approximate Pattern Matching

Download Full-text

Approximate pattern matching on elastic-degenerate text

Theoretical Computer Science ◽

10.1016/j.tcs.2019.08.012 ◽

2020 ◽

Vol 812 ◽

pp. 109-122 ◽

Cited By ~ 1

Author(s):

Giulia Bernardini ◽

Nadia Pisanti ◽

Solon P. Pissis ◽

Giovanna Rosone

Keyword(s):

Pattern Matching ◽

Approximate Pattern Matching

Download Full-text

Approximate Pattern Matching using Hierarchical Graph Construction and Sparse Distributed Representation

Proceedings of the International Conference on Neuromorphic Systems - ICONS '19 ◽

10.1145/3354265.3354286 ◽

2019 ◽

Author(s):

Aakanksha Mathuria ◽

Dan W. Hammerstrom

Keyword(s):

Pattern Matching ◽

Distributed Representation ◽

Hierarchical Graph ◽

Approximate Pattern Matching

Download Full-text

State Complexity of Neighbourhoods and Approximate Pattern Matching

International Journal of Foundations of Computer Science ◽

10.1142/s0129054118400099 ◽

2018 ◽

Vol 29 (02) ◽

pp. 315-329 ◽

Cited By ~ 5

Author(s):

Timothy Ng ◽

David Rappaport ◽

Kai Salomaa

Keyword(s):

Lower Bound ◽

Pattern Matching ◽

Finite Automaton ◽

The State ◽

Worst Case ◽

State Complexity ◽

Approximate Pattern Matching ◽

The Given ◽

Nondeterministic Finite Automaton ◽

Distance Formula

The neighbourhood of a language [Formula: see text] with respect to an additive distance consists of all strings that have distance at most the given radius from some string of [Formula: see text]. We show that the worst case deterministic state complexity of a radius [Formula: see text] neighbourhood of a language recognized by an [Formula: see text] state nondeterministic finite automaton [Formula: see text] is [Formula: see text]. In the case where [Formula: see text] is deterministic we get the same lower bound for the state complexity of the neighbourhood if we use an additive quasi-distance. The lower bound constructions use an alphabet of size linear in [Formula: see text]. We show that the worst case state complexity of the set of strings that contain a substring within distance [Formula: see text] from a string recognized by [Formula: see text] is [Formula: see text].

Download Full-text

approximate pattern matching
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

MONI: A Pangenomics Index for Finding MEMs

Dynamic Partitioning of Search Patternsfor Approximate Pattern Matching using Search Schemes

Faster Approximate Pattern Matching: A Unified Approach

Approximate Pattern Matching for On-Chip Interconnect Traffic Prediction

Efficient Approximate Pattern Matching Algorithm for Biological Sequences

NetDAP: (δ, γ) −approximate pattern matching with length constraints

Approximate Pattern Matching in Massive Graphs with Precision and Recall Guarantees

Approximate pattern matching on elastic-degenerate text

Approximate Pattern Matching using Hierarchical Graph Construction and Sparse Distributed Representation

State Complexity of Neighbourhoods and Approximate Pattern Matching

Export Citation Format

approximate pattern matchingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

MONI: A Pangenomics Index for Finding MEMs

Dynamic Partitioning of Search Patternsfor Approximate Pattern Matching using Search Schemes

Faster Approximate Pattern Matching: A Unified Approach

Approximate Pattern Matching for On-Chip Interconnect Traffic Prediction

Efficient Approximate Pattern Matching Algorithm for Biological Sequences

NetDAP: (δ, γ) −approximate pattern matching with length constraints

Approximate Pattern Matching in Massive Graphs with Precision and Recall Guarantees

Approximate pattern matching on elastic-degenerate text

Approximate Pattern Matching using Hierarchical Graph Construction and Sparse Distributed Representation

State Complexity of Neighbourhoods and Approximate Pattern Matching

approximate pattern matching
Recently Published Documents