Correction to “Inference of Regular Expressions for Text Extraction from Examples”

Alberto Bartoli; Andrea De Lorenzo; Eric Medvet; Fabiano Tarlao

doi:10.1109/tkde.2016.2557978

Correction to “Inference of Regular Expressions for Text Extraction from Examples”

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2016.2557978 ◽

2016 ◽

Vol 28 (7) ◽

pp. 1944-1944

Author(s):

Alberto Bartoli ◽

Andrea De Lorenzo ◽

Eric Medvet ◽

Fabiano Tarlao

Keyword(s):

Regular Expressions ◽

Text Extraction

A Web Text Extraction Method Based on Regular Expressions and Text Density

2011 International Conference on Information Management, Innovation Management and Industrial Engineering ◽

10.1109/iciii.2011.73 ◽

2011 ◽

Author(s):

Fayun Li

Keyword(s):

Extraction Method ◽

Regular Expressions ◽

Text Extraction ◽

Text Density

Inference of Regular Expressions for Text Extraction from Examples

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2016.2515587 ◽

2016 ◽

Vol 28 (5) ◽

pp. 1217-1230 ◽

Cited By ~ 28

Author(s):

Alberto Bartoli ◽

Andrea De Lorenzo ◽

Eric Medvet ◽

Fabiano Tarlao

Keyword(s):

Regular Expressions ◽

Text Extraction

ESP corpus design: compilation of the Veterinary Nursing Medical Chart Corpus and the Veterinary Nursing Wordlist

Corpora ◽

10.3366/cor.2020.0191 ◽

2020 ◽

Vol 15 (2) ◽

pp. 125-140

Author(s):

Yukiko Ohashi ◽

Noriaki Katagiri ◽

Katsutoshi Oka ◽

Michiko Hanada

Keyword(s):

Word List ◽

English For Specific Purposes ◽

Regular Expressions ◽

Annotation Scheme ◽

Corpus Design ◽

As Species ◽

Lexical Items ◽

Access To Data ◽

General Service

This paper reports on two research results: (1) designing an English for Specific Purposes (ESP) corpus architecture complete with annotations structured by regular expressions; and (2) a case study to test the design to cater for creating a specific vocabulary list using the compiled corpus. The first half of this study involved designing a precisely structured ESP corpus from 190 veterinary medical charts with a hierarchy of the data. The data hierarchy in the corpus consists of document types, outline elements and inline elements, such as species and breed. Perl scripts extracted the data attached to veterinary-specific categories, and the extraction led to creating wordlists. The second part of the research tested the corpus mode, creating a list of commonly observed lexical items in veterinary medicine. The coverage rate of the wordlists by General Service List (GSL) and Academic Word List (AWL) was tested, with the result that 66.4 percent of all lexical items appeared in GSL and AWL, whereas 33.7 percent appeared in none of those lists. The corpus compilation procedures as well as the annotation scheme introduced in this study enable the compilation of specific corpora with explicit annotations, allowing teachers to have access to data required for creating ESP classroom materials.

Text extraction algorithm based on binary clustering

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.00057 ◽

2009 ◽

Vol 29 (1) ◽

pp. 57-59

Author(s):

Wei DAI ◽

Shen-sheng ZHANG

Keyword(s):

Text Extraction ◽

Extraction Algorithm

Text extraction in video image based on wavelet modulus maximum

IET International Conference on Wireless Mobile and Multimedia Networks Proceedings (ICWMMN 2006) ◽

10.1049/cp:20061477 ◽

2006 ◽

Author(s):

Xueyan Li ◽

Shuxu Guo ◽

Fengli Gao

Keyword(s):

Video Image ◽

Text Extraction ◽

Modulus Maximum ◽

Wavelet Modulus Maximum

QMine: A Framework for Mining Quantitative Regular Expressions from System Traces

2020 IEEE 20th International Conference on Software Quality, Reliability and Security Companion (QRS-C) ◽

10.1109/qrs-c51114.2020.00070 ◽

2020 ◽

Author(s):

Pradeep K. Mahato ◽

Apurva Narayan

Keyword(s):

Regular Expressions

A Complete Proof System for 1-Free Regular Expressions Modulo Bisimilarity

Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science ◽

10.1145/3373718.3394744 ◽

2020 ◽

Author(s):

Clemens Grabmayer ◽

Wan Fokkink

Keyword(s):

Proof System ◽

Regular Expressions ◽

Complete Proof

Retaining all the path information for graph reachability queries based on regular expressions

2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) ◽

10.1109/fskd.2013.6816303 ◽

2013 ◽

Author(s):

Yifei Zhang ◽

Guoren Wang ◽

Changkuan Zhao ◽

Ende Zhang

Keyword(s):

Regular Expressions ◽

Path Information ◽

Graph Reachability ◽

Reachability Queries

Object proposals for text extraction in the wild

2015 13th International Conference on Document Analysis and Recognition (ICDAR) ◽

10.1109/icdar.2015.7333753 ◽

2015 ◽

Cited By ~ 8

Author(s):

Lluis Gomez ◽

Dimosthenis Karatzas

Keyword(s):

Text Extraction ◽

Object Proposals ◽

In The Wild

SSMBS: a web server to locate sequentially separated motifs in biological sequences

Journal of Applied Crystallography ◽

10.1107/s0021889809047050 ◽

2009 ◽

Vol 43 (1) ◽

pp. 203-205 ◽

Cited By ~ 1

Author(s):

Chetan Kumar ◽

K. Sekar

Keyword(s):

Amino Acids ◽

Web Server ◽

Nucleotide Sequences ◽

Regular Expressions ◽

Biological Sequences ◽

Sequence Motifs ◽

Specific Order ◽

The Web

The identification of sequence (amino acids or nucleotides) motifs in a particular order in biological sequences has proved to be of interest. This paper describes a computing server, SSMBS, which can locate and display the occurrences of user-defined biologically important sequence motifs (a maximum of five) present in a specific order in protein and nucleotide sequences. While the server can efficiently locate motifs specified using regular expressions, it can also find occurrences of long and complex motifs. The computation is carried out by an algorithm developed using the concepts of quantifiers in regular expressions. The web server is available to users around the clock at http://dicsoft1.physics.iisc.ernet.in/ssmbs/.
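
The idea of locating motifs in a specific order with regular-expression quantifiers can be illustrated with a minimal Python sketch. It is not the SSMBS algorithm, whose details the abstract does not give; the function name `find_ordered_motifs`, the `max_gap` parameter, and the sample sequence and motifs are all hypothetical and chosen only for illustration.

```python
import re

def find_ordered_motifs(sequence, motifs, max_gap=None):
    """Find places where all motifs occur in the given order.

    Illustrative only (not the SSMBS implementation). Each motif is a
    regular expression; consecutive motifs are joined by a lazy gap,
    optionally bounded to at most max_gap characters.
    """
    gap = ".*?" if max_gap is None else ".{0,%d}?" % max_gap
    # Named groups keep each motif's span retrievable even if a motif
    # contains its own parentheses.
    parts = ["(?P<m%d>%s)" % (i, m) for i, m in enumerate(motifs)]
    pattern = re.compile(gap.join(parts))
    hits = []
    for match in pattern.finditer(sequence):
        # One (start, end, text) triple per motif, in order.
        hits.append([
            (match.start("m%d" % i), match.end("m%d" % i), match.group("m%d" % i))
            for i in range(len(motifs))
        ])
    return hits

# Hypothetical protein fragment and two hypothetical motifs.
seq = "AAGSGAAANCTAA"
motifs = ["G.G", "N[^P][ST]"]
print(find_ordered_motifs(seq, motifs))
print(find_ordered_motifs(seq, motifs, max_gap=2))
```

In this sketch the bounded quantifier `{0,n}` limits how far apart two consecutive motifs may be, while the unbounded lazy gap accepts any separation. Only non-overlapping occurrences are reported, because `finditer` resumes scanning after the end of each match.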