nucleotide pair Latest Research Papers

DiTaxa: nucleotide-pair encoding of 16S rRNA for host phenotype and biomarker detection

Bioinformatics ◽

10.1093/bioinformatics/bty954 ◽

2018 ◽

Vol 35 (14) ◽

pp. 2498-2500 ◽

Cited By ~ 6

Author(s):

Ehsaneddin Asgari ◽

Philipp C Münch ◽

Till R Lesker ◽

Alice C McHardy ◽

Mohammad R K Mofrad

Keyword(s):

16S Rrna ◽

State Of The Art ◽

Operational Taxonomic Unit ◽

Supplementary Information ◽

Biomarker Detection ◽

New Paradigm ◽

Nucleotide Pair ◽

Bowel Diseases ◽

Benchmark Datasets ◽

Inflammatory Bowel

Abstract Summary Identifying distinctive taxa for micro-biome-related diseases is considered key to the establishment of diagnosis and therapy options in precision medicine and imposes high demands on the accuracy of micro-biome analysis techniques. We propose an alignment- and reference- free subsequence based 16S rRNA data analysis, as a new paradigm for micro-biome phenotype and biomarker detection. Our method, called DiTaxa, substitutes standard operational taxonomic unit (OTU)-clustering by segmenting 16S rRNA reads into the most frequent variable-length subsequences. We compared the performance of DiTaxa to the state-of-the-art methods in phenotype and biomarker detection, using human-associated 16S rRNA samples for periodontal disease, rheumatoid arthritis and inflammatory bowel diseases, as well as a synthetic benchmark dataset. DiTaxa performed competitively to the k-mer based state-of-the-art approach in phenotype prediction while outperforming the OTU-based state-of-the-art approach in finding biomarkers in both resolution and coverage evaluated over known links from literature and synthetic benchmark datasets. Availability and implementation DiTaxa is available under the Apache 2 license at http://llp.berkeley.edu/ditaxa. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Charge transport in a homogeneous molecular polynucleotide chain excited by initial displacements and velocities of nucleotide pair

10.17537/icmbb18.46 ◽

2018 ◽

Author(s):

A.N. Korshunova ◽

V.D. Lakhno

Keyword(s):

Charge Transport ◽

Nucleotide Pair ◽

Polynucleotide Chain

Download Full-text

Nucleotide-pair encoding of 16S rRNA sequences for host phenotype and biomarker detection

10.1101/334722 ◽

2018 ◽

Cited By ~ 1

Author(s):

Ehsaneddin Asgari ◽

Philipp C. Münch ◽

Till R. Lesker ◽

Alice C. McHardy ◽

Mohammad R.K. Mofrad

Keyword(s):

Rheumatoid Arthritis ◽

16S Rrna ◽

State Of The Art ◽

Data Representation ◽

The State ◽

Biomarker Detection ◽

New Paradigm ◽

Phenotype Prediction ◽

Nucleotide Pair ◽

Phenotype Classification

ABSTRACTIdentifying combinations of taxa distinctive for microbiome-associated diseases is considered key to the establishment of diagnosis and therapy options in precision medicine and imposes high demands on accuracy of microbiome analysis techniques. We propose subsequence based 16S rRNA data analysis, as a new paradigm for microbiome phenotype classification and biomarker detection. This method and software called DiTaxa substitutes standard OTU-clustering or sequence-level analysis by segmenting 16S rRNA reads into the most frequent variable-length subsequences. These subsequences are then used as data representation for downstream phenotype prediction, biomarker detection and taxonomic analysis. Our proposed sequence segmentation called nucleotide-pair encoding (NPE) is an unsupervised data-driven segmentation inspired by Byte-pair encoding, a data compression algorithm. The identified subsequences represent commonly occurring sequence portions, which we found to be distinctive for taxa at varying evolutionary distances and highly informative for predicting host phenotypes. We compared the performance of DiTaxa to the state-of-the-art methods in disease phenotype prediction and biomarker detection, using human-associated 16S rRNA samples for periodontal disease, rheumatoid arthritis and inflammatory bowel diseases, as well as a synthetic benchmark dataset. DiTaxa identified 17 out of 29 taxa with confirmed links to periodontitis (recall= 0.59), relative to 3 out of 29 taxa (recall= 0.10) by the state-of-the-art method. On synthetic benchmark data, DiTaxa obtained full precision and recall in biomarker detection, compared to 0.91 and 0.90, respectively. In addition, machine-learning classifiers trained to predict host disease phenotypes based on the NPE representation performed competitively to the state-of-the art using OTUs or k-mers. For the rheumatoid arthritis dataset, DiTaxa substantially outperformed OTU features with a macro-F1 score of 0.76 compared to 0.65. Due to the alignment- and reference free nature, DiTaxa can efficiently run on large datasets. The full analysis of a large 16S rRNA dataset of 1359 samples required ≈1.5 hours on 20 cores, while the standard pipeline needed ≈6.5 hours in the same setting.AvailabilityAn implementation of our method called DiTaxa is available under the Apache 2 licence at http://llp.berkeley.edu/ditaxa.

Download Full-text

Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine

Scientific Reports ◽

10.1038/srep46757 ◽

2017 ◽

Vol 7 (1) ◽

Cited By ~ 40

Author(s):

Pengwei Xing ◽

Ran Su ◽

Fei Guo ◽

Leyi Wei

Keyword(s):

Support Vector Machine ◽

Support Vector ◽

Nucleotide Pair ◽

Position Specificity

Download Full-text

Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats

Archives of Virology ◽

10.1007/s00705-016-2946-0 ◽

2016 ◽

Vol 161 (9) ◽

pp. 2589-2594 ◽

Cited By ~ 5

Author(s):

Jianye Wang ◽

Yu Huang ◽

Mingxu Zhou ◽

Guoqiang Zhu

Keyword(s):

Genome Sequence ◽

Muscovy Duck ◽

Nucleotide Pair ◽

Inverted Terminal Repeats ◽

Terminal Repeats ◽

Muscovy Duck Parvovirus

Download Full-text

Nucleotide pair

The Dictionary of Genomics, Transcriptomics and Proteomics ◽

10.1002/9783527678679.dg08749 ◽

2015 ◽

pp. 1-1

Keyword(s):

Nucleotide Pair

Download Full-text

A three-layer ONIOM model for the outside binding of cationic porphyrins and nucleotide pair DNA

Journal of Molecular Modeling ◽

10.1007/s00894-012-1597-7 ◽

2012 ◽

Vol 19 (2) ◽

pp. 811-824 ◽

Cited By ~ 2

Author(s):

Gloria I. Cárdenas-Jirón ◽

Luis Cortez-Santibañez

Keyword(s):

Nucleotide Pair ◽

Cationic Porphyrins

Download Full-text

Comparative Analysis of Mitochondrial DNA by Using Nucleotide Genomic Signals

Materials Science Forum ◽

10.4028/www.scientific.net/msf.670.507 ◽

2010 ◽

Vol 670 ◽

pp. 507-516 ◽

Cited By ~ 2

Author(s):

Paul Dan Cristea ◽

Rodica Tuduce

Keyword(s):

Mitochondrial Dna ◽

Comparative Analysis ◽

Nucleotide Pair ◽

Innovative Technique ◽

Evolutionary Changes ◽

Genomic Signals

The nucleotide imbalance, the nucleotide pair imbalance and the nucleotide path of mitochondrial Nucleotide Genomic Signals (mtNuGSs) for several taxa have been analyzed comparatively. Evolutionary changes of conservative segments, as well as individual mutations of hyper-variable segments have been identified. An innovative technique, based on reference-offset representation of sets of related signals has been used throughout the study.

Download Full-text

Genetic heterogeneity of migraine with and without aura in Danes cannot be explained by mutation in mtDNA nucleotide pair 11084

Acta Neurologica Scandinavica ◽

10.1111/j.1600-0404.1997.tb00262.x ◽

2009 ◽

Vol 96 (3) ◽

pp. 171-173 ◽

Cited By ~ 11

Author(s):

M. B. Russell ◽

M. Diamant ◽

S. Nerby

Keyword(s):

Genetic Heterogeneity ◽

Nucleotide Pair

Download Full-text

Feature Selection for the Promoter Recognition and Prediction Problem

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch132 ◽

2008 ◽

pp. 2248-2262

Author(s):

George Potamias ◽

Alexandros Kanterakis

Keyword(s):

Feature Selection ◽

Dna Sequences ◽

Data Sets ◽

Prediction Problem ◽

Hybrid Filter ◽

Nucleotide Pair ◽

Promoter Recognition ◽

Whole Genomes ◽

Computational Discovery ◽

Algorithmic Process

With the completion of various whole genomes, one of the fundamental bioinformatics tasks is the identification of functional regulatory regions, such as promoters, and the computational discovery of genes from the produced DNA sequences. Confronted with huge amounts of DNA sequences, the utilization of automated computational sequence analysis methods and tools is more than demanding. In this article, we present an efficient feature selection to the promoter recognition, prediction, and localization problem. The whole approach is implemented in a system called MineProm. The basic idea underlying our approach is that each position-nucleotide pair in a DNA sequence is represented by a distinct binary-valued feature—the binary position base value (BPBV). A hybrid filter-wrapper, featuredeletion (or addition) algorithmic process is called for in order to select those BPBVs that best discriminate between two DNA sequences target classes (i.e., promoter vs. nonpromoter). MineProm is tested on two widely used benchmark data sets. Assessment of results demonstrates the reliability of the approach.

Download Full-text

nucleotide pair
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

DiTaxa: nucleotide-pair encoding of 16S rRNA for host phenotype and biomarker detection

Charge transport in a homogeneous molecular polynucleotide chain excited by initial displacements and velocities of nucleotide pair

Nucleotide-pair encoding of 16S rRNA sequences for host phenotype and biomarker detection

Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine

Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats

Nucleotide pair

A three-layer ONIOM model for the outside binding of cationic porphyrins and nucleotide pair DNA

Comparative Analysis of Mitochondrial DNA by Using Nucleotide Genomic Signals

Genetic heterogeneity of migraine with and without aura in Danes cannot be explained by mutation in mtDNA nucleotide pair 11084

Feature Selection for the Promoter Recognition and Prediction Problem

Export Citation Format

nucleotide pairRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

DiTaxa: nucleotide-pair encoding of 16S rRNA for host phenotype and biomarker detection

Charge transport in a homogeneous molecular polynucleotide chain excited by initial displacements and velocities of nucleotide pair

Nucleotide-pair encoding of 16S rRNA sequences for host phenotype and biomarker detection

Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine

Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats

Nucleotide pair

A three-layer ONIOM model for the outside binding of cationic porphyrins and nucleotide pair DNA

Comparative Analysis of Mitochondrial DNA by Using Nucleotide Genomic Signals

Genetic heterogeneity of migraine with and without aura in Danes cannot be explained by mutation in mtDNA nucleotide pair 11084

Feature Selection for the Promoter Recognition and Prediction Problem

nucleotide pair
Recently Published Documents