genomic sequence information Latest Research Papers

Abstract This chapter on immunogenetics in the rabbit focused on some genes with genetic and genomic sequence information including those encoding: soluble circulating immunoglobulin molecules (Igs) and their surface-bound forms on B lymphocytes (BCRs); T-cell receptors on T lymphocyte surfaces, (TCRs); the rabbit Leukocyte Antigen (RLA) complex (proteins on cells that function to present antigen fragments to TCRs); and some cytokine genes that encode key regulators of T- and B-cell responses.

Download Full-text

Metabolic pathway inference using multi-label classification with rich pathway features

10.1101/2020.02.02.919944 ◽

2020 ◽

Author(s):

Abdur Rahman M. A. Basher ◽

Ryan J. McLaughlin ◽

Steven J. Hallam

Keyword(s):

Metabolic Networks ◽

Genomic Sequence ◽

Sequence Information ◽

Prediction Problem ◽

Biological Organization ◽

Pathway Prediction ◽

Rule Sets ◽

The Individual ◽

Different Levels ◽

Genomic Sequence Information

AbstractMetabolic inference from genomic sequence information is a necessary step in determining the capacity of cells to make a living in the world at different levels of biological organization. A common method for determining the metabolic potential encoded in genomes is to map conceptually translated open reading frames onto a database containing known product descriptions. Such gene-centric methods are limited in their capacity to predict pathway presence or absence and do not support standardized rule-sets for automated and reproducible research. Pathway-centric methods based on defined rule sets or machine learning algorithms provide an adjunct or alternative inference method that supports hypothesis generation and testing of metabaolic relationships within and between cells. Here, we present mlLGPR, multi-label based on logistic regression for pathway prediction, a software package that uses supervised multi-label classification and rich pathway features to infer metabolic networks at the individual, population and community levels of organization. We evaluated mlLGPR performance using a corpora of 12 experimental datasets manifesting diverse multi-label properties, including manually curated organismal genomes, synthetic microbial communities and low complexity microbial communities. Resulting performance metrics equaled or exceeded previous reports for organismal genomes and identify specific challenges associated with features engineering and training data for community-level metabolic inference.Author summaryPredicting the complex series of metabolic interactions e.g. pathways, within and between cells from genomic sequence information is an integral problem in biology linking genotype to phenotype. This is a prerequisite to both understanding fundamental life processes and ultimately engineering these processes for specific biotechnological applications. A pathway prediction problem exists because we have limited knowledge of the reactions and pathways operating in cells even in model organisms like Esherichia coli where the majority of protein functions are determined. To improve pathway prediction outcomes for genomes at different levels of complexity and completion we have developed mlLGPR, multi-label based on logistic regression for pathway prediction, a scalable open source software package that uses supervised multi-label classification and rich pathway features to infer metabolic networks. We benchmark mlLGPR performance against other inference methods providing a code base and metrics for continued application of machine learning methods to the pathway prediction problem at the individual, population and community levels of biological organization.

Download Full-text

Selected genome regions for fruit weight and shelf life in tomato RILs discernible by markers based on genomic sequence information

Breeding Science ◽

10.1270/jsbbs.19015 ◽

2019 ◽

Vol 69 (3) ◽

pp. 447-454 ◽

Cited By ~ 4

Author(s):

Vladimir Cambiaso ◽

Magalí Diana Gimenez ◽

Javier Hernán Pereira da Costa ◽

Dana Valeria Vazquez ◽

Liliana Amelia Picardi ◽

...

Keyword(s):

Shelf Life ◽

Genomic Sequence ◽

Fruit Weight ◽

Sequence Information ◽

Genomic Sequence Information

Download Full-text

Using Genomic Sequence Information to Increase Conservation and Sustainable Use of Crop Diversity and Benefit-Sharing

Biopreservation and Biobanking ◽

10.1089/bio.2018.0043 ◽

2018 ◽

Vol 16 (5) ◽

pp. 368-376 ◽

Cited By ~ 5

Author(s):

Michael Halewood ◽

Isabel Lopez Noriega ◽

Dave Ellis ◽

Carolina Roa ◽

Mathieu Rouard ◽

...

Keyword(s):

Genomic Sequence ◽

Sustainable Use ◽

Crop Diversity ◽

Benefit Sharing ◽

Sequence Information ◽

Genomic Sequence Information

Download Full-text

A regulatory-sequence classifier with a neural network for genomic information processing

10.1101/355974 ◽

2018 ◽

Cited By ~ 1

Author(s):

Koh Onimaru ◽

Osamu Nishimura ◽

Shigehiro Kuraku

Keyword(s):

Deep Learning ◽

Genomic Sequence ◽

Regulatory Sequence ◽

Sequence Information ◽

Regulatory Sequences ◽

Genomic Information ◽

Protein Coding ◽

Coding Regions ◽

Gene Regulatory ◽

Genomic Sequence Information

Genotype-phenotype mapping is one of the fundamental challenges in biology. The difficulties stem in part from the large amount of sequence information and the puzzling genomic code, particularly of non-protein-coding regions such as gene regulatory sequences. However, recently deep learning–based methods were shown to have the ability to decipher the gene regulatory code of genomes. Still, prediction accuracy needs improvement. Here, we report the design of convolution layers that efficiently process genomic sequence information and developed a software, DeepGMAP, to train and compare different deep learning-based models (https://github.com/koonimaru/DeepGMAP). First, we demonstrate that our convolution layers, termed forward- and reverse-sequence scan (FRSS) layers, enhance the power to predict gene regulatory sequences. Second, we assessed previous studies and identified problems associated with data structures that caused overfitting. Finally, we introduce several visualization methods that provide insights into the syntax of gene regulatory sequences.

Download Full-text

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

10.3403/30346825u ◽

2017 ◽

Cited By ~ 1

Keyword(s):

Electronic Health Records ◽

Health Informatics ◽

Genomic Sequence ◽

Sequence Information ◽

Health Records ◽

Electronic Health ◽

Data Elements ◽

Clinical Genomic ◽

Genomic Sequence Information

Download Full-text

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

10.3403/30346825 ◽

2017 ◽

Keyword(s):

Electronic Health Records ◽

Health Informatics ◽

Genomic Sequence ◽

Sequence Information ◽

Health Records ◽

Electronic Health ◽

Data Elements ◽

Clinical Genomic ◽

Genomic Sequence Information

Download Full-text

The selenium content of SEPP1 versus selenium requirements in vertebrates

10.7287/peerj.preprints.784v1 ◽

2015 ◽

Author(s):

Sam Penglase ◽

Kristin Hamre ◽

Ståle Ellingsen

Keyword(s):

Genomic Sequence ◽

Circulatory System ◽

The Body ◽

Selenoprotein P ◽

Sequence Information ◽

Selenium Content ◽

Unique Case ◽

Nutrient Requirement ◽

Genetically Determined ◽

Genomic Sequence Information

Selenoprotein P (SEPP1) distributes selenium (Se) throughout the body via the circulatory system. The Se content of SEPP1 varies from 7 to 18 Se atoms depending on the species, but the reason for this variation remains unclear. Herein we provide evidence that vertebrate SEPP1 Sec content correlates positively with Se requirements (R2=0.88). As the Se content of full length SEPP1 is genetically determined, this presents a unique case where a nutrient requirement can be predicted based on genomic sequence information.

Download Full-text

The selenium content of SEPP1 versus selenium requirements in vertebrates

10.7287/peerj.preprints.784 ◽

2015 ◽

Author(s):

Sam Penglase ◽

Kristin Hamre ◽

Ståle Ellingsen

Keyword(s):

Genomic Sequence ◽

Circulatory System ◽

The Body ◽

Selenoprotein P ◽

Sequence Information ◽

Selenium Content ◽

Unique Case ◽

Nutrient Requirement ◽

Genetically Determined ◽

Genomic Sequence Information

Selenoprotein P (SEPP1) distributes selenium (Se) throughout the body via the circulatory system. The Se content of SEPP1 varies from 7 to 18 Se atoms depending on the species, but the reason for this variation remains unclear. Herein we provide evidence that vertebrate SEPP1 Sec content correlates positively with Se requirements (R2=0.88). As the Se content of full length SEPP1 is genetically determined, this presents a unique case where a nutrient requirement can be predicted based on genomic sequence information.

Download Full-text

Research participants’ attitudes towards the confidentiality of genomic sequence information

European Journal of Human Genetics ◽

10.1038/ejhg.2013.276 ◽

2013 ◽

Vol 22 (8) ◽

pp. 964-968 ◽

Cited By ~ 22

Author(s):

Leila Jamal ◽

Julie C Sapp ◽

Katie Lewis ◽

Tatiane Yanes ◽

Flavia M Facio ◽

...

Keyword(s):

Genomic Sequence ◽

Sequence Information ◽

Research Participants ◽

Genomic Sequence Information

Download Full-text

genomic sequence information
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Immunogenetics in the rabbit.

Metabolic pathway inference using multi-label classification with rich pathway features

Selected genome regions for fruit weight and shelf life in tomato RILs discernible by markers based on genomic sequence information

Using Genomic Sequence Information to Increase Conservation and Sustainable Use of Crop Diversity and Benefit-Sharing

A regulatory-sequence classifier with a neural network for genomic information processing

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

The selenium content of SEPP1 versus selenium requirements in vertebrates

The selenium content of SEPP1 versus selenium requirements in vertebrates

Research participants’ attitudes towards the confidentiality of genomic sequence information

Export Citation Format

genomic sequence informationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Immunogenetics in the rabbit.

Metabolic pathway inference using multi-label classification with rich pathway features

Selected genome regions for fruit weight and shelf life in tomato RILs discernible by markers based on genomic sequence information

Using Genomic Sequence Information to Increase Conservation and Sustainable Use of Crop Diversity and Benefit-Sharing

A regulatory-sequence classifier with a neural network for genomic information processing

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

Health informatics. Data elements and their metadata for describing structured clinical genomic sequence information in electronic health records

The selenium content of SEPP1 versus selenium requirements in vertebrates

The selenium content of SEPP1 versus selenium requirements in vertebrates

Research participants’ attitudes towards the confidentiality of genomic sequence information

genomic sequence information
Recently Published Documents