MAGIC Database and Interfaces: An Integrated Package for Gene Discovery and Expression

Marie-Michèle Cordonnier-Pratt; Chun Liang; Haiming Wang; Dmitri S. Kolychev; Feng Sun; Robert Freeman; Robert Sullivan; Lee H. Pratt

doi:10.1002/cfg.399

MAGIC Database and Interfaces: An Integrated Package for Gene Discovery and Expression

Comparative and Functional Genomics ◽

10.1002/cfg.399 ◽

2004 ◽

Vol 5 (3) ◽

pp. 268-275 ◽

Cited By ~ 12

Author(s):

Marie-Michèle Cordonnier-Pratt ◽

Chun Liang ◽

Haiming Wang ◽

Dmitri S. Kolychev ◽

Feng Sun ◽

...

Keyword(s):

Dna Sequences ◽

Relational Databases ◽

Gene Discovery ◽

Biological Data ◽

Single Nucleotide Polymorphism Discovery ◽

Web Browser ◽

Polymorphism Discovery ◽

Est Clustering ◽

Database Administration ◽

Core Facilities

The rapidly increasing rate at which biological data is being produced requires a corresponding growth in relational databases and associated tools that can help laboratories contend with that data. With this need in mind, we describe here a Modular Approach to a Genomic, Integrated and Comprehensive (MAGIC) Database. This Oracle 9i database derives from an initial focus in our laboratory on gene discovery via production and analysis of expressed sequence tags (ESTs), and subsequently on gene expression as assessed by both EST clustering and microarrays. The MAGIC Gene Discovery portion of the database focuses on information derived from DNA sequences and on its biological relevance. In addition to MAGIC SEQ-LIMS, which is designed to support activities in the laboratory, it contains several additional subschemas. The latter include MAGIC Admin for database administration, MAGIC Sequence for sequence processing as well as sequence and clone attributes, MAGIC Cluster for the results of EST clustering, MAGIC Polymorphism in support of microsatellite and single-nucleotide-polymorphism discovery, and MAGIC Annotation for electronic annotation by BLAST and BLAT. The MAGIC Microarray portion is a MIAME-compliant database with two components at present. These are MAGIC Array-LIMS, which makes possible remote entry of all information into the database, and MAGIC Array Analysis, which provides data mining and visualization. Because all aspects of interaction with the MAGIC Database are via a web browser, it is ideally suited not only for individual research laboratories but also for core facilities that serve clients at any distance.

Download Full-text

Applying graph database technology for analyzing perturbed co-expression networks in cancer

Database ◽

10.1093/database/baaa110 ◽

2020 ◽

Vol 2020 ◽

Author(s):

Claire M Simpson ◽

Florian Gnad

Keyword(s):

Relational Databases ◽

Molecular Mechanisms ◽

Biological Data ◽

Database Management System ◽

Graph Database ◽

Graph Databases ◽

Graph Representations ◽

Rnaseq Data ◽

Database Technology ◽

Speed Accuracy

Abstract Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data.

Download Full-text

Single Nucleotide Polymorphism Discovery and Genetic Differentiation Analysis of Geese Bred in Poland, Using Genotyping-by-Sequencing (GBS)

Genes ◽

10.3390/genes12071074 ◽

2021 ◽

Vol 12 (7) ◽

pp. 1074

Author(s):

Joanna Grzegorczyk ◽

Artur Gurgul ◽

Maria Oczkowicz ◽

Tomasz Szmatoła ◽

Agnieszka Fornal ◽

...

Keyword(s):

Genotyping By Sequencing ◽

Read Depth ◽

Model Organisms ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphisms ◽

Single Nucleotide ◽

Polymorphism Discovery ◽

Genome Wide ◽

Plumage Development ◽

Edar Gene

Poland is the largest European producer of goose, while goose breeding has become an essential and still increasing branch of the poultry industry. The most frequently bred goose is the White Kołuda® breed, constituting 95% of the country’s population, whereas geese of regional varieties are bred in smaller, conservation flocks. However, a goose’s genetic diversity is inaccurately explored, mainly because the advantages of the most commonly used tools are strongly limited in non-model organisms. One of the most accurate used markers for population genetics is single nucleotide polymorphisms (SNP). A highly efficient strategy for genome-wide SNP detection is genotyping-by-sequencing (GBS), which has been already widely applied in many organisms. This study attempts to use GBS in 12 conservative goose breeds and the White Kołuda® breed maintained in Poland. The GBS method allowed for the detection of 3833 common raw SNPs. Nevertheless, after filtering for read depth and alleles characters, we obtained the final markers panel used for a differentiation analysis that comprised 791 SNPs. These variants were located within 11 different genes, and one of the most diversified variants was associated with the EDAR gene, which is especially interesting as it participates in the plumage development, which plays a crucial role in goose breeding.

Download Full-text

Single-nucleotide polymorphism discovery and validation in high-density SNP array for genetic analysis in European white oaks

Molecular Ecology Resources ◽

10.1111/1755-0998.12407 ◽

2015 ◽

Vol 15 (6) ◽

pp. 1446-1459 ◽

Cited By ~ 25

Author(s):

C. Lepoittevin ◽

C. Bodénès ◽

E. Chancerel ◽

L. Villate ◽

T. Lang ◽

...

Keyword(s):

Single Nucleotide Polymorphism ◽

Genetic Analysis ◽

Snp Array ◽

High Density ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Polymorphism Discovery

Download Full-text

Promoter region of the bovine growth hormone receptor gene: Single nucleotide polymorphism discovery in cattle and association with performance in Brangus bulls1

Journal of Animal Science ◽

10.2527/jas.2008-0990 ◽

2008 ◽

Vol 86 (12) ◽

pp. 3315-3323 ◽

Cited By ~ 15

Author(s):

A. J. Garrett ◽

G. Rincon ◽

J. F. Medrano ◽

M. A. Elzo ◽

G. A. Silver ◽

...

Keyword(s):

Promoter Region ◽

Growth Hormone Receptor ◽

Receptor Gene ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Polymorphism Discovery ◽

Growth Hormone Receptor Gene ◽

Gene Single Nucleotide Polymorphism ◽

Hormone Receptor Gene

Download Full-text

Single nucleotide polymorphism discovery of Pinus radiata with chromosome walking PCR method

Frontiers of Forestry in China ◽

10.1007/s11461-008-0055-2 ◽

2008 ◽

Vol 3 (3) ◽

pp. 352-356

Author(s):

Wei Li ◽

Hui Li ◽

Xiaoyang Chen ◽

Harry Wu

Keyword(s):

Single Nucleotide Polymorphism ◽

Pinus Radiata ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Chromosome Walking ◽

Single Nucleotide ◽

Polymorphism Discovery ◽

Pcr Method

Download Full-text

Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

BMC Genomics ◽

10.1186/1471-2164-10-559 ◽

2009 ◽

Vol 10 (1) ◽

Cited By ~ 97

Author(s):

Cecilia Castaño Sánchez ◽

Timothy PL Smith ◽

Ralph T Wiedmann ◽

Roger L Vallejo ◽

Mohamed Salem ◽

...

Keyword(s):

Single Nucleotide Polymorphism ◽

Rainbow Trout ◽

Deep Sequencing ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Reduced Representation ◽

Polymorphism Discovery ◽

Reduced Representation Library

Download Full-text

Fishing for SNPs: A Targeted Locus Approach for Single Nucleotide Polymorphism Discovery in Rainbow Trout

Transactions of the American Fisheries Society ◽

10.1577/t05-291.1 ◽

2006 ◽

Vol 135 (6) ◽

pp. 1698-1721 ◽

Cited By ~ 16

Author(s):

A. E. Sprowles ◽

M. R. Stephens ◽

N. W. Clipperton ◽

B. P. May

Keyword(s):

Single Nucleotide Polymorphism ◽

Rainbow Trout ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Polymorphism Discovery

Download Full-text

Genome-Wide Single Nucleotide Polymorphism Discovery and the Construction of a High-Density Genetic Map for Melon (Cucumis melo L.) Using Genotyping-by-Sequencing

Frontiers in Plant Science ◽

10.3389/fpls.2017.00125 ◽

2017 ◽

Vol 8 ◽

Cited By ~ 9

Author(s):

Che-Wei Chang ◽

Yu-Hua Wang ◽

Chih-Wei Tung

Keyword(s):

Single Nucleotide Polymorphism ◽

Genetic Map ◽

Cucumis Melo ◽

Genotyping By Sequencing ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Polymorphism Discovery ◽

Genome Wide ◽

Cucumis Melo L

Download Full-text

Single‐nucleotide polymorphism discovery and diversity in the model legume M edicago truncatula

Molecular Ecology Resources ◽

10.1111/1755-0998.12021 ◽

2012 ◽

Vol 13 (1) ◽

pp. 84-95 ◽

Cited By ~ 14

Author(s):

Karine Loridon ◽

Concetta Burgarella ◽

Nathalie Chantret ◽

Frédéric Martins ◽

Jérôme Gouzy ◽

...

Keyword(s):

Single Nucleotide Polymorphism ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Model Legume ◽

Polymorphism Discovery

Download Full-text

An Affinity Propagation-Based DNA Motif Discovery Algorithm

BioMed Research International ◽

10.1155/2015/853461 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10 ◽

Cited By ~ 5

Author(s):

Chunxiao Sun ◽

Hongwei Huo ◽

Qiang Yu ◽

Haitao Guo ◽

Zhigang Sun

Keyword(s):

Dna Sequences ◽

Motif Discovery ◽

Simulated Data ◽

Biological Data ◽

Affinity Propagation ◽

Local Optimum ◽

Data Sets ◽

Dna Motif ◽

Challenging Tasks ◽

Dna Motif Discovery

The planted(l,d)motif search (PMS) is one of the fundamental problems in bioinformatics, which plays an important role in locating transcription factor binding sites (TFBSs) in DNA sequences. Nowadays, identifying weak motifs and reducing the effect of local optimum are still important but challenging tasks for motif discovery. To solve the tasks, we propose a new algorithm, APMotif, which first applies the Affinity Propagation (AP) clustering in DNA sequences to produce informative and good candidate motifs and then employs Expectation Maximization (EM) refinement to obtain the optimal motifs from the candidate motifs. Experimental results both on simulated data sets and real biological data sets show that APMotif usually outperforms four other widely used algorithms in terms of high prediction accuracy.

Download Full-text