Construction and characterization of two BAC libraries from Brachypodium distachyon, a new model for grass genomics

Naxin Huo; Yong Q. Gu; Gerard R. Lazo; John P. Vogel; Devin Coleman-Derr; Ming-Cheng Luo; Roger Thilmony; David F. Garvin; Olin D. Anderson

doi:10.1139/g06-087

Construction and characterization of two BAC libraries from Brachypodium distachyon, a new model for grass genomics

Genome ◽

10.1139/g06-087 ◽

2006 ◽

Vol 49 (9) ◽

pp. 1099-1108 ◽

Cited By ~ 47

Author(s):

Naxin Huo ◽

Yong Q. Gu ◽

Gerard R. Lazo ◽

John P. Vogel ◽

Devin Coleman-Derr ◽

...

Keyword(s):

Genomic Sequence ◽

Brachypodium Distachyon ◽

Repetitive Sequences ◽

Average Insert Size ◽

Insert Size ◽

Genome Coverage ◽

Wheat Ests ◽

A Genome ◽

Bac Libraries ◽

Brachypodium Genome

Brachypodium is well suited as a model system for temperate grasses because of its compact genome and a range of biological features. In an effort to develop resources for genome research in this emerging model species, we constructed 2 bacterial artificial chromosome (BAC) libraries from an inbred diploid Brachypodium distachyon line, Bd21, using restriction enzymes HindIII and BamHI. A total of 73 728 clones (36 864 per BAC library) were picked and arrayed in 192 384-well plates. The average insert size for the BamHI and HindIII libraries is estimated to be 100 and 105 kb, respectively, and inserts of chloroplast origin account for 4.4% and 2.4%, respectively. The libraries individually represent 9.4- and 9.9-fold haploid genome equivalents with combined 19.3-fold genome coverage, based on a genome size of 355 Mb reported for the diploid Brachypodium, implying a 99.99% probability that any given specific sequence will be present in each library. Hybridization of the libraries with 8 starch biosynthesis genes was used to empirically evaluate this theoretical genome coverage; the frequency at which these genes were present in the library clones gave an estimated coverage of 11.6- and 19.6-fold genome equivalents. To obtain a first view of the sequence composition of the Brachypodium genome, 2185 BAC end sequences (BES) representing 1.3 Mb of random genomic sequence were compared with the NCBI GenBank database and the GIRI repeat database. Using a cutoff expectation value of E < 10−10, only 3.3% of the BESs showed similarity to repetitive sequences in the existing database, whereas 40.0% had matches to the sequences in the EST database, suggesting that a considerable portion of the Brachypodium genome is likely transcribed. When the BESs were compared with individual EST databases, more matches hit wheat than maize, although their EST collections are of a similar size, further supporting the close relationship between Brachypodium and the Triticeae. Moreover, 122 BESs have significant matches to wheat ESTs mapped to individual chromosome bin positions. These BACs represent colinear regions containing the mapped wheat ESTs and would be useful in identifying additional markers for specific wheat chromosome regions.

Download Full-text

Construction and characterization of a bacterial artificial chromosome (BAC) library for the A genome of wheat

Genome ◽

10.1139/g99-076 ◽

1999 ◽

Vol 42 (6) ◽

pp. 1176-1182 ◽

Cited By ~ 66

Author(s):

D Lijavetzky ◽

G Muzzi ◽

T Wicker ◽

B Keller ◽

R Wing ◽

...

Keyword(s):

Bacterial Artificial Chromosome ◽

Repetitive Sequences ◽

Bac Library ◽

Artificial Chromosome ◽

High Density ◽

Triticum Monococcum ◽

Average Insert Size ◽

Insert Size ◽

High Molecular Weight Dna ◽

A Genome

A genomic bacterial artificial chromosome (BAC) library of the A genome of wheat has been constructed. Triticum monococcum accession DV92 was selected for this purpose because it is a cultivated diploid wheat and one of the parental lines used in the construction of a saturated genetic map. Leaves from this accession were used to isolate high-molecular-weight DNA from nuclei. This DNA was partially digested with restriction enzyme Hind III, subjected to double size selection, electroeluted and cloned into the pINDIGO451 BAC vector. The library consists of 276 480 clones with an average insert size of 115 kb. Excluding the 1.33% of empty clones and 0.14% of clones with chloroplast DNA, the coverage of this library is 5.6 genome equivalents. With this genome coverage the probability of having any DNA sequence represented in this library is higher than 99.6%. Clones were sorted in 720 384-well plates and blotted onto 15 high-density filters. High-density filters were screened with several single or low-copy clones and five positive BAC clones were selected for further analysis. Since most of the T. monococcum BAC ends included repetitive sequences, a modification was introduced into the classical end-isolation procedure to select low copy sequences for chromosome walking.Key words: bacterial artificial chromosome, BAC library, Triticum monococcum, wheat.

Download Full-text

Integrative Physical and Genetic Mapping of the Chickpea Genome for Fine Mapping and Analysis of Agronomic Traits

10.32747/2010.7592122.bard ◽

2010 ◽

Author(s):

Hongbin Zhang ◽

Shahal Abbo ◽

Weidong Chen ◽

Amir Sherman ◽

Dani Shtienberg ◽

...

Keyword(s):

Ssr Markers ◽

Dna Markers ◽

Agronomic Traits ◽

Physical Map ◽

Ascochyta Blight ◽

Genetic Maps ◽

Average Insert Size ◽

Marker Assisted Breeding ◽

Insert Size ◽

A Genome

Chickpea is the third most important pulse crop in the world and ranks first in the Middle East; however, it has been subjected to only limited research in modern genomics. In the first period of this project (US-3034-98R) we constructed two large-insert BAC and BIBAC libraries, developed 325 SSR markers and mapped QTLs controlling ascochyta blight resistance (ABR) and days to first flower (DTF). Nevertheless, the utilities of these tools and results in gene discovery and marker-assisted breeding are limited due to the absence of an essential platform. The goals of this period of the project were to use the resources and tools developed in the first period of the project to develop a BAC/BIBAC physical map for chickpea and using it to identify BAC/BIBACcontigs containing agronomic genes of interest, with an emphasis on ABR and DTF, and develop DNA markers suitable for marker-assisted breeding. Toward these goals, we proposed: 1) Fingerprint ~50,000 (10x) BACs from the BAC and BIBAC libraries, assemble the clones into a genome-wide BAC/BIBAC physical map, and integrate the BAC/BIBAC map with the existing chickpea genetic maps (Zhang, USA); 2) fine-map ABR and DTFQTLs and enhance molecular tools for chickpea genetics and breeding (Shahal, Sherman and DaniShtienberg, Israel; Chen and Muehlbauer; USA); and 3) integrate the BAC/BIBAC map with the existing chickpea genetic maps (Sherman, Israel; Zhang and Chen, USA). For these objectives, a total of $460,000 was requested originally, but a total of $300,000 was awarded to the project. We first developed two new BAC and BIBAC libraries, Chickpea-CME and Chickpea- CHV. The chickpea-CMEBAC library contains 22,272 clones, with an average insert size of 130 kb and equivalent to 4.0 fold of the chickpea genome. The chickpea-CHVBIBAC library contains 38,400 clones, with an average insert size of 140 kb and equivalent to 7.5 fold of the chickpea genome. The two new libraries (11.5 x), along with the two BAC (Chickpea-CHI) and BIBAC (Chickpea-CBV) libraries (7.1 x) constructed in the first period of the project, provide libraries essential for chickpea genome physical mapping and many other genomics researches. Using these four libraries we then developed the proposed BAC/BIBAC physical map of chickpea. A total of 67,584 clones were fingerprinted, and 64,211 (~11.6 x) of the fingerprints validated and used in the physical map assembly. The physical map consists of 1,945 BAC/BIBACcontigs, with each containing an average of 39.2 clones and having an average physical length of 559 kb. The contigs collectively span ~1,088 Mb, being 1.49 fold of the 740- Mb chickpea genome. Third, we integrated the physical map with the two existing chickpea genetic maps using a total of 172 (124 + 48) SSR markers. Fourth, we identified tightly linked markers for ABR-QTL1, increased marker density at ABR-QTL2 and studied the genetic basis of resistance to pod abortion, a major problem in the east Mediterranean, caused by heat stress. Finally, we, using the integrated map, isolated the BAC/BIBACcontigs containing or closely linked to QTL4.1, QTL4.2 and QTL8 for ABR and QTL8 for DTF. The integrated BAC/BIBAC map resulted from the project will provide a powerful platform and tools essential for many aspects of advanced genomics and genetics research of this crop and related species. These includes, but are not limited to, targeted development of SNP, InDel and SSR markers, high-resolution mapping of the chickpea genome and its agronomic genes and QTLs, sequencing and decoding of all genes of the genome using the next-generation sequencing technology, and comparative genome analysis of chickpea versus other legumes. The DNA markers and BAC/BIBACcontigs containing or closely linked to ABR and DTF provide essential tools to develop SSR and SNP markers well-suited for marker-assisted breeding of the traits and clone their corresponding genes. The development of the tools and knowledge will thus promote enhanced and substantial genetic improvement of the crop and related legumes.

Download Full-text

A Bacterial Artificial Chromosome Library of Lotus japonicus Constructed in an Agrobacterium tumefaciens-Transformable Vector

Molecular Plant-Microbe Interactions ◽

10.1094/mpmi.2001.14.3.422 ◽

2001 ◽

Vol 14 (3) ◽

pp. 422-425 ◽

Cited By ~ 22

Author(s):

Artem E. Men ◽

Khalid Meksem ◽

My Abdelmajid Kassem ◽

Dasharath Lohar ◽

Jiri Stiller ◽

...

Keyword(s):

Escherichia Coli ◽

Agrobacterium Tumefaciens ◽

Plant Transformation ◽

Lotus Japonicus ◽

Bac Library ◽

Artificial Chromosome ◽

Average Insert Size ◽

Insert Size ◽

Model Legume ◽

Genome Coverage

We constructed a BAC library of the model legume Lotus japonicus with a 6-to 7-fold genome coverage. We used vector PCLD04541, which allows direct plant transformation by BACs. The average insert size is 94 kb. Clones were stable in Escherichia coli and Agrobacterium tumefaciens.

Download Full-text

Construction of Papaya Male and Female BAC Libraries and Application in Physical Mapping of the Sex Chromosomes

Journal of Biomedicine and Biotechnology ◽

10.1155/2011/929472 ◽

2011 ◽

Vol 2011 ◽

pp. 1-7 ◽

Cited By ~ 12

Author(s):

Andrea R. Gschwend ◽

Qingyi Yu ◽

Paul Moore ◽

Christopher Saski ◽

Cuixia Chen ◽

...

Keyword(s):

Y Chromosome ◽

Restriction Enzyme ◽

Sex Chromosomes ◽

Physical Map ◽

Bac Library ◽

Specific Region ◽

Average Insert Size ◽

Insert Size ◽

Male And Female ◽

Bac Libraries

Papaya is a major fruit crop in the tropics and has recently evolved sex chromosomes. Towards sequencing the papaya sex chromosomes, two bacterial artificial chromosome (BAC) libraries were constructed from papaya male and female genomic DNA. The female BAC library was constructed using restriction enzymeBstY I and consists of 36,864 clones with an average insert size of 104 kb, providing 10.3x genome equivalents. The male BAC library was constructed using restriction enzymeEcoR I and consists of 55,296 clones with an average insert size of 101 kb, providing 15.0x genome equivalents. The male BAC library was used in constructing the physical map of the male-specific region of the male Y chromosome (MSY) and in filling gaps and extending the physical map of the hermaphrodite-specific region of the Yhchromosome (HSY) and the X chromosome physical map. The female BAC library was used to extend the X physical map gap. The MSY, HSY, and X physical maps offer a unique opportunity to study chromosomal rearrangements, Y chromosome degeneration, and dosage compensation of the papaya nascent sex chromosomes.

Download Full-text

Construction and characterization of two Citrus BAC libraries and identification of clones containing the phytoene synthase gene

Genome ◽

10.1139/g09-017 ◽

2009 ◽

Vol 52 (5) ◽

pp. 484-489 ◽

Cited By ~ 6

Author(s):

M. N.R. Baig ◽

An Yu ◽

Wenwu Guo ◽

Xiuxin Deng

Keyword(s):

Dna Sequences ◽

False Positive ◽

Bac Library ◽

Phytoene Synthase ◽

Haploid Genome ◽

Average Insert Size ◽

Insert Size ◽

Important Species ◽

Dna Contamination ◽

Bac Libraries

Two deep-coverage Bacterial Artificial Chromosome (BAC) libraries of Citrus sinensis (L.) Osbeck ‘Cara Cara’ navel orange and Citrus reticulata (L.) Blanco ‘Egan No. 1’ Ponkan mandarin, which belong to the two most important species of the Citrus genus, have been constructed and characterized to facilitate gene cloning and to analyze variety-specific genome composition. The C. sinensis BAC library consists of 36 000 clones with negligible false-positive clones and an estimated average insert size of 126 kb covering ~4.5 × 109 bp and thus providing an 11.8-fold coverage of haploid genome equivalents, whereas the C. reticulata library consists of 21 000 clones also with negligible false-positive clones and an estimated average of 120 kb covering ~2.5 × 109 bp representing a 6.6-fold coverage of haploid genome equivalents. Both libraries were evaluated for contamination with high-copy vector, empty pIndigoBAC536 vector, and organellar DNA sequences. Screening has been performed by Southern hybridization of BAC filters, which results in <0.5% chloroplast DNA contamination and no mitochondrial DNA contamination in both libraries. Eight and five positive clones harboring the gene encoding Phytoene synthase (Psy (EC 2.5.1.32)) were identified from the C. sinensis and C. reticulata libraries, respectively, using the filter hybridization procedure. These results suggest that the two BAC libraries are useful tools for the isolation of functional genes and advanced genomics research in the two important species C. sinensis and C. reticulata. Resources, high-density filters, individual clones, and whole libraries are available for public distribution and are accessible at the National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University.

Download Full-text

Construction of a Llama Bacterial Artificial Chromosome Library with Approximately 9-Fold Genome Equivalent Coverage

Journal of Biomedicine and Biotechnology ◽

10.1155/2012/371414 ◽

2012 ◽

Vol 2012 ◽

pp. 1-4 ◽

Cited By ~ 2

Author(s):

K. W. Airmet ◽

J. D. Hinckley ◽

L. T. Tree ◽

M. Moss ◽

S. Blumell ◽

...

Keyword(s):

Bacterial Artificial Chromosome ◽

Bac Library ◽

Artificial Chromosome ◽

The United States ◽

Modern Technology ◽

Average Insert Size ◽

Genome Equivalent ◽

Insert Size ◽

Genome Coverage ◽

Genomic Studies

The Ilama is an important agricultural livestock in much of South America. The llama is increasing in popularity in the United States as a companion animal. Little work has been done to improve llama production using modern technology. A paucity of information is available regarding the llama genome. We report the construction of a llama bacterial artificial chromosome (BAC) library of about 196,224 clones in the vector pECBAC1. Using flow cytometry and bovine, human, mouse, and chicken as controls, we determined the llama genome size to be2.4×109 bp. The average insert size of the library is 137.8 kb corresponding to approximately 9-fold genome coverage. Further studies are needed to further characterize the library and llama genome. We anticipate that this new library will help facilitate future genomic studies in the llama.

Download Full-text

BAC Library Development and Clone Characterization for Dormancy-Responsive DREB4A, DAM, and FT from Leafy Spurge (Euphorbia esula) Identifies Differential Splicing and Conserved Promoter Motifs

Weed Science ◽

10.1614/ws-d-12-00175.1 ◽

2013 ◽

Vol 61 (2) ◽

pp. 303-309 ◽

Cited By ~ 7

Author(s):

David P. Horvath ◽

David Kudrna ◽

Jayson Talag ◽

James V. Anderson ◽

Wun S. Chao ◽

...

Keyword(s):

Transcription Factor ◽

Sequence Data ◽

Bac Library ◽

Artificial Chromosome ◽

Phylogenetic Footprinting ◽

Leafy Spurge ◽

Average Insert Size ◽

Insert Size ◽

Perennial Weed ◽

Bac Libraries

We developed two leafy spurge bacterial artificial chromosome (BAC) libraries that together represent approximately 5× coverage of the leafy spurge genome. The BAC libraries have an average insert size of approximately 143 kb, and copies of the library and filters for hybridization-based screening are publicly available through the Arizona Genomics Institute. These libraries were used to clone full-length genomic copies of an AP2/ERF transcription factor of the A4 subfamily of DEHYDRATION-RESPONSIVE ELEMENT-BINDING PROTEINS (DREB) known to be differentially expressed in crown buds of leafy spurge during endodormancy, a DORMANCY ASSOCIATED MADS-BOX (DAM) gene, and several FLOWERING LOCUS T (FT) genes. Sequencing of these BAC clones revealed the presence of multiple FT genes in leafy spurge. Sequencing also provided evidence that two different DAM transcripts expressed in crown buds of leafy spurge during endo- and eco-dormancy result from alternate splicing of a single DAM gene. Sequence data from the FT promoters was used to identify several conserved elements previously recognized in Arabidopsis, as well as potential novel transcription factor binding sites that may regulate FT. These leafy spurge BAC libraries represent a new genomics-based tool that complements existing genomics resources for the study of plant growth and development in this model perennial weed. Furthermore, phylogenetic footprinting using genes identified with this resource demonstrate the usefulness of studying weedy species to further our general knowledge of agriculturally important genes.

Download Full-text

F-Box Genes in the Wheat Genome and Expression Profiling in Wheat at Different Developmental Stages

Genes ◽

10.3390/genes11101154 ◽

2020 ◽

Vol 11 (10) ◽

pp. 1154

Author(s):

Min Jeong Hong ◽

Jin-Baek Kim ◽

Yong Weon Seo ◽

Dae Yeon Kim

Keyword(s):

Developmental Stages ◽

Brachypodium Distachyon ◽

Triticum Aestivum L ◽

Wheat Genome ◽

Post Translational Modification ◽

Genome Wide ◽

A Genome ◽

High Sequence Homology ◽

Wide Scale

Genes of the F-box family play specific roles in protein degradation by post-translational modification in several biological processes, including flowering, the regulation of circadian rhythms, photomorphogenesis, seed development, leaf senescence, and hormone signaling. F-box genes have not been previously investigated on a genome-wide scale; however, the establishment of the wheat (Triticum aestivum L.) reference genome sequence enabled a genome-based examination of the F-box genes to be conducted in the present study. In total, 1796 F-box genes were detected in the wheat genome and classified into various subgroups based on their functional C-terminal domain. The F-box genes were distributed among 21 chromosomes and most showed high sequence homology with F-box genes located on the homoeologous chromosomes because of allohexaploidy in the wheat genome. Additionally, a synteny analysis of wheat F-box genes was conducted in rice and Brachypodium distachyon. Transcriptome analysis during various wheat developmental stages and expression analysis by quantitative real-time PCR revealed that some F-box genes were specifically expressed in the vegetative and/or seed developmental stages. A genome-based examination and classification of F-box genes provide an opportunity to elucidate the biological functions of F-box genes in wheat.

Download Full-text

Genome scaffolding with PE-contaminated mate-pair libraries

10.1101/025650 ◽

2015 ◽

Cited By ~ 1

Author(s):

Kristoffer Sahlin ◽

Rayan Chikhi ◽

Lars Arvestad

Keyword(s):

Source Code ◽

Linear Programs ◽

Read Pair ◽

Insert Size ◽

Essential Step ◽

Genome Scaffolding ◽

Mate Pair ◽

A Genome ◽

The Impact ◽

Adapter Sequence

Scaffolding is often an essential step in a genome assembly process,in which contigs are ordered and oriented using read pairs from a combination of paired-ends libraries and longer-range mate-pair libraries. Although a simple idea, scaffolding is unfortunately hard to get right in practice. One source of problem is so-called PE-contamination in mate-pair libraries, in which a non-negligible fraction of the read pairs get the wrong orientation and a much smaller insert size than what is expected. This contamination has been discussed in previous work on integrated scaffolders in end-to-end assemblers such as Allpaths-LG and MaSuRCA but the methods relies on the fact that the orientation is observable, \emph{e.g.}, by finding the junction adapter sequence in the reads. This is not always the case, making orientation and insert size of a read pair stochastic. Furthermore, work on modeling PE-contamination has so far been disregarded in stand-alone scaffolders and the effect that PE-contamination has on scaffolding quality has not been examined before. We have addressed PE-contamination in an update of our scaffolder BESST. We formulate the problem as an Integer Linear Program (ILP) and use characteristics of the problem, such as contig lengths and insert size, to efficiently solve the ILP using a linear amount (with respect to the number of contigs) of Linear Programs. Our results show significant improvement over both integrated and standalone scaffolders. The impact of modeling PE-contamination is quantified by comparison with the previous BESST model. We also show how other scaffolders are vulnerable to PE-contaminated libraries, resulting in increased number of misassemblies, more conservative scaffolding, and inflated assembly sizes. The model is implemented in BESST. Source code and usage instructions are found at https://github.com/ksahlin/BESST. BESST can also be downloaded using PyPI.

Download Full-text

A porcine brain-wide RNA editing landscape

10.21203/rs.3.rs-110949/v1 ◽

2020 ◽

Author(s):

Jinrong Huang ◽

Lin Lin ◽

Zhanying Dong ◽

Ling Yang ◽

Tianyu Zheng ◽

...

Keyword(s):

Rna Editing ◽

Repetitive Sequences ◽

Brain Regions ◽

Mammalian Brain ◽

Protein Coding ◽

Porcine Brain ◽

Coding Regions ◽

Pig Brain ◽

Genome Wide ◽

A Genome

Abstract Adenosine-to-inosine (A-to-I) RNA editing, catalyzed by ADAR enzymes, is an essential post-transcriptional modiﬁcation. Although hundreds of thousands of RNA editing sites have been reported in mammals, brain-wide analysis of the RNA editing in the mammalian brain remains rare. Here, a genome-wide RNA editing investigation is performed in 119 samples, representing 30 anatomically defined subregions in the pig brain. We identify a total of 682,037 A-to-I RNA editing sites of which 97% are not identified before. Within the pig brain, cerebellum and olfactory bulb are regions with most edited transcripts. The editing level of sites residing in protein-coding regions are similar across brain regions, whereas region-distinct editing is observed in repetitive sequences. Highly edited conserved recoding events in pig and human brain are found in neurotransmitter receptors, demonstrating the evolutionary importance of RNA editing in neurotransmission functions. The porcine brain-wide RNA landscape provides a rich resource to better understand the evolutionally importance of post-transcriptional RNA editing.

Download Full-text