Transcriptome and Genome Characterization Using Massively Parallel Paired End Tag (PET) Sequencing Analysis

2009 ◽  
pp. 41-60
Author(s):  
Chia-Lin Wei ◽  
Yijun Ruan
Circulation ◽  
2015 ◽  
Vol 132 (suppl_3) ◽  
Author(s):  
Nathan R Tucker ◽  
Jiangchuan Ye ◽  
Honghuang Lin ◽  
Michael A McLellan ◽  
Emelia J Benjamin ◽  
...  

Introduction: Genome-wide association studies have identified 14 independent loci for atrial fibrillation (AF). The 4q25 locus upstream of the left-right asymmetry gene PITX2 is, by far, the strongest association signal for AF. However, as with most GWAS loci, the functional variants are noncoding, presumed to be regulatory, and remain unknown. We therefore sought to rapidly identify the functional variants at an AF locus by combining high throughput sequencing and massively parallel reporter assays. Methods and Results: We sequenced a ~750kb region encompassing the PITX2 locus in 462 individuals with early-onset AF from the MGH AF Study and 464 referents from the Framingham Heart Study. The SNP most significantly associated with AF in our sequenced sample was rs2129983, which is 140kb from PITX2 (OR=2.43, P =8.9X10 -16 ). rs2129983 is approximately 1.7kb from the most significantly associated SNP in a prior AF GWAS, rs6817105 (r 2 =0.52). From the targeted sequencing analysis, we identified 262 SNVs with a MAF >0.5% within a genomic region bounded by SNPs with an r2 greater than 0.4 with the top variant. To identify functional variants, we then utilized a massively parallel reporter assay (MPRA) in order to measure enhancer activity at each SNP across the entire AF locus. In both HL-1 and C2C12 myoblasts, MPRA identified many distinct SNP regions with differential enhancer activity. Using AF-association status as a standard, we were able to identify a series of variants that have both differential activity in either cell line tested and also a high level of association (rs17042076, rs4469143). Mechanistically, these functional SNPs are predicted to alter transcription factor binding. Conclusions: We have comprehensively identified the AF-associated variation at 4q25 and determined which of these variants are functional through differential enhancer activity. Here, in addition to identifying the causative variation for AF at 4q25, we provide a generalizable pathway for translating this work to other loci, a method that could expedite the identification of causative genetic variants at other disease loci.


2019 ◽  
Vol 75 (1) ◽  
pp. 29-38 ◽  
Author(s):  
John R Lozada ◽  
Felipe C Geyer ◽  
Pier Selenica ◽  
David Brown ◽  
Barbara Alemar ◽  
...  

Antibiotics ◽  
2021 ◽  
Vol 10 (4) ◽  
pp. 383
Author(s):  
Gabriela N. Tenea ◽  
Clara Ortega

The genome characterization of the Lactiplantibacillus plantarum strain UTNGt2, isolated from wild copoazu or white cacao (Theobroma grandiflorum), is described. A total of 31 contigs is assembled with a total length of 3,264,448 bases, with all contigs matching the core genome of different groups in the database. The genome size is 3,540,752 bases with GC content of 44.53% and the genome repeat sequences constitute around 457,386 bases of the assembly. The UTNGt2 matches the Lactiplantibacillus plantarum genome with 99% identity. The genome contains 3115 genes, 3052 protein-coding genes, assigned with the EggNOG database. On the basis of the results, 745 proteins are classified with an unknown function, from which 128 proteins have no match in the BLASTN database. It also contains 57 tRNAs, 5 copies of 5S rRNA, and 1 copy of tmRNA. Based on gene prediction and annotation results, 9.4% of proteins are involved in carbohydrate transport and metabolism and 8.46% in transcription, 2.36% are responsible for defense mechanisms, 0.5% are responsible for the biosynthesis of secondary metabolites, transport, and catabolism, while 25.11% have an unknown function. The genome revealed the presence of genes involved in riboflavin and folate production, the presence of CRISPR/Cas genes, phage sequences, the absence of acquired antibiotics resistance genes, virulence, and pathogenic factors, suggesting that UTNGt2 is a safe strain. Its highly antimicrobial capacity is related to the presence of two bacteriocin clusters (class IIc) of the sactipeptide class (contig 4) and plantaricin E class (contig 22), as detected by the BAGEL 4 webserver. Several RiPP-like peptides (non-bactericidal ribosomally produced and post-translationally modified peptides), polyketides (PKs), and terpenes were predicted. Whole-genome sequencing analysis revealed that the UTNGt2 strain has diverse bacteriocins with a high inhibitory capacity, thus it is a bacteriocinogenic strain. Considering the safety profile, UTNGt2 is a nonpathogenic, nonvirulent strain with valuable biotechnological traits and can be further exploited for its probiotic and antimicrobial potential in the food industry or as a potential producer strain of antimicrobial peptides as an alternative to conventional antibiotics.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Hiroaki Nakanishi ◽  
Katsumi Yoneyama ◽  
Masaaki Hara ◽  
Aya Takada ◽  
Kazuyuki Saito

AbstractWe developed a method that can detect each animal species of origin for crude drugs derived from multiple animal species based on massively parallel sequencing analysis of mitochondrial genes. The crude drugs derived from animals investigated in this study were Cervi Parvum Cornu and Trogopterorum feces, which are derived from a mix of different animal species, two chopped cicada sloughs, and two commercial Kampo drugs. The mitochondrial 12S rRNA, 16S rRNA, and cytochrome oxidase subunit I gene regions were amplified and sequenced using MiSeq. The ratios of haplotype to total number of sequences reads were calculated after sequence extraction and trimming. Haplotypes that exceeded the threshold were defined as positive haplotypes, which were compared with all available sequences using BLAST. In the Cervi Parvum Cornu and Trogopterorum feces samples, the haplotype ratios corresponded roughly to the mixture ratios, although there was a slight difference from mixture ratios depending on the gene examined. This method could also roughly estimate the compositions of chopped cicada sloughs and Kampo drugs. This analysis, whereby the sequences of several genes are elucidated, is better for identifying the included animal species. This method should be useful for quality control of crude drugs and Kampo drugs.


Sign in / Sign up

Export Citation Format

Share Document