de novo assembling
Recently Published Documents


TOTAL DOCUMENTS

8
(FIVE YEARS 1)

H-INDEX

3
(FIVE YEARS 0)

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Ying Chen ◽  
Fan Nie ◽  
Shang-Qian Xie ◽  
Ying-Feng Zheng ◽  
Qi Dai ◽  
...  

AbstractLong nanopore reads are advantageous in de novo genome assembly. However, nanopore reads usually have broad error distribution and high-error-rate subsequences. Existing error correction tools cannot correct nanopore reads efficiently and effectively. Most methods trim high-error-rate subsequences during error correction, which reduces both the length of the reads and contiguity of the final assembly. Here, we develop an error correction, and de novo assembly tool designed to overcome complex errors in nanopore reads. We propose an adaptive read selection and two-step progressive method to quickly correct nanopore reads to high accuracy. We introduce a two-stage assembler to utilize the full length of nanopore reads. Our tool achieves superior performance in both error correction and de novo assembling nanopore reads. It requires only 8122 hours to assemble a 35X coverage human genome and achieves a 2.47-fold improvement in NG50. Furthermore, our assembly of the human WERI cell line shows an NG50 of 22 Mbp. The high-quality assembly of nanopore reads can significantly reduce false positives in structure variation detection.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
José Córdoba-Caballero ◽  
Pedro Seoane ◽  
Fernando M. Jabato ◽  
James R. Perkins ◽  
Manuel Manchado ◽  
...  

AbstractSenegalese sole is an economically important flatfish species in aquaculture and an attractive model to decipher the molecular mechanisms governing the severe transformations occurring during metamorphosis, where retinoic acid seems to play a key role in tissue remodeling. In this study, a robust sole transcriptome was envisaged by reducing the number of assembled libraries (27 out of 111 available), fine-tuning a new automated and reproducible set of workflows for de novo assembling based on several assemblers, and removing low confidence transcripts after mapping onto a sole female genome draft. From a total of 96 resulting assemblies, two “raw” transcriptomes, one containing only Illumina reads and another with Illumina and GS-FLX reads, were selected to provide SOLSEv5.0, the most informative transcriptome with low redundancy and devoid of most single-exon transcripts. It included both Illumina and GS-FLX reads and consisted of 51,348 transcripts of which 22,684 code for 17,429 different proteins described in databases, where 9527 were predicted as complete proteins. SOLSEv5.0 was used as reference for the study of retinoic acid (RA) signalling in sole larvae using drug treatments (DEAB, a RA synthesis blocker, and TTNPB, a RA-receptor agonist) for 24 and 48 h. Differential expression and functional interpretation were facilitated by an updated version of DEGenes Hunter. Acute exposure of both drugs triggered an intense, specific and transient response at 24 h but with hardly observable differences after 48 h at least in the DEAB treatments. Activation of RA signalling by TTNPB specifically increased the expression of genes in pathways related to RA degradation, retinol storage, carotenoid metabolism, homeostatic response and visual cycle, and also modified the expression of transcripts related to morphogenesis and collagen fibril organisation. In contrast, DEAB mainly decreased genes related to retinal production, impairing phototransduction signalling in the retina. A total of 755 transcripts mainly related to lipid metabolism, lipid transport and lipid homeostasis were altered in response to both treatments, indicating non-specific drug responses associated with intestinal absorption. These results indicate that a new assembling and transcript sieving were both necessary to provide a reliable transcriptome to identify the many aspects of RA action during sole development that are of relevance for sole aquaculture.


2017 ◽  
Vol 17 (S2) ◽  
Author(s):  
Alexey А. Moskalev ◽  
Anna V. Kudryavtseva ◽  
Alexander S. Graphodatsky ◽  
Violetta R. Beklemisheva ◽  
Natalya A. Serdyukova ◽  
...  

GigaScience ◽  
2017 ◽  
Vol 6 (7) ◽  
Author(s):  
Jian Gao ◽  
Qiye Li ◽  
Zongji Wang ◽  
Yang Zhou ◽  
Paolo Martelli ◽  
...  
Keyword(s):  
De Novo ◽  

2017 ◽  
Vol 2017 ◽  
pp. 1-14 ◽  
Author(s):  
Wenlan Tian ◽  
Dev Paudel ◽  
Wagner Vendrame ◽  
Jianping Wang

Jatropha (Jatropha curcasL.) is an economically important species with a great potential for biodiesel production. To enrich the jatropha genomic databases and resources for microgravity studies, we sequenced and annotated the transcriptome of jatropha and developed SSR and SNP markers from the transcriptome sequences. In total 1,714,433 raw reads with an average length of 441.2 nucleotides were generated. De novo assembling and clustering resulted in 115,611 uniquely assembled sequences (UASs) including 21,418 full-length cDNAs and 23,264 new jatropha transcript sequences. The whole set of UASs were fully annotated, out of which 59,903 (51.81%) were assigned with gene ontology (GO) term, 12,584 (10.88%) had orthologs in Eukaryotic Orthologous Groups (KOG), and 8,822 (7.63%) were mapped to 317 pathways in six different categories in Kyoto Encyclopedia of Genes and Genome (KEGG) database, and it contained 3,588 putative transcription factors. From the UASs, 9,798 SSRs were discovered with AG/CT as the most frequent (45.8%) SSR motif type. Further 38,693 SNPs were detected and 7,584 remained after filtering. This UAS set has enriched the current jatropha genomic databases and provided a large number of genetic markers, which can facilitate jatropha genetic improvement and many other genetic and biological studies.


2015 ◽  
Author(s):  
Nadia M Davidson ◽  
Ian J Majewski ◽  
Alicia Oshlack

Genomic instability is a hallmark of cancer and, as such, structural alterations and fusion genes are common events in the cancer landscape. RNA sequencing (RNA-Seq) is a powerful method for profiling cancers, but current methods for identifying fusion genes are optimized for short reads. JAFFA (https://code.google.com/p/jaffa-project/) is a sensitive fusion detection method that clearly out-performs other methods with reads of 100bp or greater. JAFFA compares a cancer transcriptome to the reference transcriptome, rather than the genome, where the cancer transcriptome is inferred using long reads directly or by de novo assembling short reads.


1999 ◽  
Vol 112 (22) ◽  
pp. 4101-4112 ◽  
Author(s):  
K. Ojima ◽  
Z.X. Lin ◽  
Z.Q. Zhang ◽  
T. Hijikata ◽  
S. Holtzer ◽  
...  

While over a dozen I-Z-I proteins are expressed in postmitotic myoblasts and myotubes it is unclear how, when, or where these first assemble into transitory I-Z-I bodies (thin filament/Z-band precursors) and, a short time later, into definitive I-Z-I bands. By double-staining the growth tips of transfected myotubes expressing (a) MYC-tagged s-alpha-actinins (MYC/s-alpha-actinins) or (b) green fluorescent protein-tagged titin cap (GFP/T-cap) with antibodies against MYC and I-Z-I band proteins, we found that the de novo assembly of I-Z-I bodies and their maturation into I-Z-I bands involved relatively concurrent, cooperative binding and reconfiguration of, at a minimum, 5 integral Z-band molecules. These included s-alpha-actinin, nebulin, titin, T-cap and alpha-actin. Resolution of the approximately 1.0 microm polarized alpha-actin/nebulin/tropomyosin/troponin thin filament complexes occurred subsequent to the maturation of Z-bands into a dense tetragonal configuration. Of particular interest is finding that mutant MYC/s-alpha-actinin peptides (a) lacking spectrin-like repeats 1–4, or consisting of spectrin-like repeats 1–4 only, as well as (b) mutants/fragments lacking titin or alpha-actin binding sites, were promptly and exclusively incorporated into de novo assembling I-Z-I bodies and definitive I-Z-I bands as was exogenous full length MYC/s-alpha-actinin or GFP/T-cap.


Sign in / Sign up

Export Citation Format

Share Document