New synthetic-diploid benchmark for accurate variant calling evaluation

bioRxiv ◽

10.1101/223297 ◽

2017 ◽

Cited By ~ 9

Author(s):

Heng Li ◽

Jonathan M Bloom ◽

Yossi Farjoun ◽

Mark Fleharty ◽

Laura Gauthier ◽

...

Keyword(s):

Cell Lines ◽

Human Cell ◽

Error Rate ◽

De Novo ◽

Variant Calling ◽

Benchmark Dataset ◽

Whole Genome ◽

Human Cell Lines ◽

Short Read ◽

Benchmark Datasets

Constructed from the consensus of multiple variant callers based on short-read data, existing benchmark datasets for evaluating variant calling accuracy are biased toward easy regions accessible by known algorithms. We derived a new benchmark dataset from the de novo PacBio assemblies of two human cell lines that are homozygous across the whole genome. This benchmark provides a more accurate and less biased estimate of the error rate of small variant calls in a realistic context.


A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines

10.1101/2021.04.21.440736 ◽

2021 ◽

Author(s):

Ying Chen ◽

Nadia M. Davidson ◽

Yuk Kei Wan ◽

Harshil Patel ◽

Fei Yao ◽

...

Keyword(s):

RNA Sequencing ◽

Cell Lines ◽

Human Cell ◽

Average Length ◽

Read Length ◽

Human Cell Lines ◽

Alternative Promoters ◽

cDNA Sequencing ◽

Short Read ◽

Long Read

The human genome contains more than 200,000 gene isoforms. However, different isoforms can be highly similar, and with an average length of 1.5 kb they remain difficult to study with short read sequencing. To systematically evaluate the ability to study the transcriptome at the resolution of individual isoforms, we profiled 5 human cell lines with short read cDNA sequencing and with Nanopore long read direct RNA, amplification-free direct cDNA, and PCR-cDNA sequencing. The long read protocols showed a high level of consistency, with amplification-free RNA and cDNA sequencing being most similar. While short and long reads generated comparable gene expression estimates, they differed substantially for individual isoforms. We find that increased read length improves read-to-transcript assignment, identifies interactions between alternative promoters and splicing, enables the discovery of novel transcripts from repetitive regions, facilitates the quantification of full-length fusion isoforms, and enables the simultaneous profiling of m6A RNA modifications when RNA is sequenced directly. Our study demonstrates the advantage of long read RNA sequencing and provides a comprehensive resource that will enable the development and benchmarking of computational methods for profiling complex transcriptional events at isoform-level resolution.


Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines

DNA Research ◽

10.1093/dnares/dsw048 ◽

2016 ◽

pp. dsw048 ◽

Cited By ~ 3

Author(s):

Davide Carnevali ◽

Anastasia Conti ◽

Matteo Pellegrini ◽

Giorgio Dieci

Keyword(s):

Cell Lines ◽

Expression Analysis ◽

Human Cell ◽

Whole Genome ◽

Human Cell Lines ◽

Repeat Elements ◽

Genome Expression ◽

Whole Genome Expression


De Novo Assembly of Two Swedish Genomes Reveals Missing Segments from the Human GRCh38 Reference and Improves Variant Calling of Population-Scale Sequencing Data

Genes ◽

10.3390/genes9100486 ◽

2018 ◽

Vol 9 (10) ◽

pp. 486 ◽

Cited By ~ 22

Author(s):

Adam Ameur ◽

Huiwen Che ◽

Marcel Martin ◽

Ignas Bunikis ◽

Johan Dahlberg ◽

...

Keyword(s):

Genome Sequencing ◽

De Novo Assembly ◽

De Novo ◽

Variant Calling ◽

Whole Genome Sequencing Data ◽

Personal Genome ◽

Whole Genome ◽

Sequencing Data ◽

Short Read ◽

Population Scale

The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. However, recent studies have suggested that GRCh38 may be incomplete and give a suboptimal representation of specific population groups. Here, we performed a de novo assembly of two Swedish genomes that revealed over 10 Mb of sequences absent from the human GRCh38 reference in each individual. Around 6 Mb of these novel sequences (NS) are shared with a Chinese personal genome. The NS are highly repetitive, have an elevated GC-content, and are primarily located in centromeric or telomeric regions. Up to 1 Mb of NS can be assigned to chromosome Y, and large segments are also missing from GRCh38 at chromosomes 14, 17, and 21. Inclusion of NS into the GRCh38 reference radically improves the alignment and variant calling from short-read whole-genome sequencing data at several genomic loci. A re-analysis of a Swedish population-scale sequencing project yields > 75,000 putative novel single nucleotide variants (SNVs) and removes > 10,000 false positive SNV calls per individual, some of which are located in protein coding regions. Our results highlight that the GRCh38 reference is not yet complete and demonstrate that personal genome assemblies from local populations can improve the analysis of short-read whole-genome sequencing data.


The influence of low dose Bisphenol A on whole genome DNA methylation and chromatin compaction in different human cell lines

Toxicology in Vitro ◽

10.1016/j.tiv.2019.03.010 ◽

2019 ◽

Vol 58 ◽

pp. 26-34 ◽

Cited By ~ 2

Author(s):

I.O. Suchkova ◽

L.K. Sasina ◽

N.I. Dergacheva ◽

G.A. Sofronov ◽

E.L. Patkin

Keyword(s):

DNA Methylation ◽

Bisphenol A ◽

Cell Lines ◽

Human Cell ◽

Low Dose ◽

Whole Genome ◽

Human Cell Lines ◽

Chromatin Compaction


Investigations of the influence of microgravity on the state of human cell lines

Kosmìčna nauka ì tehnologìâ ◽

10.15407/knit2004.05.226 ◽

2004 ◽

Vol 10 (5-6) ◽

pp. 226-228

Author(s):

L.M. Nosach ◽


O.Yu. Povnitsa ◽

V.L. Zhovnovata ◽


...

Keyword(s):

Cell Lines ◽

Human Cell ◽

The State ◽

Human Cell Lines


Analysis of multiple transcription factor cistromes in human cell lines

Signaling Pathways Project Datasets ◽

10.1621/wdndilykwm ◽

2019 ◽

Author(s):

J Gertz

Keyword(s):

Transcription Factor ◽

Cell Lines ◽

Human Cell ◽

Human Cell Lines ◽

Multiple Transcription Factor


Faculty Opinions recommendation of Comprehensive sampling of gene expression in human cell lines with massively parallel signature sequencing.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1012584.188502 ◽

2003 ◽

Author(s):

Vishvanath Nene

Keyword(s):

Gene Expression ◽

Cell Lines ◽

Human Cell ◽

Massively Parallel Signature Sequencing ◽

Massively Parallel ◽

Human Cell Lines ◽

Parallel Signature Sequencing


Faculty Opinions recommendation of Ewing sarcoma fusion protein EWSR1/FLI1 interacts with EWSR1 leading to mitotic defects in zebrafish embryos and human cell lines.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1160530.620894 ◽

2009 ◽

Author(s):

Stephen Lessnick

Keyword(s):

Fusion Protein ◽

Cell Lines ◽

Human Cell ◽

Ewing Sarcoma ◽

Zebrafish Embryos ◽

Human Cell Lines


New inhibitors of sepiapterin reductase. Lack of an effect of intracellular tetrahydrobiopterin depletion upon in vitro proliferation of two human cell lines.

Journal of Biological Chemistry ◽

10.1016/s0021-9258(18)42807-4 ◽

1992 ◽

Vol 267 (8) ◽

pp. 5599-5607

Author(s):

G.K. Smith ◽

D.S. Duch ◽

M.P. Edelstein ◽

E.C. Bigham

Keyword(s):

Cell Lines ◽

Human Cell ◽

Human Cell Lines ◽

In Vitro Proliferation ◽

Sepiapterin Reductase


Dicalcin suppresses in vitro trophoblast attachment in human cell lines

Biochemical and Biophysical Research Communications ◽

10.1016/j.bbrc.2021.07.030 ◽

2021 ◽

Vol 570 ◽

pp. 206-213

Author(s):

Ryohei Saito ◽

Hiromasa Satoh ◽

Kayo Aoba ◽

Hajime Hirasawa ◽

Naofumi Miwa

Keyword(s):

Cell Lines ◽

Human Cell ◽

Human Cell Lines
