BCIP: a gene-centered platform for identifying potential regulatory genes in breast cancer

Jiaqi Wu; Shuofeng Hu; Yaowen Chen; Zongcheng Li; Jian Zhang; Hanyu Yuan; Qiang Shi; Ningsheng Shao; Xiaomin Ying

doi:10.1038/srep45235

Scientific Reports ◽

2017 ◽

Vol 7 (1) ◽

Cited By ~ 8

Keyword(s):

Breast Cancer ◽

Gene Expression ◽

Copy Number Variation ◽

Copy Number ◽

Expression Profiles ◽

Mammary Tissue ◽

Lymph Node Status ◽

Specific Gene ◽

Driver Genes ◽

Number Variation

Abstract Breast cancer is a highly heterogeneous disease, and many aspects of its tumorigenesis and progression remain elusive. It is critical to identify genes that play important roles in tumor progression, especially for tumors with poor prognosis such as basal-like breast cancer and tumors in very young women. To facilitate the identification of potential regulatory or driver genes, we present the Breast Cancer Integrative Platform (BCIP, http://www.omicsnet.org/bcancer/). BCIP maintains multi-omics data selected under strict quality control and processed with uniform normalization methods, including gene expression profiles from 9,005 tumor and 376 normal tissue samples, copy number variation information from 3,035 tumor samples, microRNA-target interactions, co-expressed genes, KEGG pathways, and mammary tissue-specific gene functional networks. The platform provides a user-friendly interface that integrates comprehensive and flexible tools for differential gene expression, copy number variation, and survival analysis. A distinguishing feature of BCIP is that users can define custom subgroups for analysis using single or combined clinical features, including subtypes, histological grades, pathologic stages, metastasis status, lymph node status, ER/PR/HER2 status, TP53 mutation status, menopause status, age, tumor size, therapy responses, and prognosis. BCIP will help to identify regulatory or driver genes and candidate biomarkers for further research in breast cancer.
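The subgroup-based comparison described in the abstract can be illustrated with a minimal sketch. This is not BCIP's code or API: the record layout, the field names (`tissue`, `subtype`, `er_status`, `log2_expr`), the gene name `GENE_A`, and all values are hypothetical, and a plain difference of mean log2 expression stands in for whatever statistics the platform actually applies.

```python
from statistics import mean

# Toy records; every field name and value here is invented for illustration.
samples = [
    {"tissue": "tumor", "subtype": "basal-like", "er_status": "negative",
     "log2_expr": {"GENE_A": 8.0}},
    {"tissue": "tumor", "subtype": "basal-like", "er_status": "negative",
     "log2_expr": {"GENE_A": 6.0}},
    {"tissue": "tumor", "subtype": "luminal A", "er_status": "positive",
     "log2_expr": {"GENE_A": 5.5}},
    {"tissue": "normal", "subtype": None, "er_status": None,
     "log2_expr": {"GENE_A": 5.0}},
]


def select(records, **criteria):
    """Keep records whose clinical fields match every given criterion."""
    return [r for r in records
            if all(r.get(key) == value for key, value in criteria.items())]


def mean_log2_difference(group_a, group_b, gene):
    """Difference of mean log2 expression (group_a minus group_b) for one gene."""
    if not group_a or not group_b:
        raise ValueError("both groups must contain at least one sample")
    return (mean(r["log2_expr"][gene] for r in group_a)
            - mean(r["log2_expr"][gene] for r in group_b))


# Combine several clinical features to define one custom subgroup.
subgroup = select(samples, tissue="tumor", subtype="basal-like", er_status="negative")
normal = select(samples, tissue="normal")
diff = mean_log2_difference(subgroup, normal, "GENE_A")
```

Passing the criteria as keyword arguments keeps single-feature and combined-feature selections in one function, which mirrors the "single or combined clinical features" wording of the abstract.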

Faculty Opinions recommendation of The impact of copy number variation on local gene expression in mouse hematopoietic stem and progenitor cells.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1158714.618930 ◽

2009 ◽

Author(s):

Michael Lassner ◽

Antoni Rafalski

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Progenitor Cells ◽

Copy Number ◽

Hematopoietic Stem ◽

Stem And Progenitor Cells ◽

Number Variation ◽

The Impact

Insights into dispersed duplications and complex structural mutations from whole genome sequencing 706 families

10.1101/2020.08.03.235358 ◽

2020 ◽

Author(s):

Christopher W. Whelan ◽

Robert E. Handsaker ◽

Giulio Genovese ◽

Seva Kashin ◽

Monkol Lek ◽

...

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Copy Number ◽

De Novo ◽

Whole Genome ◽

Sequencing Data ◽

Number Variation ◽

Structural Mutations ◽

Or Gene ◽

Genomic Locations

Abstract Two intriguing forms of genome structural variation (SV) – dispersed duplications, and de novo rearrangements of complex, multi-allelic loci – have long escaped genomic analysis. We describe a new way to find and characterize such variation by utilizing identity-by-descent (IBD) relationships between siblings together with high-precision measurements of segmental copy number. Analyzing whole-genome sequence data from 706 families, we find hundreds of “IBD-discordant” (IBDD) CNVs: loci at which siblings’ CNV measurements and IBD states are mathematically inconsistent. We found that commonly-IBDD CNVs identify dispersed duplications; we mapped 95 of these common dispersed duplications to their true genomic locations through family-based linkage and population linkage disequilibrium (LD), and found several to be in strong LD with genome-wide association (GWAS) signals for common diseases or gene expression variation at their revealed genomic locations. Other CNVs that were IBDD in a single family appear to involve de novo mutations in complex and multi-allelic loci; we identified 26 de novo structural mutations that had not been previously detected in earlier analyses of the same families by diverse SV analysis methods. These included a de novo mutation of the amylase gene locus and multiple de novo mutations at chromosome 15q14. Combining these complex mutations with more-conventional CNVs, we estimate that segmental mutations larger than 1kb arise in about one per 22 human meioses. These methods are complementary to previous techniques in that they interrogate genomic regions that are home to segmental duplication, high CNV allele frequencies, and multi-allelic CNVs.

Author Summary Copy number variation is an important form of genetic variation in which individuals differ in the number of copies of segments of their genomes. Certain aspects of copy number variation have traditionally been difficult to study using short-read sequencing data. For example, standard analyses often cannot tell whether the duplicated copies of a segment are located near the original copy or are dispersed to other regions of the genome. Another aspect of copy number variation that has been difficult to study is the detection of mutations in the copy number of DNA segments passed down from parents to their children, particularly when the mutations affect genome segments which already display common copy number variation in the population. We develop an analytical approach to solving these problems when sequencing data is available for all members of families with at least two children. This method is based on determining the number of parental haplotypes the two siblings share at each location in their genome, and using that information to determine the possible inheritance patterns that might explain the copy numbers we observe in each family member. We show that dispersed duplications and mutations can be identified by looking for copy number variants that do not follow these expected inheritance patterns. We use this approach to determine the location of 95 common duplications which are dispersed to distant regions of the genome, and demonstrate that these duplications are linked to genetic variants that affect disease risk or gene expression levels. We also identify a set of copy number mutations not detected by previous analyses of sequencing data from a large cohort of families, and show that repetitive and complex regions of the genome undergo frequent mutations in copy number.
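The inheritance-consistency idea in the summary above can be sketched as a brute-force check. This is a simplified illustration, not the authors' implementation: it assumes an autosomal locus, exact integer copy numbers, and a known IBD state (0, 1, or 2 shared parental haplotypes), and the function name `ibd_consistent` is hypothetical. A family that fails the check would be a candidate "IBD-discordant" locus in the abstract's terminology.

```python
from itertools import product


def ibd_consistent(father, mother, sib1, sib2, ibd):
    """Return True if some split of each parent's diploid copy number into two
    haplotypes, plus a transmission pattern matching the siblings' IBD state,
    reproduces both siblings' observed copy numbers."""
    if ibd not in (0, 1, 2):
        raise ValueError("ibd must be 0, 1, or 2")
    for f_first in range(father + 1):
        f_haps = (f_first, father - f_first)      # paternal haplotype copy numbers
        for m_first in range(mother + 1):
            m_haps = (m_first, mother - m_first)  # maternal haplotype copy numbers
            # Each sibling inherits one paternal and one maternal haplotype.
            for p1, m1, p2, m2 in product((0, 1), repeat=4):
                shared = (p1 == p2) + (m1 == m2)  # haplotypes shared by the siblings
                if shared != ibd:
                    continue
                if (f_haps[p1] + m_haps[m1] == sib1
                        and f_haps[p2] + m_haps[m2] == sib2):
                    return True
    return False
```

For example, siblings who share both parental haplotypes (IBD state 2) must have equal copy numbers under this model, so `ibd_consistent(2, 2, 2, 3, 2)` is false, while `ibd_consistent(3, 2, 3, 2, 0)` is true because the father's three copies can split as 2 + 1 across his haplotypes. Real data would also need to account for measurement noise, which this sketch ignores.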

Human rDNA Copy Number Is Unstable in Metastatic Breast Cancers

10.1101/623595 ◽

2019 ◽

Author(s):

Virginia Valori ◽

Katalin Tus ◽

Christina Laukaitis ◽

David T. Harris ◽

Lauren LeBeau ◽

...

Keyword(s):

Breast Cancer ◽

Copy Number Variation ◽

Copy Number ◽

Genome Instability ◽

Gene Clusters ◽

Genome Rearrangements ◽

Epigenetic Silencing ◽

Healthy Tissue ◽

Gene Promoters ◽

Number Variation

Abstract Epigenetic silencing, including the formation of heterochromatin, silent chromosome territories, and repressed gene promoters, acts to stabilize patterns of gene regulation and the physical structure of the genome. Reduction of epigenetic silencing can result in genome rearrangements, particularly at intrinsically unstable regions of the genome such as transposons, satellite repeats, and repetitive gene clusters including the rRNA gene clusters (rDNA). It is thus expected that mutational or environmental conditions that compromise heterochromatin function might cause genome instability, and diseases associated with decreased epigenetic stability might exhibit genome changes as part of their etiology. We find support for this hypothesis in invasive ductal breast carcinoma, in which reduced epigenetic silencing has been previously described, by using a facile method to quantify rDNA copy number in biopsied breast tumors and pair-matched healthy tissue. We found that rDNA and satellite DNA sequences had significant copy number variation – both losses and gains of copies – compared to healthy tissue, arguing that these genome rearrangements are common in developing breast cancer. Thus, any proposed etiology for the onset or progression of breast cancer should consider alterations to the epigenome, but must also accommodate concomitant changes to genome sequence at heterochromatic loci.

Authors’ Statement One of the common hallmarks of cancer is genome instability, including hypermutation and changes to chromosome structure. Using tumor tissues obtained from women with invasive ductal carcinoma, we find that a sensitive area of the genome – the ribosomal DNA gene repeat cluster – shows hypervariability in copy number. The patterns we observe are not consistent with an adaptive loss leading to increased tumor growth; rather, we conclude that copy number variation at repeat DNA is a general consequence of reduced heterochromatin function in cancer progression.

Urinary polyphenols, glutathione S-transferases copy number variation, and breast cancer risk: Results from the Shanghai women's health study

Molecular Carcinogenesis ◽

10.1002/mc.20799 ◽

2011 ◽

Vol 51 (5) ◽

pp. 379-388 ◽

Cited By ~ 8

Author(s):

Jianfeng Luo ◽

Yu-Tang Gao ◽

Wong-Ho Chow ◽

Xiao-ou Shu ◽

Honglan Li ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Copy Number Variation ◽

Women's Health ◽

Copy Number ◽

Health Study ◽

Number Variation ◽

Glutathione S Transferases

Recent Advances in Studying of Copy Number Variation and Gene Expression

Gene Expression to Genetical Genomics ◽

10.4137/gegg.s14286 ◽

2014 ◽

Vol 7 ◽

pp. 1-5 ◽

Cited By ~ 1

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Copy Number ◽

Recent Advances ◽

Number Variation

Copy number variation, gene expression and histological localization of human beta-defensin 2 in patients with adeno-tonsillar hypertrophy

Biotechnic & Histochemistry ◽

10.1080/10520295.2020.1752936 ◽

2020 ◽

Vol 95 (8) ◽

pp. 634-640

Author(s):

Fulvio Celsi ◽

Luisa Zupin ◽

Emmanouil Athanasakis ◽

Eva Orzan ◽

Domenico Leonardo Grasso ◽

...

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Copy Number ◽

Tonsillar Hypertrophy ◽

Number Variation

Abstract 4720: Genome-wide copy number variation and breast cancer risk: Preliminary report

10.1158/1538-7445.am10-4720 ◽

2010 ◽

Author(s):

Kyoung-Mu Lee ◽

Miey Park ◽

Sang-Hoon Moon ◽

Hyung-Chol Kim ◽

Ji-Young Lee ◽

...

Keyword(s):

Breast Cancer ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Copy Number Variation ◽

Copy Number ◽

Preliminary Report ◽

Genome Wide ◽

Number Variation

Splicing, Mutation, and Methylation Alterations Drive Gene Expression in HPV-OPC more than Copy Number Variation: A Network Propagation Analysis

International Journal of Radiation Oncology*Biology*Physics ◽

10.1016/j.ijrobp.2019.11.119 ◽

2020 ◽

Vol 106 (5) ◽

pp. 1185

Author(s):

J.R. Qualliotine ◽

B. Rosenthal ◽

G. Xu ◽

A. Mark ◽

C.A. Nasamram ◽

...

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Copy Number ◽

Splicing Mutation ◽

Drive Gene Expression ◽

Propagation Analysis ◽

Network Propagation ◽

Number Variation ◽

Drive Gene

Copy number variation is highly correlated with differential gene expression: a pan-cancer study

BMC Medical Genetics ◽

10.1186/s12881-019-0909-5 ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 18

Author(s):

Xin Shao ◽

Ning Lv ◽

Jie Liao ◽

Jinbo Long ◽

Rui Xue ◽

...

Keyword(s):

Gene Expression ◽

Genetic Variation ◽

Copy Number Variation ◽

Differential Gene Expression ◽

Copy Number ◽

Close Correlation ◽

Number Variation ◽

Cancer Types ◽

Differential Gene ◽

The Relationship

Abstract Background: Cancer is a heterogeneous disease with many genetic variations. Several lines of evidence have shown that copy number variations (CNVs) of certain genes are involved in the development and progression of many cancers through alterations of their gene expression levels in individual or several cancer types. However, it is not clear whether this correlation is a general phenomenon across multiple cancer types. Methods: In this study we applied a bioinformatics approach integrating CNV and differential gene expression mathematically across 1025 cell lines and 9159 patient samples to detect their potential relationship. Results: Our results showed that there is a close correlation between CNV and differential gene expression, and that copy number displayed a positive linear influence on gene expression for the majority of genes, indicating that genetic variation has a direct effect on the gene transcriptional level. An independent dataset was used to validate the relationship between copy number and expression level. Further analysis shows that genes with a general positive linear influence on gene expression are clustered in certain disease-related pathways, which suggests the involvement of CNV in the pathophysiology of diseases. Conclusions: This study shows the close correlation between CNV and differential gene expression, revealing the qualitative relationship between genetic variation and its downstream effect, especially for oncogenes and tumor suppressor genes. It is of critical importance to elucidate the relationship between copy number variation and gene expression for the prevention, diagnosis and treatment of cancer.
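The "positive linear influence" of copy number on expression mentioned in the abstract can be illustrated with a per-gene correlation and least-squares slope. The abstract does not say which statistic the authors used, so Pearson correlation and ordinary least squares are assumptions here, and the function name `pearson_and_slope` is hypothetical.

```python
from math import sqrt


def pearson_and_slope(copy_number, expression):
    """Return (Pearson r, least-squares slope) of expression against copy number
    for one gene measured across several samples."""
    n = len(copy_number)
    if n != len(expression) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 samples")
    mean_x = sum(copy_number) / n
    mean_y = sum(expression) / n
    sxx = sum((x - mean_x) ** 2 for x in copy_number)
    syy = sum((y - mean_y) ** 2 for y in expression)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(copy_number, expression))
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when either variable is constant")
    r = sxy / sqrt(sxx * syy)   # strength and sign of the linear association
    slope = sxy / sxx           # expression change per additional copy
    return r, slope
```

Called on one gene's values across samples, a positive `r` together with a positive `slope` would correspond to the positive linear relationship the abstract describes. The constant-input guard matters in practice because many genes show no copy number change in a given cohort.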

Copy Number Variation Polymorphisms Modulate the Gene Expression Levels of Glutathione S-transferase Subtypes in Human Airway Epithelium.

10.1164/ajrccm-conference.2009.179.1_meetingabstracts.a3980 ◽

2009 ◽

Author(s):

MW Butler ◽

NR Hackett ◽

J Salit ◽

Y Strulovici-Barel ◽

RG Crystal

Keyword(s):

Gene Expression ◽

Copy Number Variation ◽

Airway Epithelium ◽

Copy Number ◽

Glutathione S Transferase ◽

Expression Levels ◽

Human Airway ◽

Human Airway Epithelium ◽

Number Variation ◽

Gene Expression Levels
