scholarly journals DOT: Gene-set analysis by combining decorrelated association statistics

2019 ◽  
Author(s):  
Olga A. Vsevolozhskaya ◽  
Min Shi ◽  
Fengjiao Hu ◽  
Dmitri V. Zaykin

AbstractHistorically, the majority of statistical association methods have been designed assuming availability of SNP-level information. However, modern genetic and sequencing data present new challenges to access and sharing of genotype-phenotype datasets, including cost management, difficulties in consolidation of records across research groups, etc. These issues make methods based on SNP-level summary statistics particularly appealing. The most common form of combining statistics is a sum of SNP-level squared scores, possibly weighted, as in burden tests for rare variants. The overall significance of the resulting statistic is evaluated using its distribution under the null hypothesis. Here, we demonstrate that this basic approach can be substantially improved by decorrelating scores prior to their addition, resulting in remarkable power gains in situations that are most commonly encountered in practice; namely, under heterogeneity of effect sizes and diversity between pairwise LD. In these situations, the power of the traditional test, based on the added squared scores, quickly reaches a ceiling, as the number of variants increases. Thus, the traditional approach does not benefit from information potentially contained in any additional SNPs, while our decorrelation by orthogonal transformation (DOT) method yields steady gain in power. We present theoretical and computational analyses of both approaches, and reveal causes behind sometimes dramatic difference in their respective powers. We showcase DOT by analyzing breast cancer data, in which our method strengthened levels of previously reported associations and implied the possibility of multiple new alleles that jointly confer breast cancer risk.

2011 ◽  
Vol 4 (2) ◽  
pp. 8-12
Author(s):  
Leo Alexander T Leo Alexander T ◽  
◽  
Pari Dayal L Pari Dayal L ◽  
Valarmathi S Valarmathi S ◽  
Ponnuraja C Ponnuraja C ◽  
...  

2020 ◽  
Vol 4 (5) ◽  
pp. 805-812
Author(s):  
Riska Chairunisa ◽  
Adiwijaya ◽  
Widi Astuti

Cancer is one of the deadliest diseases in the world with a mortality rate of 57,3% in 2018 in Asia. Therefore, early diagnosis is needed to avoid an increase in mortality caused by cancer. As machine learning develops, cancer gene data can be processed using microarrays for early detection of cancer outbreaks. But the problem that microarray has is the number of attributes that are so numerous that it is necessary to do dimensional reduction. To overcome these problems, this study used dimensions reduction Discrete Wavelet Transform (DWT) with Classification and Regression Tree (CART) and Random Forest (RF) as classification method. The purpose of using these two classification methods is to find out which classification method produces the best performance when combined with the DWT dimension reduction. This research use five microarray data, namely Colon Tumors, Breast Cancer, Lung Cancer, Prostate Tumors and Ovarian Cancer from Kent-Ridge Biomedical Dataset. The best accuracy obtained in this study for breast cancer data were 76,92% with CART-DWT, Colon Tumors 90,1% with RF-DWT, lung cancer 100% with RF-DWT, prostate tumors 95,49% with RF-DWT, and ovarian cancer 100% with RF-DWT. From these results it can be concluded that RF-DWT is better than CART-DWT.  


2018 ◽  
Vol 64 (2) ◽  
pp. 196-199
Author(s):  
Gulya Miryusupova ◽  
G. Khakimov ◽  
N. Shayusupov

According to the results of breast cancer data in the Republic of Uzbekistan in addition to the increase in morbidity and mortality from breast cancer among women the presence of age specific features among indigenous women in the direction of “rejuvenating” of the disease with all molecular-biological (phenotypic) subtypes of breast cancer were marked. Within the framework of age-related features the prevalence of the least favorable phenotypes of breast cancer was found among indigenous women: Her2/neu hyperexpressive and three times negative subtype of breast cancer. The data obtained made it possible to build a so-called population “portrait” of breast cancer on the territory of the Republic, which in turn would contribute to further improvement of cancer care for the female population of the country.


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
Na Li ◽  
Belle W. X. Lim ◽  
Ella R. Thompson ◽  
Simone McInerny ◽  
Magnus Zethoven ◽  
...  

AbstractBreast cancer (BC) has a significant heritable component but the genetic contribution remains unresolved in the majority of high-risk BC families. This study aims to investigate the monogenic causes underlying the familial aggregation of BC beyond BRCA1 and BRCA2, including the identification of new predisposing genes. A total of 11,511 non-BRCA familial BC cases and population-matched cancer-free female controls in the BEACCON study were investigated in two sequencing phases: 1303 candidate genes in up to 3892 cases and controls, followed by validation of 145 shortlisted genes in an additional 7619 subjects. The coding regions and exon–intron boundaries of all candidate genes and 14 previously proposed BC genes were sequenced using custom designed sequencing panels. Pedigree and pathology data were analysed to identify genotype-specific associations. The contribution of ATM, PALB2 and CHEK2 to BC predisposition was confirmed, but not RAD50 and NBN. An overall excess of loss-of-function (LoF) (OR 1.27, p = 9.05 × 10−9) and missense (OR 1.27, p = 3.96 × 10−73) variants was observed in the cases for the 145 candidate genes. Leading candidates harbored LoF variants with observed ORs of 2–4 and individually accounted for no more than 0.79% of the cases. New genes proposed by this study include NTHL1, WRN, PARP2, CTH and CDK9. The new candidate BC predisposition genes identified in BEACCON indicate that much of the remaining genetic causes of high-risk BC families are due to genes in which pathogenic variants are both very rare and convey only low to moderate risk.


2021 ◽  
Vol 9 (7) ◽  
pp. e002383
Author(s):  
Jin-Li Wei ◽  
Si-Yu Wu ◽  
Yun-Song Yang ◽  
Yi Xiao ◽  
Xi Jin ◽  
...  

PurposeRegulatory T cells (Tregs) heavily infiltrate triple-negative breast cancer (TNBC), and their accumulation is affected by the metabolic reprogramming in cancer cells. In the present study, we sought to identify cancer cell-intrinsic metabolic modulators correlating with Tregs infiltration in TNBC.Experimental designUsing the RNA-sequencing data from our institute (n=360) and the Molecular Taxonomy of Breast Cancer International Consortium TNBC cohort (n=320), we calculated the abundance of Tregs in each sample and evaluated the correlation between gene expression levels and Tregs infiltration. Then, in vivo and in vitro experiments were performed to verify the correlation and explore the underlying mechanism.ResultsWe revealed that GTP cyclohydrolase 1 (GCH1) expression was positively correlated with Tregs infiltration and high GCH1 expression was associated with reduced overall survival in TNBC. In vivo and in vitro experiments showed that GCH1 increased Tregs infiltration, decreased apoptosis, and elevated the programmed cell death-1 (PD-1)-positive fraction. Metabolomics analysis indicated that GCH1 overexpression reprogrammed tryptophan metabolism, resulting in L-5-hydroxytryptophan (5-HTP) accumulation in the cytoplasm accompanied by kynurenine accumulation and tryptophan reduction in the supernatant. Subsequently, aryl hydrocarbon receptor, activated by 5-HTP, bound to the promoter of indoleamine 2,3-dioxygenase 1 (IDO1) and thus enhanced the transcription of IDO1. Furthermore, the inhibition of GCH1 by 2,4-diamino-6-hydroxypyrimidine (DAHP) decreased IDO1 expression, attenuated tumor growth, and enhanced the tumor response to PD-1 blockade immunotherapy.ConclusionsTumor-cell-intrinsic GCH1 induced immunosuppression through metabolic reprogramming and IDO1 upregulation in TNBC. Inhibition of GCH1 by DAHP serves as a potential immunometabolic strategy in TNBC.


BMC Cancer ◽  
2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Heidemarie Haller ◽  
Petra Voiß ◽  
Holger Cramer ◽  
Anna Paul ◽  
Mattea Reinisch ◽  
...  

Abstract Background Cancer registries usually assess data of conventional treatments and/or patient survival. Beyond that, little is known about the influence of other predictors of treatment response related to the use of complementary therapies (CM) and lifestyle factors affecting patients’ quality and quantity of life. Methods INTREST is a prospective cohort study collecting register data at multiple German certified cancer centers, which provide individualized, integrative, in- and outpatient breast cancer care. Patient-reported outcomes and clinical cancer data of anticipated N = 715 women with pTNM stage I-III breast cancer are collected using standardized case report forms at the time of diagnosis, after completing neo−/adjuvant chemotherapy, after completing adjuvant therapy (with the exception of endocrine therapy) as well as 1, 2, 5, and 10 years after baseline. Endpoints for multivariable prediction models are quality of life, fatigue, treatment adherence, and progression-based outcomes/survival. Predictors include the study center, sociodemographic characteristics, histologic cancer and comorbidity data, performance status, stress perception, depression, anxiety, sleep quality, spirituality, social support, physical activity, diet behavior, type of conventional treatments, use of and belief in CM treatments, and participation in a clinical trial. Safety is recorded following the Common Terminology Criteria for Adverse Events. Discussion This trial is currently recruiting participants. Future analyses will allow to identify predictors of short- and long-term response to integrative breast cancer treatment in women, which, in turn, may improve cancer care as well as quality and quantity of life with cancer. Trial registration German Clinical Trial Register DRKS00014852. Retrospectively registered at July 4th, 2018.


2021 ◽  
pp. 019394592110319
Author(s):  
Wonshik Chee ◽  
Eun-Ok Im

The purpose of the study was to explore the associations of sub-ethnicity to the survivorship experience of Asian American breast cancer survivors and identify the multiple factors that influenced their survivorship experience. This was a secondary analysis of the data among 94 Asian American breast cancer survivors from a larger ongoing study. Instruments included: questions on background characteristics, the perceived isolation scale, the Personal Resource Questionnaire, the Memorial Symptom Assessment Scale-Short Form, and the Functional Assessment of Cancer Therapy-Breast Cancer. Data were analyzed using hierarchical logistic and multiple regression analyses. After controlling for other factors, being a Japanese American (ref. = being a Chinese American) was significantly associated with pain scores (odds ratio [OR] = −0.32, p < .01), symptom distress scores ( β = −0.27, p < .01), and the quality of life scores ( β = 0.22, p = .03). Sub-ethnic variations in cultural attitudes, values, and beliefs need to be considered in future research/practice with Asian American breast cancer survivors.


2021 ◽  
pp. 1063293X2110251
Author(s):  
K Vijayakumar ◽  
Vinod J Kadam ◽  
Sudhir Kumar Sharma

Deep Neural Network (DNN) stands for multilayered Neural Network (NN) that is capable of progressively learn the more abstract and composite representations of the raw features of the input data received, with no need for any feature engineering. They are advanced NNs having repetitious hidden layers between the initial input and the final layer. The working principle of such a standard deep classifier is based on a hierarchy formed by the composition of linear functions and a defined nonlinear Activation Function (AF). It remains uncertain (not clear) how the DNN classifier can function so well. But it is clear from many studies that within DNN, the AF choice has a notable impact on the kinetics of training and the success of tasks. In the past few years, different AFs have been formulated. The choice of AF is still an area of active study. Hence, in this study, a novel deep Feed forward NN model with four AFs has been proposed for breast cancer classification: hidden layer 1: Swish, hidden layer, 2:-LeakyReLU, hidden layer 3: ReLU, and final output layer: naturally Sigmoidal. The purpose of the study is twofold. Firstly, this study is a step toward a more profound understanding of DNN with layer-wise different AFs. Secondly, research is also aimed to explore better DNN-based systems to build predictive models for breast cancer data with improved accuracy. Therefore, the benchmark UCI dataset WDBC was used for the validation of the framework and evaluated using a ten-fold CV method and various performance indicators. Multiple simulations and outcomes of the experimentations have shown that the proposed solution performs in a better way than the Sigmoid, ReLU, and LeakyReLU and Swish activation DNN in terms of different parameters. This analysis contributes to producing an expert and precise clinical dataset classification method for breast cancer. Furthermore, the model also achieved improved performance compared to many established state-of-the-art algorithms/models.


Sign in / Sign up

Export Citation Format

Share Document