scholarly journals Semantic Web-Based Integration of Cancer Pathways and Allele Frequency Data

2009 ◽  
Vol 8 ◽  
pp. CIN.S1006 ◽  
Author(s):  
Matthew E. Holford ◽  
Haseena Rajeevan ◽  
Hongyu Zhao ◽  
Kenneth K. Kidd ◽  
Kei-Hoi Cheung

We demonstrate the use of Semantic Web technology to integrate the ALFRED allele frequency database and the Starpath pathway resource. The linking of population-specific genotype data with cancer-related pathway data is potentially useful given the growing interest in personalized medicine and the exploitation of pathway knowledge for cancer drug discovery. We model our data using the Web Ontology Language (OWL), drawing upon ideas from existing standard formats BioPAX for pathway data and PML for allele frequency data. We store our data within an Oracle database, using Oracle Semantic Technologies. We then query the data using Oracle's rule-based inference engine and SPARQL-like RDF query language. The ability to perform queries across the domains of population genetics and pathways offers the potential to answer a number of cancer-related research questions. Among the possibilities is the ability to identify genetic variants which are associated with cancer pathways and whose frequency varies significantly between ethnic groups. This sort of information could be useful for designing clinical studies and for providing background data in personalized medicine. It could also assist with the interpretation of genetic analysis results such as those from genome-wide association studies.

2020 ◽  
Author(s):  
Marta Pelizzola ◽  
Merle Behr ◽  
Housen Li ◽  
Axel Munk ◽  
Andreas Futschik

AbstractSince haplotype information is of widespread interest in biomedical applications, effort has been put into their reconstruction. Here, we propose a new, computationally efficient method, called haploSep, that is able to accurately infer major haplotypes and their frequencies just from multiple samples of allele frequency data. Our approach seems to be the first that is able to estimate more than one haplotype given such data. Even the accuracy of experimentally obtained allele frequencies can be improved by re-estimating them from our reconstructed haplotypes. From a methodological point of view, we model our problem as a multivariate regression problem where both the design matrix and the coefficient matrix are unknown. The design matrix, with 0/1 entries, models haplotypes and the columns of the coefficient matrix represent the frequencies of haplotypes, which are non-negative and sum up to one. We illustrate our method on simulated and real data focusing on experimental evolution and microbial data.


1998 ◽  
Vol 43 (6) ◽  
pp. 14384J ◽  
Author(s):  
Bruce Budowle ◽  
F. Samuel Baechtel ◽  
Rose Fejeran

1994 ◽  
Vol 3 (3) ◽  
pp. 248-253 ◽  
Author(s):  
John S. Waye ◽  
Melanie Richard ◽  
George Carmody ◽  
Pamela J. Newall

1999 ◽  
Vol 9 (6) ◽  
pp. 669???682 ◽  
Author(s):  
Andrea Gaedigk ◽  
R. Russell Gotschall ◽  
Nata??a S. Forbes ◽  
Stephen D. Simon ◽  
Gregory L. Kearns ◽  
...  

2006 ◽  
Vol 51 (2) ◽  
pp. 436-437 ◽  
Author(s):  
Ronny Decorte ◽  
Elke Verhoeven ◽  
Elisabeth Vanhoutte ◽  
Katleen Knaepen ◽  
Jean-Jacques Cassiman

2021 ◽  
Vol 19 (1) ◽  
pp. 48
Author(s):  
Ferdy Saputra ◽  
Tike Sartika ◽  
Anneke Anggraeni ◽  
Andi Baso Lompengeng Ishak ◽  
Komarudin Komarudin ◽  
...  

<p class="MDPI17abstract"><strong>Objective: </strong>This study tries to examine several multivariate methods in classifying genetic diversity using microsatellite allele frequency data.</p><p class="MDPI17abstract"><strong>Methods: </strong>This study used microsatellite allele frequency data from White Leghorn (n = 48), Kampung (n = 48), Pelung (n = 24), Sentul (n = 24), and Black Kedu (n = 25) from Indonesian Research Institute for Animal Production. Allele frequency data were analyzed by the Neighbor-Joining (NJ) method using the POPTREE2 program. The data was also analyzed by the Principal Component Analysis (PCA), Correspondence Analysis (CA), and Hierarchical Clustering on Principal Components (HCPC) methods using the factoextra and FactoMineR packages in the R 4.0.0 program.<strong></strong></p><p class="MDPI17abstract"><strong>Results: </strong>Correspondence Analysis (CA) found Sentul is more closer to Black Kedu. However, based on NJ, PCA, and HCPC showed Sentul is closer to Kampung. Based on the value of Dimension 1, Correspondence Analysis (80.7%) can explain greater variation than PCA (58.9%). However, CA method generated different results compared to NJ, PCA, and HCPC. NJ, PCA, and HCPC found four chicken clusters, namely cluster 1 (White Leghorn), cluster 2 (Pelung), cluster 3 (Black Kedu), and cluster 4 (Kampung and Sentul).<strong></strong></p><p class="MDPI17abstract"><strong>Conclusions: </strong>In conclusion, HCPC is a better multivariate method for analyzing allele frequency data than PCA and CA. HCPC can be used to analyze allele frequency data better than PCA, because HCPC is a combination of methods from hierarchical clustering and principal components.</p>


Author(s):  
C. Paz-y-Miño ◽  
O. Astudillo-González ◽  
D. Maldonado-Oyervide ◽  
A. López-Cortés ◽  
A. Pérez-Villa ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document