Heterogeneous data integration by tree-augmented naïve Bayes for protein-protein interactions prediction

Xiaotong Lin; Xue-wen Chen

doi:10.1002/pmic.201200326

Heterogeneous data integration by tree-augmented naïve Bayes for protein-protein interactions prediction

PROTEOMICS ◽

10.1002/pmic.201200326 ◽

2012 ◽

Vol 13 (2) ◽

pp. 261-268 ◽

Cited By ~ 21

Author(s):

Xiaotong Lin ◽

Xue-wen Chen

Keyword(s):

Data Integration ◽

Protein Interactions ◽

Naive Bayes ◽

Heterogeneous Data ◽

Naïve Bayes ◽

Protein Protein Interactions ◽

Heterogeneous Data Integration

Download Full-text

Identification of Interface Residues Involved in Protein-Protein Interactions Using Naïve Bayes Classifier

Advanced Data Mining and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-540-88192-6_20 ◽

2008 ◽

pp. 207-216 ◽

Cited By ~ 1

Author(s):

Chishe Wang ◽

Jiaxing Cheng ◽

Shoubao Su ◽

Dongzhe Xu

Keyword(s):

Protein Interactions ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Protein Protein Interactions ◽

Naïve Bayes Classifier ◽

Interface Residues

Download Full-text

A Comprehensive Approach Characterizing Fusion Proteins and Their Interactions Using Biomedical Literature

10.1101/371088 ◽

2018 ◽

Author(s):

Somnath Tagore ◽

Alessandro Gorohovski ◽

Lars Juhl Jensen ◽

Milana Frenkel-Morgenstern

Keyword(s):

Protein Interactions ◽

Fusion Proteins ◽

Naive Bayes ◽

Naïve Bayes ◽

Biomedical Literature ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Protein Protein Interactions ◽

Naïve Bayes Classifier ◽

Downstream Analysis

AbstractToday’s increase in scientific literature requires the efficient methods of data mining for improving the extraction of the useful information from texts. In this manuscript, we used a data and text mining method to identify fusions and their protein-protein interactions from published biomedical text. The extracted fusion proteins and their protein-protein interactions are used as a training set for a Naïve Bayes classifier that is further used for final identification of testing dataset, consisting of 1817 fusions. Our method has a literature corpus, text and annotation mappers; keywords, rule bases, negative tokens, and pattern extractor; synonym tagger, normalization, regular expression mapper; and Naïve Bayes classifier. We classified 1817 unique fusion proteins and their corresponding 2908 protein-protein interactions for 18 cancer types. Therefore, it can be used for screening literature for identifying mentions unique cases of fusions that can be further used for downstream analysis. It is available at http://protfus.md.biu.ac.il/.

Download Full-text

VGEs-Oriented Multi-sourced Heterogeneous Data Integration

Geo-information Science ◽

10.3724/sp.j.1047.2009.00292 ◽

2010 ◽

Vol 11 (3) ◽

pp. 292-298

Author(s):

Hongjun SU ◽

Yehua SHENG ◽

Yongning WEN ◽

Min CHEN

Keyword(s):

Data Integration ◽

Heterogeneous Data ◽

Heterogeneous Data Integration

Download Full-text

Reconsideration of in silico siRNA design from a perspective of heterogeneous data integration: problems and solutions

Briefings in Bioinformatics ◽

10.1093/bib/bbs073 ◽

2012 ◽

Vol 15 (2) ◽

pp. 292-305 ◽

Cited By ~ 5

Author(s):

Q. Liu ◽

H. Zhou ◽

R. Zhu ◽

Y. Xu ◽

Z. Cao

Keyword(s):

Data Integration ◽

In Silico ◽

Heterogeneous Data ◽

Heterogeneous Data Integration ◽

Problems And Solutions ◽

Sirna Design ◽

Integration Problems

Download Full-text

A Data Model for Heterogeneous Data Integration Architecture

Communications in Computer and Information Science - Beyond Databases, Architectures, and Structures ◽

10.1007/978-3-319-06932-6_53 ◽

2014 ◽

pp. 547-556 ◽

Cited By ~ 7

Author(s):

Michał Chromiak ◽

Krzysztof Stencel

Keyword(s):

Data Integration ◽

Data Model ◽

Heterogeneous Data ◽

Heterogeneous Data Integration ◽

Integration Architecture

Download Full-text

Hierarchical Multi-Agent System for Heterogeneous Data Integration

Intelligent Decision Systems in Large-Scale Distributed Environments - Studies in Computational Intelligence ◽

10.1007/978-3-642-21271-0_8 ◽

2011 ◽

pp. 165-186 ◽

Cited By ~ 3

Author(s):

Aleksander Byrski ◽

Marek Kisiel-Dorohinicki ◽

Jacek Dajda ◽

Grzegorz Dobrowolski ◽

Edward Nawarecki

Keyword(s):

Data Integration ◽

Heterogeneous Data ◽

Multi Agent System ◽

Agent System ◽

Heterogeneous Data Integration ◽

Multi Agent

Download Full-text

Design of Heterogeneous Data Integration and Sharing System for Coastal International Trade

Journal of Coastal Research ◽

10.2112/si103-147.1 ◽

2020 ◽

Vol 103 (sp1) ◽

pp. 718

Author(s):

Dongfeng Geng

Keyword(s):

International Trade ◽

Data Integration ◽

Heterogeneous Data ◽

Heterogeneous Data Integration

Download Full-text

Heterogeneous data integration framework based on grid service

2009 IEEE International Conference on Network Infrastructure and Digital Content ◽

10.1109/icnidc.2009.5360897 ◽

2009 ◽

Author(s):

Yanbing Liu ◽

Zhangxiong Liu ◽

Laiming Luo

Keyword(s):

Data Integration ◽

Heterogeneous Data ◽

Grid Service ◽

Integration Framework ◽

Heterogeneous Data Integration

Download Full-text

Design and Implementation of Oilfield Heterogeneous Data Integration Model Based on Ontology

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.912-914.1201 ◽

2014 ◽

Vol 912-914 ◽

pp. 1201-1204

Author(s):

Gang Huang ◽

Xiu Ying Wu ◽

Man Yuan

Keyword(s):

Data Integration ◽

Heterogeneous Data ◽

Data Sources ◽

Semantic Heterogeneity ◽

Integration Model ◽

Integration Framework ◽

Heterogeneous Data Integration ◽

Semantic Level ◽

Heterogeneous Data Sources ◽

Semantic Difference

This paper provides an ontology-based distributed heterogeneous data integration framework (ODHDIF). The framework resolves the problem of semantic interoperability between heterogeneous data sources in semantic level. By metadatas specifying the distributed, heterogeneous data and by describing semantic information of data source , having "ontology" as a common semantic model, semantic match is established through ontology mapping between heterogeneous data sources and semantic difference institutions are shielded, so that semantic heterogeneity problem of the heterogeneous data sources can be effectively solved. It provides an effective technology measure for the interior information of enterprises to be shared in time accurately.

Download Full-text

Agent based heterogeneous data integration and maintenance decision support for high-speed railway signal system

17th International IEEE Conference on Intelligent Transportation Systems (ITSC) ◽

10.1109/itsc.2014.6957995 ◽

2014 ◽

Cited By ~ 2

Author(s):

Lianbao Yang ◽

Tianhua Xu ◽

Zhenxian Wang

Keyword(s):

Decision Support ◽

Data Integration ◽

High Speed ◽

Heterogeneous Data ◽

Signal System ◽

High Speed Railway ◽

Heterogeneous Data Integration ◽

Agent Based ◽

Railway Signal ◽

Maintenance Decision

Download Full-text