Collaborative Graph Learning with Auxiliary Text for Temporal Event Prediction in Healthcare

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/486 ◽

2021 ◽

Author(s):

Chang Lu ◽

Chandan K Reddy ◽

Prithwish Chakraborty ◽

Samantha Kleinberg ◽

Yue Ning

Keyword(s):

Domain Knowledge ◽

Healthcare Providers ◽

Structural Features ◽

Text Data ◽

Graph Learning ◽

Care Plans ◽

Event Prediction ◽

Proposed Model ◽

Health Event ◽

Text Features

Accurate and explainable health event predictions are becoming crucial for healthcare providers to develop care plans for patients. The availability of electronic health records (EHR) has enabled machine learning advances in providing these predictions. However, many deep-learning-based methods are not satisfactory in solving several key challenges: 1) effectively utilizing disease domain knowledge; 2) collaboratively learning representations of patients and diseases; and 3) incorporating unstructured features. To address these issues, we propose a collaborative graph learning model to explore patient-disease interactions and medical domain knowledge. Our solution is able to capture structural features of both patients and diseases. The proposed model also utilizes unstructured text data by employing an attention manipulating strategy and then integrates attentive text features into a sequential learning process. We conduct extensive experiments on two important healthcare problems to show the competitive prediction performance of the proposed method compared with various state-of-the-art models. We also confirm the effectiveness of learned representations and model interpretability by a set of ablation and case studies.

Download Full-text

A Densely Connected GRU Neural Network Based on Coattention Mechanism for Chinese Rice-Related Question Similarity Matching

Agronomy ◽

10.3390/agronomy11071307 ◽

2021 ◽

Vol 11 (7) ◽

pp. 1307

Author(s):

Haoriqin Wang ◽

Huaji Zhu ◽

Huarui Wu ◽

Xiaomin Wang ◽

Xiao Han ◽

...

Keyword(s):

Self-Supervised Graph Learning With Hyperbolic Embedding for Temporal Health Event Prediction

IEEE Transactions on Cybernetics ◽

10.1109/tcyb.2021.3109881 ◽

2021 ◽

pp. 1-13

Author(s):

Chang Lu ◽

Chandan K. Reddy ◽

Yue Ning

Keyword(s):

Graph Learning ◽

Event Prediction ◽

Health Event ◽

Hyperbolic Embedding

Download Full-text

Estimation and tracking of elder activity levels for health event prediction

10.32469/10355/6657 ◽

2009 ◽

Author(s):

Nicholas M. Harvey

Keyword(s):

Activity Levels ◽

Event Prediction ◽

Health Event

Download Full-text

Towards effective link prediction: A hybrid similarity model

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200344 ◽

2020 ◽

pp. 1-14

Author(s):

Longjie Li ◽

Lu Wang ◽

Hongsheng Luo ◽

Xiaoyun Chen

Keyword(s):

Link Prediction ◽

Structural Similarity ◽

Research Direction ◽

Structural Features ◽

Proposed Model ◽

Similarity Model ◽

Weight Calculation ◽

Stable Performance ◽

Grey Relation ◽

Important Research Direction

Link prediction is an important research direction in complex network analysis and has drawn increasing attention from researchers in various fields. So far, a plethora of structural similarity-based methods have been proposed to solve the link prediction problem. To achieve stable performance on different networks, this paper proposes a hybrid similarity model to conduct link prediction. In the proposed model, the Grey Relation Analysis (GRA) approach is employed to integrate four carefully selected similarity indexes, which are designed according to different structural features. In addition, to adaptively estimate the weight for each index based on the observed network structures, a new weight calculation method is presented by considering the distribution of similarity scores. Due to taking separate similarity indexes into account, the proposed method is applicable to multiple different types of network. Experimental results show that the proposed method outperforms other prediction methods in terms of accuracy and stableness on 10 benchmark networks.

Download Full-text

A Global Socio-economic-medico-legal Model for the Sustainability of Longitudinal Electronic Health Records

Methods of Information in Medicine ◽

10.1055/s-0038-1634081 ◽

2006 ◽

Vol 45 (03) ◽

pp. 240-245 ◽

Cited By ~ 21

Author(s):

A. Shabo

Keyword(s):

Electronic Health Records ◽

Healthcare Providers ◽

Financial Assets ◽

Healthcare Organizations ◽

Health Records ◽

Record Keeping ◽

Proposed Model ◽

Legal Model ◽

Electronic Health

Summary Objectives: This paper pursues the challenge of sustaining lifetime electronic health records (EHRs) based on a comprehensive socio-economic-medico-legal model. The notion of a lifetime EHR extends the emerging concept of a longitudinal and cross-institutional EHR and is invaluable information for increasing patient safety and quality of care. Methods: The challenge is how to compile and sustain a coherent EHR across the lifetime of an individual. Several existing and hypothetical models are described, analyzed and compared in an attempt to suggest a preferred approach. Results: The vision is that lifetime EHRs should be sustained by new players in the healthcare arena, who will function as independent health record banks (IHRBs). Multiple competing IHRBs would be established and regulated following preemptive legislation. They should be neither owned by healthcare providers nor by health insurer/payers or government agencies. The new legislation should also stipulate that the records located in these banks be considered the medico-legal copies of an individual’s records, and that healthcare providers no longer serve as the legal record keepers. Conclusions: The proposed model is not centered on any of the current players in the field; instead, it is focussed on the objective service of sustaining individual EHRs, much like financial banks maintain and manage financial assets. This revolutionary structure provides two main benefits: 1) Healthcare organizations will be able to cut the costs of long-term record keeping, and 2) healthcare providers will be able to provide better care based on the availability of a lifelong EHR of their new patients.

Download Full-text

Modeling Human Factors Topics in Aviation Reports

Proceedings of the Human Factors and Ergonomics Society Annual Meeting ◽

10.1177/1071181319631095 ◽

2019 ◽

Vol 63 (1) ◽

pp. 126-130

Author(s):

Beth Lyall-Wilson ◽

Nicolas Kim ◽

Elizabeth Hohman

Keyword(s):

Human Factors ◽

Topic Modeling ◽

Domain Knowledge ◽

Aviation Safety ◽

Subject Matter Experts ◽

Text Data ◽

Modeling Approach ◽

Modeling Process ◽

Manual Review ◽

Initial Extraction

This paper describes the development and new application of a text modeling process for identifying human factors topics, such as fatigue, workload, and distraction in aviation safety reports. Current approaches to identifying human factors topic representations in text data rely on manual review from subject matter experts. The implementation of a semi-supervised text modeling method overcomes the need for lengthy manual review through an initial extraction of pre-defined human factors topics, freeing time for focus on analyzing the information. This modeling approach allows analysts to use keywords to define topics of interest up front and influence the convergence of the model toward a result that reflects them, which provides an advantage over classic topic modeling approaches where domain knowledge is not integrated into the generation of derived topics. This paper includes a description of the modeling approach and rationale, data used, evaluation methods, challenges, and suggestions for future applications.

Download Full-text

Semi-Supervised Aspect-Based Sentiment Analysis for Case-Related Microblog Reviews Using Case Knowledge Graph Embedding

International Journal of Asian Language Processing ◽

10.1142/s2717554520500125 ◽

2021 ◽

pp. 2050012

Author(s):

Peilian Zhao ◽

Cunli Mao ◽

Zhengtao Yu

Keyword(s):

Sentiment Analysis ◽

Domain Knowledge ◽

Opinion Mining ◽

Data Augmentation ◽

Training Data ◽

Knowledge Graph ◽

Fine Grained ◽

Learning Framework ◽

Proposed Model ◽

Real World Applications

Aspect-Based Sentiment Analysis (ABSA), a fine-grained task of opinion mining, which aims to extract sentiment of specific target from text, is an important task in many real-world applications, especially in the legal field. Therefore, in this paper, we study the problem of limitation of labeled training data required and ignorance of in-domain knowledge representation for End-to-End Aspect-Based Sentiment Analysis (E2E-ABSA) in legal field. We proposed a new method under deep learning framework, named Semi-ETEKGs, which applied E2E framework using knowledge graph (KG) embedding in legal field after data augmentation (DA). Specifically, we pre-trained the BERT embedding and in-domain KG embedding for unlabeled data and labeled data with case elements after DA, and then we put two embeddings into the E2E framework to classify the polarity of target-entity. Finally, we built a case-related dataset based on a popular benchmark for ABSA to prove the efficiency of Semi-ETEKGs, and experiments on case-related dataset from microblog comments show that our proposed model outperforms the other compared methods significantly.

Download Full-text

The Classification System of Literary Works Based on K-Means Clustering

Journal of Interconnection Networks ◽

10.1142/s0219265921410012 ◽

2021 ◽

pp. 2141001

Author(s):

Sanqiang Wei ◽

Hongxia Hou ◽

Hua Sun ◽

Wei Li ◽

Wenxia Song

Keyword(s):

Clustering Algorithm ◽

Performance Ratio ◽

Levels Of Abstraction ◽

Text Documents ◽

Text Data ◽

Literary Works ◽

Accuracy Comparison ◽

Word Classification ◽

Text Features ◽

And Performance

The plots in certain literary works are very complicated and hinder readers from understanding them. Therefore tools should be proposed to support readers; comprehension of complex literary works supports their understanding by providing the most important information to readers. A human reader must capture multiple levels of abstraction and meaning to formulate an understanding of a document. Hence, in this paper, an Improved [Formula: see text]-means clustering algorithm (IKCA) has been proposed for literary word classification. For text data, the words that can express exact semantic in a class are generally better features. This paper uses the proposed technique to capture numerous cluster centroids for every class and then select the high-frequency words in centroids the text features for classification. Furthermore, neural networks have been used to classify text documents and [Formula: see text]-mean to cluster text documents. To develop the model based on unsupervised and supervised techniques to meet and identify the similarity between documents. The numerical results show that the suggested model will enhance to increases quality comparison of the existing Algorithm and [Formula: see text]-means algorithm, accuracy comparison of ALA and IKCA (95.2%), time is taken for clustering is less than 2 hours, success rate (97.4%) and performance ratio (98.1%).

Download Full-text

Improving the evidence-based clinical decision-making process: Interactive classification and topic discovery on diabetes-related biomedical literature (Preprint)

10.2196/preprints.27434 ◽

2021 ◽

Author(s):

Adrian Ahne ◽

Guy Fagherazzi ◽

Xavier Tannier ◽

Thomas Czernichow ◽

Francisco Orchard

Keyword(s):

Decision Making ◽

Active Learning ◽

Hierarchical Clustering ◽

Domain Knowledge ◽

Clinical Decision Making ◽

Clinical Decision ◽

Biomedical Literature ◽

Decision Making Process ◽

Evidence Based ◽

Text Data

BACKGROUND The amount of available textual health data such as scientific and biomedical literature is constantly growing and it becomes more and more challenging for health professionals to properly summarise those data and in consequence to practice evidence-based clinical decision making. Moreover, the exploration of large unstructured health text data is very challenging for non experts due to limited time, resources and skills. Current tools to explore text data lack ease of use, need high computation efforts and have difficulties to incorporate domain knowledge and focus on topics of interest. OBJECTIVE We developed a methodology which is able to explore and target topics of interest via an interactive user interface for experts and non-experts. We aim to reach near state of the art performance, while reducing memory consumption, increasing scalability and minimizing user interaction effort to improve the clinical decision making process. The performance is evaluated on diabetes-related abstracts from Pubmed. METHODS The methodology consists of four parts: 1) A novel interpretable hierarchical clustering of documents where each node is defined by headwords (describe documents in this node the most); 2) An efficient classification system to target topics; 3) Minimized users interaction effort through active learning; 4) A visual user interface through which a user interacts. We evaluated our approach on 50,911 diabetes-related abstracts from Pubmed which provide a hierarchical Medical Subject Headings (MeSH) structure, a unique identifier for a topic. Hierarchical clustering performance was compared against the implementation in the machine learning library scikit-learn. On a subset of 2000 randomly chosen diabetes abstracts, our active learning strategy was compared against three other strategies: random selection of training instances, uncertainty sampling which chooses instances the model is most uncertain about and an expected gradient length strategy based on convolutional neural networks (CNN). RESULTS For the hierarchical clustering performance, we achieved a F1-Score of 0.73 compared to scikit-learn’s of 0.76. Concerning active learning performance, after 200 chosen training samples based on these strategies, the weighted F1-Score over all MeSH codes resulted in satisfying 0.62 F1-Score of our approach, compared to 0.61 of the uncertainty strategy, 0.61 the CNN and 0.45 the random strategy. Moreover, our methodology showed a constant low memory use with increased number of documents but increased execution time. CONCLUSIONS We proposed an easy to use tool for experts and non-experts being able to combine domain knowledge with topic exploration and target specific topics of interest while improving transparency. Furthermore our approach is very memory efficient and highly parallelizable making it interesting for large Big Data sets. This approach can be used by health professionals to rapidly get deep insights into biomedical literature to ultimately improve the evidence-based clinical decision making process.

Download Full-text

Content Noise Detection Model Using Deep Learning in Web Forums

Sustainability ◽

10.3390/su12125074 ◽

2020 ◽

Vol 12 (12) ◽

pp. 5074

Author(s):

Jiyoung Woo ◽

Jaeseok Yun

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Model ◽

Detection Model ◽

Proposed Model ◽

Web Forum ◽

Web Forums ◽

Conventional Machine ◽

Text Features ◽

Deep Learning Model

Spam posts in web forum discussions cause user inconvenience and lower the value of the web forum as an open source of user opinion. In this regard, as the importance of a web post is evaluated in terms of the number of involved authors, noise distorts the analysis results by adding unnecessary data to the opinion analysis. Here, in this work, an automatic detection model for spam posts in web forums using both conventional machine learning and deep learning is proposed. To automatically differentiate between normal posts and spam, evaluators were asked to recognize spam posts in advance. To construct the machine learning-based model, text features from posted content using text mining techniques from the perspective of linguistics were extracted, and supervised learning was performed to distinguish content noise from normal posts. For the deep learning model, raw text including and excluding special characters was utilized. A comparison analysis on deep neural networks using the two different recurrent neural network (RNN) models of the simple RNN and long short-term memory (LSTM) network was also performed. Furthermore, the proposed model was applied to two web forums. The experimental results indicate that the deep learning model affords significant improvements over the accuracy of conventional machine learning associated with text features. The accuracy of the proposed model using LSTM reaches 98.56%, and the precision and recall of the noise class reach 99% and 99.53%, respectively.

Download Full-text