Machine learning model to predict recurrent ulcer bleeding in patients with history of idiopathic gastroduodenal ulcer bleeding

2019 ◽  
Vol 49 (7) ◽  
pp. 912-918 ◽  
Author(s):  
Grace Lai-Hung Wong ◽  
Andy Jinhua Ma ◽  
Huiqi Deng ◽  
Jessica Yuet-Ling Ching ◽  
Vincent Wai-Sun Wong ◽  
...  
PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e10381
Author(s):  
Rohit Nandakumar ◽  
Valentin Dinu

Throughout the history of drug discovery, an enzymatic-based approach for identifying new drug molecules has been primarily utilized. Recently, protein–protein interfaces that can be disrupted to identify small molecules that could be viable targets for certain diseases, such as cancer and the human immunodeficiency virus, have been identified. Existing studies computationally identify hotspots on these interfaces, with most models attaining accuracies of ~70%. Many studies do not effectively integrate information relating to amino acid chains and other structural information relating to the complex. Herein, (1) a machine learning model has been created and (2) its ability to integrate multiple features, such as those associated with amino-acid chains, has been evaluated to enhance the ability to predict protein–protein interface hotspots. Virtual drug screening analysis of a set of hotspots determined on the EphB2-ephrinB2 complex has also been performed. The predictive capabilities of this model offer an AUROC of 0.842, sensitivity/recall of 0.833, and specificity of 0.850. Virtual screening of a set of hotspots identified by the machine learning model developed in this study has identified potential medications to treat diseases caused by the overexpression of the EphB2-ephrinB2 complex, including prostate, gastric, colorectal and melanoma cancers which are linked to EphB2 mutations. The efficacy of this model has been demonstrated through its successful ability to predict drug-disease associations previously identified in literature, including cimetidine, idarubicin, pralatrexate for these conditions. In addition, nadolol, a beta blocker, has also been identified in this study to bind to the EphB2-ephrinB2 complex, and the possibility of this drug treating multiple cancers is still relatively unexplored.


However, oftentimes people just search a restaurant by using word “restaurant”, while the word “restaurant” means differently to different individuals. For an Asian, it can mean a “Chinese restaurant” or “Thai restaurant”. How to correctly interpret search requests based on people’s preference is a challenge. Building a machine-learning model based on activity history of a registered user can solve this problem. The activity histories used by this research are reviews and ratings from users. This project introduces a data processing pipeline, which uses reviews from registered users to generate a machine-learning model for each registered user. This project also defines an architecture, which uses the generated machine-learning models to support real-time personalized recommendations for restaurant searching and type of foods good at those recommended restaurants. Finally, this project aims to develop a good machine learning model, different collaborative filtering methodologies are considered to predict restaurants using user ratings. Slope One, k-Nearest Neighbors algorithm and multiclass SVM classification are some of the collaborating methodologies are going to consider in this project.


2016 ◽  
Vol 23 (3) ◽  
Author(s):  
I. I. Dutka ◽  
F. V. Grynchuk

Despite the advances in endoscopic haemostasis, the incidence of recurrent ulcer bleeding remains to be high. It necessitates further search for its prognosis and methods of treatment.The objective of the research was to analyse risk factors for recurrent gastroduodenal ulcer bleeding.Materials and methods. The study included 203 patients with gastroduodenal ulcer bleeding. There were 135 (66.5%) males and 68 (33.3%) females. All the patients were examined and received conservative treatment according to treatment protocols.Results. Duodenal ulcer was diagnosed in 127 (62.3%) patients, gastric ulcer was found in 68 (33.3%) patients, gastroduodenal ulcer was seen in 9 (4.4%) patients. The recurrence of bleeding was observed in 24 (11.8%) cases. Most cases of recurrent bleeding (n=11 (45.8%) occurred within 2-3 days after the admission. 9 (37.5%) patients developed the recurrence of bleeding later. The lowest number of recurrent bleeding occurred within the first day - 4 (16.7%) cases. The incidence of recurrent bleeding was higher in men rather than in women - 17 (70.8%). Recurrent bleeding was observed in 9 (64.29%) patients with blood type O; 4 (28.57%) patients with blood type A; 1 (7.14%) patient with blood type B; 1 (7.14%) patient with blood type AB. The majority of recurrences (n=15 (62.5%) occurred in patients without ulcer in anamnesis. There was found no clear connection between ulcer location and the rate of recurrent bleeding.Conclusions.The scales of predicting recurrent bleeding that are known today do not consider a number of important clinical and pathogenetic factors as a basis of recurrence.The improvement of the results of treating bleeding ulcers is possible only on the basis of the complex of factors determining the effectiveness of regeneration.


Blood ◽  
2019 ◽  
Vol 134 (Supplement_1) ◽  
pp. 1659-1659
Author(s):  
Srdan Verstovsek ◽  
Valerio De Stefano ◽  
Florian H. Heidel ◽  
Mike Zuurman ◽  
Michael Zaiac ◽  
...  

Introduction: Thromboembolic events (TEs) are one of the most prevalent complications in patients (pts) with polycythemia vera (PV). This real-world evidence study of the US OPTUM database evaluated the incidence of TEs in hydroxyurea (HU)-treated PV pts who either switched to ruxolitinib (RUX) after initial treatment (Tx) with HU (HU-RUX group) or continued HU Tx without switching (HU-alone group). Machine learning was then used to build a precise and scientifically robust model to predict the occurrence of TEs in PV pts with/without a history of TEs and HU failure (defined by either European LeukemiaNet [ELN] hematologic criteria or TEs). Methods: The OPTUM database comprises claims data and electronic medical records from 90 million pts (2007-2017, median stay in the database=7 years), including 69,464 PV pts. To avoid any selection bias during comparison, only pts treated prior to the RUX market launch were included in the HU-alone group (HU-RUX, n=81; HU-alone, n=195). Due to unavailability of Tx duration, time difference between the first and the last prescription was used as a proxy, and overall Tx duration was matched in both groups. TEs were assessed before Tx initiation in both groups. For HU-RUX pts, it was also assessed while on HU (median duration 27 months) and on RUX (median duration 14 months). For HU-alone pts, it was assessed during the first 27 months of Tx (any pt included in the analysis was treated for longer than this due to duration matching) and during remaining period of Tx (median duration 14 months). TEs were identified by either a restrictive definition (a list of ICD codes containing keywords from the RESPONSE study was automatically generated and manually curated) or a less restrictive one (list of ICD codes was manually expanded to include any TEs matching those from the GEMFIN study). PV pts who were exclusively treated with HU for ≥6 months were selected (n=2057) for modeling. Outcomes to be predicted were TEs in the 12 months following the end of the 6-month HU Tx period, and HU failure within 3 months of Tx. A logistic regression model was used for prediction. The baseline features extracted from the database included median lab parameters (3-6 months after HU initiation), history of thrombosis prior to primary diagnosis of PV, sociological features (age, gender), comorbidities, and concomitant medications (from inpatient/outpatient tables). Performance assessment methods included Receiver Operating Characteristic-Area Under the Curve (ROC-AUC) in early stages and confusion matrix in later stages; the findings were converted to clinically interpretable decision-tree classification algorithms. Results: Based on the extensive definition, the annual incidence of TEs in the HU-RUX and HU-alone groups, respectively, was 9% and 7% before HU initiation, which increased to 17% and 13% on HU Tx. The small difference in baseline incidence may reflect residual differences between the two groups. After a median duration of 14 months, the incidence of TEs decreased to 15% in pts who switched to RUX vs an increase to 20% in pts who continued HU Tx. A similar trend was observed using less restrictive definition (Figure 1). This definition resulted in a substantially increased incidence of TEs and a decreased predictive power of the machine-learning model. Using modeling, decision trees were developed to predict the occurrence of TEs in PV pts with/without a history of TEs. Lymphocyte percentage (<17%) and red cell distribution width (RDW; <15%) were predictors in pts without a history of TEs, whereas lymphocyte percentage (>13%) and platelet count (>393x103/µL) were predictors in pts with a history of TEs (Figure 2). Based on the decision tree developed to predict HU failure, phlebotomy-dependent pts with >15% RDW had a higher risk of HU failure within 3 months of Tx (Figure 3). Conclusions: A reduction in the incidence of TEs was observed in pts switching to RUX vs those who continued HU Tx. Based on the findings from this machine-learning model in PV pts, phlebotomy dependency and RDW were indicated as predictors of HU Tx failure within 3 months, whereas lymphocyte percentage+platelet count and lymphocyte percentage+RDW were predictors of incidence of TEs in pts with and without a history of TEs, respectively. Non-adjustment of the results for antiplatelet/anticoagulant Tx was a study limitation. Further validation of this machine-learning model is planned in other European databases. Disclosures Verstovsek: Celgene: Consultancy, Research Funding; Gilead: Research Funding; Promedior: Research Funding; CTI BioPharma Corp: Research Funding; Genetech: Research Funding; Protaganist Therapeutics: Research Funding; Constellation: Consultancy; Pragmatist: Consultancy; Incyte: Research Funding; Roche: Research Funding; NS Pharma: Research Funding; Blueprint Medicines Corp: Research Funding; Novartis: Consultancy, Research Funding; Sierra Oncology: Research Funding; Pharma Essentia: Research Funding; Astrazeneca: Research Funding; Ital Pharma: Research Funding. De Stefano:Celgene: Consultancy, Honoraria, Speakers Bureau; Janssen: Consultancy, Honoraria, Speakers Bureau; Amgen: Consultancy, Honoraria, Speakers Bureau; Novartis: Consultancy, Honoraria, Research Funding, Speakers Bureau; Alexion: Consultancy, Honoraria, Speakers Bureau. Heidel:Novartis: Consultancy, Honoraria, Research Funding; Celgene: Consultancy; CTI: Consultancy. Zuurman:Novartis Pharma B.V.: Employment. Zaiac:Novartis: Employment, Equity Ownership. Bigan:Novartis: Consultancy. Ruhl:Novartis: Consultancy. Meier:Novartis: Consultancy. Kiladjian:Celgene: Consultancy; Novartis: Honoraria, Research Funding; AOP Orphan: Honoraria, Research Funding.


2018 ◽  
Author(s):  
Steen Lysgaard ◽  
Paul C. Jennings ◽  
Jens Strabo Hummelshøj ◽  
Thomas Bligaard ◽  
Tejs Vegge

A machine learning model is used as a surrogate fitness evaluator in a genetic algorithm (GA) optimization of the atomic distribution of Pt-Au nanoparticles. The machine learning accelerated genetic algorithm (MLaGA) yields a 50-fold reduction of required energy calculations compared to a traditional GA.


Author(s):  
Dhilsath Fathima.M ◽  
S. Justin Samuel ◽  
R. Hari Haran

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.


2019 ◽  
Vol 156 (6) ◽  
pp. S-62-S-63
Author(s):  
Louis Ho Shing Lau ◽  
Jessica Y. Ching ◽  
Yee Kit Tse ◽  
Rachel Ling ◽  
Francis K. Chan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document