Similarity-based machine learning support vector machine predictor of drug-drug interactions with improved accuracies

2018 ◽  
Vol 44 (2) ◽  
pp. 268-275 ◽  
Author(s):  
Dalong Song ◽  
Yao Chen ◽  
Qian Min ◽  
Qingrong Sun ◽  
Kai Ye ◽  
...  
2020 ◽  
Vol 5 (2) ◽  
Author(s):  
Adinda miftahul Ilmi Habiba ◽  
Agi Prasetiadi ◽  
Cepi Ramdani

Penelitian ini untuk mengetahui kualitas kesehatan terumbu karang disuatu wilayah di Indonesia dengan mengambil beberapa faktor seperti wisatawan yang datang, latitude, longtitude, suhu, tahun, populasi warga, jumlah pemuda, dan jumlah industri, dan metode yang digunakan adalah machine learning dengan algoritma K-Nearest Neighbor, Support Vector Machine, dan Ensemble Classifier, untuk ensemble menggunkan randomforest untuk mengambil cabang-cabang pohon atau fitur keputusan yang paling relevan dengan output, penelitian ini diharapkan bisa menjadi acuan bagi wilayah yang kondisi terumbu karangnya masih kurang baik dapat mencontoh wilayah yang kondisi terumbu karangnya sudah baik dengan melihat faktor apa saja yang mempengaruhi terumbu karang disuatu wilayah itu masuk kategori baik. Hasil akhir dari penelitian ini pada algoritma K-Nearest Neighbor faktor yang berpengaruh bagi kesehatan terumbu karang yaitu wisatawan yang datang, latitude, longtitude, suhu, tahum dan pupulasi warga, sementara pada algoritma Support Vector Machine faktor yang berpengaruh wisatawan yang datang, Latitude, suhu dan tahun untuk algoritma Ensemble Classifier faktor yang berpengaruh wisatawan yang datang, latitude, longtitude, suhu dan jumlah industry, Pada kasus ini algoritma Support Vector Machine memiliki kinerja lebih baik dibandingkan K-Nearest Neighbor dan Ensemble Classifier.Kata Kunci: Ekosistem, Ensemble Classifier, K-Nearest Neighbor, Machine Learning, Support Vector Machine 


2019 ◽  
Vol 16 (2) ◽  
pp. 441-444
Author(s):  
D. V. Soundari ◽  
R. Padmapriya ◽  
C. Thirumariselvi ◽  
N. Nanthini ◽  
K. Priyadharsini

A woman majorly suffers due to breast cancer which is due to hormone imbalance. It leads to huge death in recent years. Early detection of the breast cancer is more important to prevent human lives. Image Processing plays an important to classify and detect the same. So this paper proposes machine learning based cancer classification using support vector machine with Wisconsin breast cancer data set.


2020 ◽  
Vol 7 (3) ◽  
pp. 320
Author(s):  
Favorisen R. Lumbanraja ◽  
Ira Hariati Br Sitepu ◽  
Didik Kurniawan ◽  
Aristoteles Aristoteles

<p><em>Tuberkulosis (TB atau TBC) merupakan salah satu penyakit infeksi yang disebabkan oleh Bakteri Mycobacterium tuberculosis. Bakteri tersebut merupakan bakteri yang sangat kuat sehingga dalam pengobatannya memerlukan waktu yang cukup lama. Pengobatan penyakit tuberkulosis dilakukan selama 6-9 bulan secara rutin dengan sedikitnya 3 macam jenis obat. Saat ini kebanyakan masyarakat menganggap batuk dalam jangka waktu berbulan-bulan merupakan batuk biasa, jika dicermati salah satu gejala yang ditimbulkan penyakit tuberkulosis, yaitu batuk dalam jangka waktu yang panjang. Pada penelitian ini digunakan data penderita tuberkulosis di Kota Bandar Lampung, data cuaca dan matrix jarak antara kejadian penderita tuberkulosis yang satu dengan kejadian yang lainnya dalam lingkup kecamatan. Jumlah dari keseluruhan data sebanyak 600 data dengan 44 variabel. Penelitian ini juga menggunakan 3 kernel yaitu, Linear, Gaussian, dan Polynomial dengan menggunakan Metode SVM dengan kernel Linear mendapatkan nilai rata-rata R<sup>2</sup> sebesar 51.43 %, pada percobaan dengan metode SVM dengan kernel Gaussian mendapatkan nilai rata-rata R<sup>2</sup> sebesar 58.53 % dan pada percobaan dengan metode SVM dengan kernel Polynomial mendapatkan nilai rata-rata R<sup>2</sup> sebesar 36.03 %.</em></p><p><strong><em>Kata Kunci</em></strong><em> : Prediksi penderita tuberculosis, tuberculosis, Machine Learning, Support Vector Machine.</em></p><p class="Abstrak"><em>Tuberculosis (TB / TBC) is one of infectious disease caused by Mycobacterium tuberculosis bacteria. These bacteria are very strong bacteria so for the treatment takes a long time. Tuberculosis treatment is carried out for 6-9 months regularly with at least 3 types of drugs. Currently, most of people consider a cough for months is a common cough, if looked by one of the symptoms caused by tuberculosis, which is a cough for a long time. In this research, data on tuberculosis patients in the city of Bandar Lampung were used, weather data and the distance matrix between the case of tuberculosis patients with other case within the district. The total number of data is 600 data with 44 variables. This research also uses 3 kernels</em><em> </em><em>namely, Linear, Gaussian, and Polynomial by using the SVM method with the Linear kernel getting an average R<sup>2</sup> value of 51.43%, in the experiment with the SVM method with a gaussian kernel getting an average R<sup>2</sup> value of 58.53% and at Experiments with the SVM method with the Polynomial kernel obtained an average value of R<sup>2</sup> of 36.03%</em><em> .</em></p><p class="Abstrak"><strong><em>Keywords</em></strong><em> : Prediction of tuberculosis sufferers, tuberculosis, Machine Learning, Support Vector Machine.</em></p>


2020 ◽  
Vol 25 (1) ◽  
pp. 24-38
Author(s):  
Eka Patriya

Saham adalah instrumen pasar keuangan yang banyak dipilih oleh investor sebagai alternatif sumber keuangan, akan tetapi saham yang diperjual belikan di pasar keuangan sering mengalami fluktuasi harga (naik dan turun) yang tinggi. Para investor berpeluang tidak hanya mendapat keuntungan, tetapi juga dapat mengalami kerugian di masa mendatang. Salah satu indikator yang perlu diperhatikan oleh investor dalam berinvestasi saham adalah pergerakan Indeks Harga Saham Gabungan (IHSG). Tindakan dalam menganalisa IHSG merupakan hal yang penting dilakukan oleh investor dengan tujuan untuk menemukan suatu trend atau pola yang mungkin berulang dari pergerakan harga saham masa lalu, sehingga dapat digunakan untuk memprediksi pergerakan harga saham di masa mendatang. Salah satu metode yang dapat digunakan untuk memprediksi pergerakan harga saham secara akurat adalah machine learning. Pada penelitian ini dibuat sebuah model prediksi harga penutupan IHSG menggunakan algoritma Support Vector Regression (SVR) yang menghasilkan kemampuan prediksi dan generalisasi yang baik dengan nilai RMSE training dan testing sebesar 14.334 dan 20.281, serta MAPE training dan testing sebesar 0.211% dan 0.251%. Hasil penelitian ini diharapkan dapat membantu para investor dalam mengambil keputusan untuk menyusun strategi investasi saham.


2021 ◽  
Vol 186 (Supplement_1) ◽  
pp. 445-451
Author(s):  
Yifei Sun ◽  
Navid Rashedi ◽  
Vikrant Vaze ◽  
Parikshit Shah ◽  
Ryan Halter ◽  
...  

ABSTRACT Introduction Early prediction of the acute hypotensive episode (AHE) in critically ill patients has the potential to improve outcomes. In this study, we apply different machine learning algorithms to the MIMIC III Physionet dataset, containing more than 60,000 real-world intensive care unit records, to test commonly used machine learning technologies and compare their performances. Materials and Methods Five classification methods including K-nearest neighbor, logistic regression, support vector machine, random forest, and a deep learning method called long short-term memory are applied to predict an AHE 30 minutes in advance. An analysis comparing model performance when including versus excluding invasive features was conducted. To further study the pattern of the underlying mean arterial pressure (MAP), we apply a regression method to predict the continuous MAP values using linear regression over the next 60 minutes. Results Support vector machine yields the best performance in terms of recall (84%). Including the invasive features in the classification improves the performance significantly with both recall and precision increasing by more than 20 percentage points. We were able to predict the MAP with a root mean square error (a frequently used measure of the differences between the predicted values and the observed values) of 10 mmHg 60 minutes in the future. After converting continuous MAP predictions into AHE binary predictions, we achieve a 91% recall and 68% precision. In addition to predicting AHE, the MAP predictions provide clinically useful information regarding the timing and severity of the AHE occurrence. Conclusion We were able to predict AHE with precision and recall above 80% 30 minutes in advance with the large real-world dataset. The prediction of regression model can provide a more fine-grained, interpretable signal to practitioners. Model performance is improved by the inclusion of invasive features in predicting AHE, when compared to predicting the AHE based on only the available, restricted set of noninvasive technologies. This demonstrates the importance of exploring more noninvasive technologies for AHE prediction.


2021 ◽  
Vol 13 (6) ◽  
pp. 3497
Author(s):  
Hassan Adamu ◽  
Syaheerah Lebai Lutfi ◽  
Nurul Hashimah Ahamed Hassain Malim ◽  
Rohail Hassan ◽  
Assunta Di Vaio ◽  
...  

Sustainable development plays a vital role in information and communication technology. In times of pandemics such as COVID-19, vulnerable people need help to survive. This help includes the distribution of relief packages and materials by the government with the primary objective of lessening the economic and psychological effects on the citizens affected by disasters such as the COVID-19 pandemic. However, there has not been an efficient way to monitor public funds’ accountability and transparency, especially in developing countries such as Nigeria. The understanding of public emotions by the government on distributed palliatives is important as it would indicate the reach and impact of the distribution exercise. Although several studies on English emotion classification have been conducted, these studies are not portable to a wider inclusive Nigerian case. This is because Informal Nigerian English (Pidgin), which Nigerians widely speak, has quite a different vocabulary from Standard English, thus limiting the applicability of the emotion classification of Standard English machine learning models. An Informal Nigerian English (Pidgin English) emotions dataset is constructed, pre-processed, and annotated. The dataset is then used to classify five emotion classes (anger, sadness, joy, fear, and disgust) on the COVID-19 palliatives and relief aid distribution in Nigeria using standard machine learning (ML) algorithms. Six ML algorithms are used in this study, and a comparative analysis of their performance is conducted. The algorithms are Multinomial Naïve Bayes (MNB), Support Vector Machine (SVM), Random Forest (RF), Logistics Regression (LR), K-Nearest Neighbor (KNN), and Decision Tree (DT). The conducted experiments reveal that Support Vector Machine outperforms the remaining classifiers with the highest accuracy of 88%. The “disgust” emotion class surpassed other emotion classes, i.e., sadness, joy, fear, and anger, with the highest number of counts from the classification conducted on the constructed dataset. Additionally, the conducted correlation analysis shows a significant relationship between the emotion classes of “Joy” and “Fear”, which implies that the public is excited about the palliatives’ distribution but afraid of inequality and transparency in the distribution process due to reasons such as corruption. Conclusively, the results from this experiment clearly show that the public emotions on COVID-19 support and relief aid packages’ distribution in Nigeria were not satisfactory, considering that the negative emotions from the public outnumbered the public happiness.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Florent Le Borgne ◽  
Arthur Chatton ◽  
Maxime Léger ◽  
Rémi Lenain ◽  
Yohann Foucher

AbstractIn clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statistical power. Machine learning is also increasingly used because of its possible robustness to model misspecification. In this paper, we aimed to propose an approach that combines machine learning and G-computation when both the outcome and the exposure status are binary and is able to deal with small samples. We evaluated the performances of several methods, including penalized logistic regressions, a neural network, a support vector machine, boosted classification and regression trees, and a super learner through simulations. We proposed six different scenarios characterised by various sample sizes, numbers of covariates and relationships between covariates, exposure statuses, and outcomes. We have also illustrated the application of these methods, in which they were used to estimate the efficacy of barbiturates prescribed during the first 24 h of an episode of intracranial hypertension. In the context of GC, for estimating the individual outcome probabilities in two counterfactual worlds, we reported that the super learner tended to outperform the other approaches in terms of both bias and variance, especially for small sample sizes. The support vector machine performed well, but its mean bias was slightly higher than that of the super learner. In the investigated scenarios, G-computation associated with the super learner was a performant method for drawing causal inferences, even from small sample sizes.


Sign in / Sign up

Export Citation Format

Share Document