Predicting Protein Producibility in Filamentous Fungi

Mapping Intimacies ◽

10.1101/138560 ◽

2017 ◽

Author(s):

Karmen L Dykstra ◽

Juho Rousu ◽

Mikko Arvas

Keyword(s):

Filamentous Fungi ◽

Predictive Performance ◽

Support Vector ◽

E Coli ◽

Machine Learning Methods ◽

Vector Machines ◽

Production Host ◽

Domain Information ◽

Protein Dataset ◽

Variable Performance

Abstract: In this paper we study the problem of predicting the producibility of recombinant proteins in filamentous fungi, especially T. reesei, using machine learning methods. We train supervised and semi-supervised support vector machines with protein sequences, represented by their amino acid composition as well as protein family and domain information. Our results indicate, somewhat surprisingly, that a quite modest number of proteins with experimental data is required to build a state-of-the-art classifier and that additional unlabeled sequences in semi-supervised models do not bring increased predictive performance. Our experiments in cross-species prediction show that models trained on the filamentous fungus A. niger protein dataset can be generalized to predict protein producibility in T. reesei, and vice versa, without sacrificing too much accuracy, despite their approximately 500 million years of divergence. However, predictors trained on E. coli and S. cerevisiae datasets gave variable performance when applied to the filamentous fungi datasets, indicating that while protein producibility prediction can be generalized across related species, fully generic prediction tools applicable to any protein production host may not be realistic to achieve.

Identifying Cancer Targets Based on Machine Learning Methods via Chou’s 5-steps Rule and General Pseudo Components

Current Topics in Medicinal Chemistry ◽

10.2174/1568026619666191016155543 ◽

2019 ◽

Vol 19 (25) ◽

pp. 2301-2317 ◽

Cited By ~ 2

Author(s):

Ruirui Liang ◽

Jiayang Xie ◽

Chi Zhang ◽

Mengying Zhang ◽

Hai Huang ◽

...

Keyword(s):

Machine Learning ◽

Growth Rate ◽

Big Data ◽

Human Genome Project ◽

Genome Project ◽

Support Vector ◽

Successful Implementation ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

In recent years, the successful implementation of the Human Genome Project has made people realize that genetic, environmental and lifestyle factors should be studied together in cancer research, owing to the complexity and various forms of the disease. The increasing availability and growth rate of ‘big data’ derived from various omics open a new window for the study and therapy of cancer. In this paper, we introduce the application of machine learning methods in handling cancer big data, including the use of artificial neural networks, support vector machines, ensemble learning and naïve Bayes classifiers.

Predictive modeling for peri-implantitis by using machine learning techniques

Scientific Reports ◽

10.1038/s41598-021-90642-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Tomoaki Mameno ◽

Masahiro Wada ◽

Kazunori Nozaki ◽

Toshihito Takahashi ◽

Yoshitaka Tsujioka ◽

...

Keyword(s):

Machine Learning ◽

Demographic Data ◽

Risk Indicators ◽

Machine Learning Techniques ◽

Support Vector ◽

Machine Learning Methods ◽

Complex Interactions ◽

Learning Techniques ◽

Increased Risk ◽

Vector Machines

Abstract: The purpose of this retrospective cohort study was to create a model for predicting the onset of peri-implantitis by using machine learning methods and to clarify interactions between risk indicators. This study evaluated 254 implants, 127 with and 127 without peri-implantitis, from among 1408 implants with at least 4 years in function. Demographic data and parameters known to be risk factors for the development of peri-implantitis were analyzed with three models: logistic regression, support vector machines, and random forests (RF). Of these, RF had the highest performance in predicting the onset of peri-implantitis (AUC: 0.71, accuracy: 0.70, precision: 0.72, recall: 0.66, and f1-score: 0.69). The factor that had the most influence on prediction was implant functional time, followed by oral hygiene. In addition, PCR of more than 50% to 60%, smoking more than 3 cigarettes/day, KMW less than 2 mm, and the presence of fewer than two occlusal supports tended to be associated with an increased risk of peri-implantitis. Moreover, these risk indicators were not independent and had complex effects on each other. The results of this study suggest that peri-implantitis onset was predicted in 70% of cases by RF, which allows consideration of nonlinear relational data with complex interactions.
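
As a small worked check of the metrics quoted above, the F1 score is the harmonic mean of precision and recall. The sketch below recomputes it from the reported precision (0.72) and recall (0.66). The function name `f1_score` is a hypothetical helper chosen for illustration and is not taken from the paper. Because the inputs are themselves rounded, this only checks approximate consistency.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported for the random forest model in the abstract above.
reported_precision = 0.72
reported_recall = 0.66

recomputed_f1 = f1_score(reported_precision, reported_recall)
print(round(recomputed_f1, 2))  # prints 0.69
```

The recomputed value agrees with the f1-score of 0.69 given in the abstract.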

The impact of different parameter sets on the classification of asteroid types

10.5194/epsc2021-807 ◽

2021 ◽

Author(s):

Hanna Klimczak ◽

Wojciech Kotłowski ◽

Dagmara Oszkiewicz ◽

Francesca DeMeo ◽

Agnieszka Kryszczyńska ◽

...

Keyword(s):

Gradient Boosting ◽

Support Vector ◽

Multilayer Perceptrons ◽

Machine Learning Methods ◽

Vector Machines ◽

Science Centre ◽

The Difference ◽

The Impact

The aim of the project is the classification of asteroids according to the most commonly used asteroid taxonomy (Bus-DeMeo et al. 2009) with the use of various machine learning methods such as Logistic Regression, Naive Bayes, Support Vector Machines, Gradient Boosting and Multilayer Perceptrons. Different parameter sets are used for classification in order to compare the quality of prediction with a limited amount of data, namely the difference in performance between using the 0.45 μm to 2.45 μm spectral range and multiple spectral features, as well as performing Principal Component Analysis to reduce the dimensions of the spectral data. This work has been supported by grant No. 2017/25/B/ST9/00740 from the National Science Centre, Poland.

Black Box Machine-Learning Methods: Neural Networks and Support Vector Machines

Data Science and Predictive Analytics ◽

10.1007/978-3-319-72347-1_11 ◽

2018 ◽

pp. 383-422 ◽

Cited By ~ 1

Author(s):

Ivo D. Dinov

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Support Vector Machines ◽

Black Box ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

Incorporating Amino Acids Composition and Functional Domains for Identifying Bacterial Toxin Proteins

BioMed Research International ◽

10.1155/2014/972692 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 2

Author(s):

Min-Gang Su ◽

Chien-Hsun Huang ◽

Tzong-Yi Lee ◽

Yu-Ju Chen ◽

Hsin-Yi Wu

Keyword(s):

Amino Acids ◽

Cell Biology ◽

Predictive Performance ◽

Computational Prediction ◽

Amino Acid Sequences ◽

Bacterial Toxins ◽

Bacterial Toxin ◽

Support Vector ◽

Functional Domain ◽

Domain Information

Aside from their role in pathogenesis, bacterial toxins have also been used for medical purposes, such as drugs for cancer and immune diseases. Correctly identifying bacterial toxins and their types (endotoxins and exotoxins) has great impact on cell biology studies and therapy development. However, experimental methods for bacterial toxin identification are time-consuming and labor-intensive, implying an urgent need for computational prediction. Thus, we are motivated to develop a method for computational identification of bacterial toxins based on amino acid sequences and functional domain information. In this study, a nonredundant dataset of 167 bacterial toxins, including 77 exotoxins and 90 endotoxins, is adopted to learn the predictive model by using support vector machines (SVMs). The cross-validation evaluation shows that the SVM models trained with amino acid and dipeptide composition yield an accuracy of 96.07% and 92.50%, respectively. For discriminating endotoxins from exotoxins, the SVM models trained with amino acid and dipeptide composition achieve an accuracy of 95.71% and 92.86%, respectively. After incorporating functional domain information, the predictive performance is further improved. The proposed method has been demonstrated to identify and classify bacterial toxins more effectively than the other two features on an independent dataset, which may aid in bacterial biomedical development.
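
The amino acid and dipeptide composition features mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names, the handling of non-standard residues (they are skipped) and the example sequence are all hypothetical. Amino acid composition gives a 20-dimensional frequency vector and dipeptide composition a 400-dimensional one, either of which could then be passed to an SVM.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs


def amino_acid_composition(seq: str) -> list[float]:
    """Frequency of each standard residue, in AMINO_ACIDS order."""
    counts = Counter(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def dipeptide_composition(seq: str) -> list[float]:
    """Frequency of each adjacent residue pair, in DIPEPTIDES order."""
    seq = seq.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    # Assumption: pairs containing a non-standard symbol are skipped.
    valid = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    if not valid:
        return [0.0] * len(DIPEPTIDES)
    counts = Counter(valid)
    return [counts[d] / len(valid) for d in DIPEPTIDES]


example = "MKTAYIAKQR"  # made-up sequence, for illustration only
aac = amino_acid_composition(example)
dpc = dipeptide_composition(example)
print(len(aac), len(dpc))  # prints 20 400
```

Both vectors sum to 1 for any sequence with at least two standard residues, so sequences of different lengths map to feature vectors of the same size and scale.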

Forecasting Electric Load by Support Vector Machines with Genetic Algorithms

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2005.p0134 ◽

2005 ◽

Vol 9 (2) ◽

pp. 134-141 ◽

Cited By ~ 4

Author(s):

Ping-Feng Pai ◽

Wei-Chiang Hong ◽

Chih-Shen Lin ◽

...

Keyword(s):

Genetic Algorithms ◽

Support Vector Machines ◽

Moving Average ◽

Predictive Performance ◽

Support Vector ◽

Electric Load ◽

Power Company ◽

Svm Model ◽

Vector Machines ◽

General Regression Neural Networks

Support vector machines (SVMs) have been successfully used in solving nonlinear regression and time series problems. However, the application of SVMs to load forecasting is very rare. Therefore, the purpose of this paper is to examine the feasibility of SVMs in forecasting electric load. In addition, genetic algorithms are applied to the parameter selection of the SVM model. Forecasting results are compared with those of two other models, namely autoregressive integrated moving average (ARIMA) and general regression neural networks (GRNN). The experimental data are obtained from the Taiwan Power Company. The numerical results indicate that the SVM model with genetic algorithms (SVMG) results in better predictive performance than the other two approaches.

Rescale-Invariant SVM for Binary Classification

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/348 ◽

2017 ◽

Cited By ~ 1

Author(s):

Mojtaba Montazery ◽

Nic Wilson

Keyword(s):

Machine Learning ◽

Decision Making ◽

Support Vector Machines ◽

Binary Classification ◽

Experimental Results ◽

Support Vector ◽

Computation Method ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

Support Vector Machines (SVM) are among the most well-known machine learning methods, with broad use in different scientific areas. However, one necessary pre-processing phase for SVM is normalization (scaling) of features, since SVM is not invariant to the scales of the features’ spaces, i.e., different ways of scaling may lead to different results. We define a more robust decision-making approach for binary classification, in which one sample strongly belongs to a class if it belongs to that class for all possible rescalings of features. We derive a way of characterising the approach for binary SVM that allows determining when an instance strongly belongs to a class and when the classification is invariant to rescaling. The characterisation leads to a computation method to determine whether one sample is strongly positive, strongly negative or neither. Our experimental results back up the intuition that being strongly positive suggests stronger confidence that an instance really is positive.
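
The dependence on feature scaling described above can be illustrated with the RBF kernel that kernel SVMs commonly use. The sketch below is an assumption-laden toy example and not the paper's method: the three points, the `gamma` value and the rescaling factor are made up. It only shows that rescaling one feature can reverse which of two points looks more similar to a third, which is the kind of change that can alter an SVM's decisions.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel exp(-gamma * ||x - y||^2) for equal-length sequences."""
    sq_dist = sum((xi - yi) ** 2 for xi, yi in zip(x, y))
    return math.exp(-gamma * sq_dist)


def rescale(point, factors):
    """Multiply each feature by its own scale factor."""
    return tuple(v * f for v, f in zip(point, factors))


# Made-up 2-feature points, for illustration only.
a, b, c = (1.0, 1.0), (1.2, 5.0), (3.0, 1.1)

# Original scale: feature 2 dominates the distance, so a is more similar to c.
before = (rbf_kernel(a, b), rbf_kernel(a, c))

# Shrink feature 2 by a factor of 100: now feature 1 dominates and a is
# more similar to b.
factors = (1.0, 0.01)
a2, b2, c2 = (rescale(p, factors) for p in (a, b, c))
after = (rbf_kernel(a2, b2), rbf_kernel(a2, c2))

print(before[0] < before[1], after[0] > after[1])  # prints True True
```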

Machine Learning Methods for Prediction of Food Effects on Bioavailability: A Comparison of Support Vector Machines and Artificial Neural Networks

European Journal of Pharmaceutical Sciences ◽

10.1016/j.ejps.2021.106018 ◽

2021 ◽

pp. 106018

Author(s):

Harriet Bennett-Lenane ◽

Brendan T. Griffin ◽

Joseph P. O'Shea

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Artificial Neural Networks ◽

Support Vector Machines ◽

Support Vector ◽

Food Effects ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines ◽

Artificial Neural

Advanced machine learning methods in psychiatry: an introduction

General Psychiatry ◽

10.1136/gpsych-2020-100197 ◽

2020 ◽

Vol 33 (2) ◽

pp. e100197 ◽

Cited By ~ 2

Author(s):

Tsung-Chin Wu ◽

Zhirou Zhou ◽

Hongyue Wang ◽

Bokai Wang ◽

Tuo Lin ◽

...

Keyword(s):

Mental Health ◽

Machine Learning ◽

Neural Networks ◽

Artificial Neural Networks ◽

Support Vector Machines ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines ◽

Artificial Neural

Mental health questions can be tackled through machine learning (ML) techniques. Apart from the two ML methods we introduced in our previous paper, we discuss two more advanced ML approaches in this paper: support vector machines and artificial neural networks. To illustrate how these ML methods have been employed in mental health, recent research applications in psychiatry are reported.

Result and Performance Analysis of Rainfall Prediction System Based on Deep Neural Network

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2063165 ◽

2020 ◽

pp. 633-638

Author(s):

Akshay Rajendra Naik ◽

A. V. Deorankar ◽

P. B. Ambhore

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Neural Network ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Methods ◽

Rainfall Prediction ◽

Machine Learning Methods ◽

Vector Machines ◽

And Performance

Rainfall prediction is useful for decision making in many fields, such as outdoor games, farming, travel and factory operations, among other activities. We studied various methods for rainfall prediction, such as machine learning and neural networks. Various machine learning algorithms have been used in previous methods, such as naïve Bayes, support vector machines, random forests, decision trees, and ensemble learning methods. We used a deep neural network for rainfall prediction, with the Adam optimizer used to set the model parameters; as a result, our method gives better results compared with other machine learning methods.
