Interpretable genotype-to-phenotype classifiers with performance guarantees

Mapping Intimacies ◽

10.1101/388348 ◽

2018 ◽

Cited By ~ 1

Author(s):

Alexandre Drouin ◽

Gaël Letarte ◽

Frédéric Raymond ◽

Mario Marchand ◽

Jacques Corbeil ◽

...

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Resistance Mechanisms ◽

Machine Learning Algorithms ◽

Health Concern ◽

Computationally Efficient ◽

Performance Guarantees ◽

Improve Model ◽

A Cell ◽

Interpretable Models

ABSTRACTUnderstanding the relationship between the genome of a cell and its phenotype is a central problem in precision medicine. Nonetheless, genotype-to-phenotype prediction comes with great challenges for machine learning algorithms that limit their use in this setting. The high dimensionality of the data tends to hinder generalization and challenges the scalability of most learning algorithms. Additionally, most algorithms produce models that are complex and difficult to interpret. We alleviate these limitations by proposing strong performance guarantees, based on sample compression theory, for rule-based learning algorithms that produce highly interpretable models. We show that these guarantees can be leveraged to accelerate learning and improve model interpretability. Our approach is validated through an application to the genomic prediction of antimicrobial resistance, an important public health concern. Highly accurate models were obtained for 12 species and 56 antibiotics, and their interpretation revealed known resistance mechanisms, as well as some potentially new ones. An open-source disk-based implementation that is both memory and computationally efficient is provided with this work. The implementation is turnkey, requires no prior knowledge of machine learning, and is complemented by comprehensive tutorials.

Download Full-text

Machine Learning Algorithms To Improve Model Accuracy and Latency, and Human-Autonomy Teaming

2018 Modeling and Simulation Technologies Conference ◽

10.2514/6.2018-4063 ◽

2018 ◽

Cited By ~ 1

Author(s):

Vincent E. Houston ◽

Bryan Barrows ◽

Walter Manuel ◽

Lisa R. Le Vie

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Model Accuracy ◽

Improve Model ◽

Human Autonomy

Download Full-text

Withdrawal: Machine Learning Algorithms To Improve Model Accuracy and Latency, and Human-Autonomy Teaming

2018 Modeling and Simulation Technologies Conference ◽

10.2514/6.2018-4063.c1 ◽

2018 ◽

Author(s):

Vincent E. Houston ◽

Bryan Barrows ◽

Walter Manuel ◽

Lisa R. Le Vie

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Model Accuracy ◽

Improve Model ◽

Human Autonomy

Download Full-text

High-dimensional hepatopath data analysis by machine learning for predicting HBV-related fibrosis

Scientific Reports ◽

10.1038/s41598-021-84556-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Xiangke Pu ◽

Danni Deng ◽

Chaoyi Chu ◽

Tianle Zhou ◽

Jianhong Liu

Keyword(s):

Machine Learning ◽

Liver Fibrosis ◽

Clinical Data ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Health Concern ◽

High Dimensional ◽

Non Invasive ◽

Predictive Algorithms ◽

Invasive Method

AbstractChronic HBV infection, the main cause of liver cirrhosis and hepatocellular carcinoma, has become a global health concern. Machine learning algorithms are particularly adept at analyzing medical phenomenon by capturing complex and nonlinear relationships in clinical data. Our study proposed a predictive model on the basis of 55 routine laboratory and clinical parameters by machine learning algorithms as a novel non-invasive method for liver fibrosis diagnosis. The model was further evaluated on the accuracy and rationality and proved to be highly accurate and efficient for the prediction of HBV-related fibrosis. In conclusion, we suggested a potential combination of high-dimensional clinical data and machine learning predictive algorithms for the liver fibrosis diagnosis.

Download Full-text

Classical Machine Learning Algorithms and Shallower Convolutional Neural Networks Towards Computationally Efficient and Accurate Classification of Malaria Parasites

Communications in Computer and Information Science - Information and Communication Technology for Development for Africa ◽

10.1007/978-3-030-26630-1_5 ◽

2019 ◽

pp. 46-56

Author(s):

Yaecob Girmay Gezahegn ◽

Abel Kahsay Gebreslassie ◽

Maarig Aregawi Hagos ◽

Achim Ibenthal ◽

Eneyew Adugna Etsub

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Convolutional Neural Networks ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Computationally Efficient ◽

Malaria Parasites

Download Full-text

Supplemental Material for One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis

Psychological Methods ◽

10.1037/met0000262.supp ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Number Of Factors

Download Full-text

Forecasting US movies box office performances in Turkey using machine learning algorithms

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189120 ◽

2020 ◽

Vol 39 (5) ◽

pp. 6579-6590

Author(s):

Sandy Çağlıyor ◽

Başar Öztayşi ◽

Selime Sezgin

Keyword(s):

Machine Learning ◽

Global Economy ◽

Learning Algorithms ◽

Forecast Model ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

High Stakes ◽

Box Office ◽

Industry Forecast ◽

The Impact

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performances and the industry still struggles to predict box office performances in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. From various sources, a dataset of 1559 movies is constructed. Firstly, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristic. The number of attendances is discretized into three classes. Four popular machine learning algorithms, artificial neural networks, decision tree regression and gradient boosting tree and random forest are employed, and the impact of each group is observed by compared by the performance models. Then the number of target classes is increased into five and eight and results are compared with the previously developed models in the literature.

Download Full-text

Intelligent system of English composition scoring model based on improved machine learning algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189235 ◽

2020 ◽

pp. 1-11

Author(s):

Jie Liu ◽

Lin Lin ◽

Xiufang Liang

Keyword(s):

Machine Learning ◽

Evaluation System ◽

Intelligent System ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Assessment System ◽

English Composition ◽

Region Extraction ◽

Constraint Model

The online English teaching system has certain requirements for the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is to score the English composition through the intelligent model. In order to improve the intelligence of English composition scoring, based on machine learning algorithms, this study combines intelligent image recognition technology to improve machine learning algorithms, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, in order to verify whether the algorithm model proposed in this paper meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model proposed in this study is analyzed through design experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The research results show that the algorithm proposed in this paper has a certain practical effect, and it can be applied to the English assessment system and the online assessment system of the homework evaluation system algorithm system.

Download Full-text

The Unlearnable Checkerboard Pattern

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.1.2.holloway.1 ◽

2019 ◽

Vol 1 (2) ◽

pp. 78-80

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Checkerboard Pattern ◽

Simple Task

Detecting some patterns is a simple task for humans, but nearly impossible for current machine learning algorithms. Here, the "checkerboard" pattern is examined, where human prediction nears 100% and machine prediction drops significantly below 50%.

Download Full-text

On Detecting Wi-Fi Unauthorized Access Utilizing Software Define Network (SDN) and Machine Learning Algorithms

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v12i1.11020 ◽

2017 ◽

Vol 12 (1) ◽

pp. 21 ◽

Cited By ~ 1

Author(s):

Mohammad Masoud ◽

Yousef Jaradat ◽

Ismael Jannoud

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Unauthorized Access ◽

Software Define Network

Download Full-text

1290-P: Gut Microbiota in New-Onset Pediatric Patients with Type 1 Diabetes: Machine Learning Algorithms to Classify Microorganisms Disease-Linked

Diabetes ◽

10.2337/db20-1290-p ◽

2020 ◽

Vol 69 (Supplement 1) ◽

pp. 1290-P

Author(s):

GIUSEPPE D’ANNUNZIO ◽

ROBERTO BIASSONI ◽

MARGHERITA SQUILLARIO ◽

ELISABETTA UGOLOTTI ◽

ANNALISA BARLA ◽

...

Keyword(s):

Machine Learning ◽

Type 1 Diabetes ◽

Gut Microbiota ◽

Pediatric Patients ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

New Onset

Download Full-text