Machine Learning of Synthetic Biological Sequences for Designer Attribution

2019 ◽  
Author(s):  
Craig Howser ◽  
Claire Marie Filone ◽  
Jessica S Dymond ◽  
Brant Chee ◽  
Joseph Downs ◽  
...  

Advances in genome editing and gene synthesis technologies have increased the ease with which biological agents can be engineered. Existing methods to identify the engineering source are insufficient for attribution. We hypothesized that strategies used for DNA design and optimization could act as identifiable fingerprints of design software or particular vendors, making engineered agents more attributable to their source. To test this hypothesis, sequences optimized using various gene synthesis vendors were characterized using a machine learning model. By capturing optimization signatures unique to each vendor, the trained model was able to identify a sequence's origin with an accuracy of up to 92%, indicating that it is possible to distinguish the algorithm used to optimize a genetic sequence based on the DNA sequence output alone.
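The abstract does not say which features the model used. As a minimal sketch, assuming codon usage is one plausible "optimization signature", the snippet below computes relative codon frequencies for two hypothetical encodings of the same peptide. The function name `codon_frequencies` and both example sequences are illustrative and do not come from the paper.

```python
from collections import Counter


def codon_frequencies(seq):
    """Return the relative frequency of each codon in an in-frame coding sequence."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return {}
    counts = Counter(codons)
    return {codon: n / len(codons) for codon, n in counts.items()}


# Two hypothetical encodings of the same peptide (Met-Leu-Leu-Leu-stop).
# They differ only in which synonymous leucine codons were chosen.
variant_a = "ATGCTGCTGCTGTAA"
variant_b = "ATGTTATTACTGTAA"

freq_a = codon_frequencies(variant_a)
freq_b = codon_frequencies(variant_b)
```

Frequency vectors like `freq_a` and `freq_b` could then be passed to any standard classifier. Whether the authors used this representation is an assumption here.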

2018 ◽  
Author(s):  
Steen Lysgaard ◽  
Paul C. Jennings ◽  
Jens Strabo Hummelshøj ◽  
Thomas Bligaard ◽  
Tejs Vegge

A machine learning model is used as a surrogate fitness evaluator in a genetic algorithm (GA) optimization of the atomic distribution of Pt-Au nanoparticles. The machine learning accelerated genetic algorithm (MLaGA) yields a 50-fold reduction of required energy calculations compared to a traditional GA.
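The idea of a surrogate fitness evaluator can be sketched with a toy problem. This sketch makes several assumptions that are not from the paper: the "energy" is a made-up function on a ring of two-species sites, the surrogate is a simple 1-nearest-neighbour lookup, and all function names and parameters are hypothetical. It shows only the control flow: many candidates are ranked cheaply, and only the most promising one per generation gets a true energy evaluation. It does not attempt to reproduce the reported 50-fold reduction.

```python
import random


def true_energy(bits):
    """Toy stand-in for an expensive energy calculation (hypothetical):
    every unlike neighbour pair on a ring lowers the energy by 1."""
    n = len(bits)
    return -sum(bits[i] != bits[(i + 1) % n] for i in range(n))


def surrogate_energy(bits, archive):
    """Predict energy as that of the nearest evaluated structure (Hamming distance)."""
    nearest = min(archive, key=lambda known: sum(a != b for a, b in zip(bits, known)))
    return archive[nearest]


def mutate(bits, rng):
    """Flip the species at one randomly chosen site."""
    i = rng.randrange(len(bits))
    return bits[:i] + (1 - bits[i],) + bits[i + 1:]


def crossover(a, b, rng):
    """Single-point crossover of two parent structures."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def surrogate_ga(n_sites=12, pop_size=6, generations=15, candidates=30, seed=0):
    rng = random.Random(seed)
    archive = {}  # structure -> true energy; its size counts true evaluations
    while len(archive) < pop_size:
        s = tuple(rng.randint(0, 1) for _ in range(n_sites))
        archive[s] = true_energy(s)
    initial_best = min(archive.values())

    for _generation in range(generations):
        parents = sorted(archive, key=archive.get)[:pop_size]
        pool = set()
        for _candidate in range(candidates):
            child = crossover(rng.choice(parents), rng.choice(parents), rng)
            child = mutate(child, rng)
            if child not in archive:
                pool.add(child)
        if not pool:
            continue
        # Rank all candidates with the cheap surrogate, then spend one
        # true evaluation on the candidate it predicts to be best.
        best_guess = min(sorted(pool), key=lambda c: surrogate_energy(c, archive))
        archive[best_guess] = true_energy(best_guess)

    return initial_best, min(archive.values()), len(archive)


initial, best, n_evals = surrogate_ga()
```

With the default settings, up to 30 candidates are screened per generation but at most one is evaluated with `true_energy`, which is where the savings in a surrogate-assisted search come from.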


Author(s):  
Dhilsath Fathima.M ◽  
S. Justin Samuel ◽  
R. Hari Haran

Aim: An improved and robust machine learning model for predicting myocardial infarction (MI) could have substantial clinical impact; this work develops such a model. Objectives: This paper explains how to build a machine learning based computer-aided analysis system for early and accurate prediction of MI, using the Framingham Heart Study dataset for validation and evaluation. The proposed model is intended to support medical professionals in predicting myocardial infarction. Methods: The proposed model uses mean imputation to fill in missing values in the data set, then applies principal component analysis (PCA) to extract features and improve classifier performance. After PCA, the reduced features are partitioned into training and testing sets: 70% of the data is used to train four widely used classifiers (support vector machine, k-nearest neighbor, logistic regression, and decision tree), and the remaining 30% is used to evaluate the models using a confusion matrix, classifier accuracy, precision, sensitivity, F1-score, and the AUC-ROC curve. Results: The classifier outputs are evaluated using these performance measures. Logistic regression provides higher accuracy than the k-NN, SVM, and decision tree classifiers, and PCA performs well as a feature extraction method for the proposed model. From these analyses, we conclude that logistic regression has a good mean accuracy and standard deviation of accuracy compared with the other three algorithms. The AUC-ROC curves in Figures 4 and 5 show that logistic regression achieves a good AUC-ROC score, around 70%, compared to the k-NN and decision tree algorithms.
Conclusion: From the result analysis, we infer that the proposed machine learning model can act as an optimal decision-making system that predicts acute myocardial infarction at an earlier stage than existing machine learning based prediction models. It can predict the presence of acute myocardial infarction in humans from heart disease risk factors, to help decide when to start lifestyle modification and medical treatment to prevent heart disease.
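Three of the steps named in the Methods (mean imputation, the 70/30 split, and confusion-matrix metrics) can be sketched with the Python standard library alone. This is an illustration under stated assumptions, not the authors' code: the function names are hypothetical, missing values are assumed to be encoded as `None`, and the PCA and classifier stages are omitted because they would normally come from a numerical library.

```python
import random


def mean_impute(rows):
    """Replace None entries in each column with the mean of that column's observed values."""
    n_cols = len(rows[0])
    means = []
    for j in range(n_cols):
        observed = [r[j] for r in rows if r[j] is not None]
        means.append(sum(observed) / len(observed))
    return [[means[j] if r[j] is None else r[j] for j in range(n_cols)] for r in rows]


def train_test_split(rows, labels, train_fraction=0.7, seed=0):
    """Shuffle indices, then split rows and labels into train and test parts."""
    idx = list(range(len(rows)))
    random.Random(seed).shuffle(idx)
    cut = int(round(train_fraction * len(idx)))
    train, test = idx[:cut], idx[cut:]
    return ([rows[i] for i in train], [labels[i] for i in train],
            [rows[i] for i in test], [labels[i] for i in test])


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, sensitivity (recall) and F1 from confusion-matrix counts."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if precision + sensitivity else 0.0)
    return {"accuracy": (tp + tn) / len(y_true), "precision": precision,
            "sensitivity": sensitivity, "f1": f1}


# Made-up example values, not data from the Framingham Heart Study.
imputed = mean_impute([[1.0, None], [3.0, 4.0]])
X = [[float(i)] for i in range(10)]
y = [i % 2 for i in range(10)]
X_train, y_train, X_test, y_test = train_test_split(X, y)
metrics = binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
```

In practice the column means would be computed on the training portion only and then applied to the test portion, so that no information from the test set leaks into preprocessing. The abstract does not say which order the authors used.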


Author(s):  
Dhaval Patel ◽  
Shrey Shrivastava ◽  
Wesley Gifford ◽  
Stuart Siegel ◽  
Jayant Kalagnanam ◽  
...  

Author(s):  
Juan C. Olivares-Rojas ◽  
Enrique Reyes-Archundia ◽  
Noel E. Rodriiguez-Maya ◽  
Jose A. Gutierrez-Gnecchi ◽  
Ismael Molina-Moreno ◽  
...  
