An Improved Method for Splice Site Prediction in DNA Sequences Using Support Vector Machines

Accurate splice site prediction using support vector machines

BMC Bioinformatics ◽

10.1186/1471-2105-8-s10-s7 ◽

2007 ◽

Vol 8 (Suppl 10) ◽

pp. S7 ◽

Cited By ~ 93

Author(s):

Sören Sonnenburg ◽

Gabriele Schweikert ◽

Petra Philips ◽

Jonas Behr ◽

Gunnar Rätsch

Keyword(s):

Support Vector Machines ◽

Splice Site ◽

Support Vector ◽

Splice Site Prediction ◽

Site Prediction ◽

Vector Machines

Splice site prediction using support vector machines with a Bayes kernel

Expert Systems with Applications ◽

10.1016/j.eswa.2005.09.052 ◽

2006 ◽

Vol 30 (1) ◽

pp. 73-81 ◽

Cited By ~ 32

Author(s):

Y ZHANG ◽

C CHU ◽

Y CHEN ◽

H ZHA ◽

X JI

Keyword(s):

Support Vector Machines ◽

Splice Site ◽

Support Vector ◽

Splice Site Prediction ◽

Site Prediction ◽

Vector Machines

Accuracy test in identifying the splice site type of DNA sequences by using probabilistic neural networks and support vector machines

Malaysian Journal of Fundamental and Applied Sciences ◽

10.11113/mjfas.v1n1.10 ◽

2014 ◽

Vol 1 (1) ◽

Author(s):

Djati Kerami

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Support Vector Machines ◽

Splice Site ◽

DNA Sequences ◽

Support Vector ◽

Probabilistic Neural Networks ◽

Application Problem ◽

Vector Machines ◽

Accuracy Level

Probabilistic Neural Networks are known to compute very quickly and to give better accuracy than other types of neural networks on real-world application problems. In recent years, Support Vector Machines have become more popular than other machine learning models: they can be analyzed theoretically and achieve good performance at the same time. This paper describes the use of both machine learning methods to solve pattern recognition problems, with a preliminary case study on detecting the type of splice site in DNA sequences, focusing particularly on the accuracy level. The results show that Support Vector Machines achieve an accuracy of about 95%, compared with approximately 92% for Probabilistic Neural Networks.

Splicing-site recognition of rice (Oryza sativa L.) DNA sequences by support vector machines

Journal of Zhejiang University SCIENCE A ◽

10.1631/jzus.2003.0573 ◽

2003 ◽

Vol 4 (5) ◽

pp. 573-577

Author(s):

Peng Si-hua ◽

Fan Long-jiang ◽

Peng Xiao-ning ◽

Zhuang Shu-lin ◽

Du Wei ◽

...

Keyword(s):

Oryza Sativa ◽

Support Vector Machines ◽

DNA Sequences ◽

Oryza Sativa L ◽

Support Vector ◽

Vector Machines ◽

Site Recognition ◽

Splicing Site

An Improved Method for Multi-class Support Vector Machines

2010 International Conference on Measuring Technology and Mechatronics Automation ◽

10.1109/icmtma.2010.34 ◽

2010 ◽

Cited By ~ 6

Author(s):

Chaobin Liu ◽

Yuexiang Yang ◽

Chuan Tang

Keyword(s):

Support Vector Machines ◽

Support Vector ◽

Improved Method ◽

Vector Machines

Improving Training Speed of Support Vector Machines by Creating Exploitable Trends of Lagrangian Variables: An Application to DNA Splice Site Detection

2007 Frontiers in the Convergence of Bioscience and Information Technologies ◽

10.1109/fbit.2007.56 ◽

2007 ◽

Author(s):

Jason Li ◽

Saman K. Halgamuge

Keyword(s):

Support Vector Machines ◽

Splice Site ◽

Support Vector ◽

Lagrangian Variables ◽

Vector Machines

Human Splice Site Identification with Multiclass Support Vector Machines and Bagging

Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003 - Lecture Notes in Computer Science ◽

10.1007/3-540-44989-2_29 ◽

2003 ◽

pp. 234-241 ◽

Cited By ~ 3

Author(s):

Ana Carolina Lorena ◽

André C. P. L. F. de Carvalho

Keyword(s):

Support Vector Machines ◽

Splice Site ◽

Support Vector ◽

Vector Machines ◽

Site Identification ◽

Multiclass Support Vector Machines

Support Vector Machines for HIV-1 Protease Cleavage Site Prediction

Pattern Recognition and Image Analysis - Lecture Notes in Computer Science ◽

10.1007/11492542_51 ◽

2005 ◽

pp. 413-420

Author(s):

Loris Nanni ◽

Alessandra Lumini

Keyword(s):

Support Vector Machines ◽

Cleavage Site ◽

Support Vector ◽

Site Prediction ◽

Protease Cleavage Site ◽

Protease Cleavage ◽

Vector Machines ◽

Cleavage Site Prediction ◽

HIV-1

A novel artificial intelligence-based approach for identification of deoxynucleotide aptamers

PLoS Computational Biology ◽

10.1371/journal.pcbi.1009247 ◽

2021 ◽

Vol 17 (8) ◽

pp. e1009247

Author(s):

Frances L. Heredia ◽

Abiel Roche-Lima ◽

Elsie I. Parés-Matos

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Support Vector Machines ◽

DNA Binding ◽

DNA Sequences ◽

Support Vector ◽

Sequence Information ◽

DNA Aptamers ◽

Vector Machines ◽

Selection Of

The selection of a DNA aptamer through the Systematic Evolution of Ligands by EXponential enrichment (SELEX) method involves multiple binding steps, in which a target and a library of randomized DNA sequences are mixed for selection of a single, nucleotide-specific molecule. Usually, 10 to 20 steps are required for SELEX to be completed. Throughout this process it is necessary to discriminate between true DNA aptamers and unspecified DNA-binding sequences. Thus, a novel machine learning-based approach was developed to support and simplify the early steps of the SELEX process by helping to discriminate DNA aptamers from unspecified DNA-binding sequences. An Artificial Intelligence (AI) approach to identify aptamers was implemented based on Natural Language Processing (NLP) and Machine Learning (ML). An NLP method (CountVectorizer) was used to extract information from the nucleotide sequences. Four ML algorithms (Logistic Regression, Decision Tree, Gaussian Naïve Bayes, Support Vector Machines) were trained using data from the NLP method along with sequence information. The best-performing model was Support Vector Machines, because it had the best ability to discriminate between positive and negative classes. This model achieved an Accuracy (A) of 0.995, the fraction of samples that the model correctly classified, and an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.998, a measure of how well the model distinguishes between classes. The developed AI approach is useful for identifying potential DNA aptamers and so reducing the number of rounds in a SELEX selection. This new approach could be applied in the design of DNA libraries and result in a faster, more efficient process for choosing DNA aptamers during SELEX.
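The count-based feature extraction and the two metrics named in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the abstract names CountVectorizer but does not give its settings, so the overlapping k-mer tokenization, the choice of k = 3, and the function names `kmer_counts`, `vectorize`, `accuracy`, and `auroc` are all assumptions made for illustration, and the sketch uses only the Python standard library as a stand-in for a real vectorizer and classifier.

```python
from collections import Counter
from itertools import product


def kmer_counts(seq, k=3):
    """Count overlapping k-mers in a DNA sequence (bag-of-words style)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def vectorize(seqs, k=3):
    """Map each sequence to a count vector over all 4**k possible k-mers.

    Assumption: a fixed A/C/G/T vocabulary; k-mers containing other
    symbols (for example N) are ignored.
    """
    vocab = ["".join(p) for p in product("ACGT", repeat=k)]
    rows = []
    for s in seqs:
        counts = kmer_counts(s, k)
        rows.append([counts.get(kmer, 0) for kmer in vocab])
    return vocab, rows


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def auroc(y_true, scores):
    """AUROC as the probability that a random positive outscores a random
    negative, counting ties as one half (Mann-Whitney formulation)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

The rows returned by `vectorize` could be passed to any classifier, including a support vector machine; `accuracy` takes hard labels, while `auroc` takes the classifier's continuous scores, which is why the two metrics can differ for the same model.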

Learning to Generate Optimized Term Weighting for Web Documents Classification - A Parallel Mimetic Approach Based on Support Vector Machines

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v11i12.10964 ◽

2016 ◽

Vol 11 (12) ◽

pp. 1147

Author(s):

Abderrahmane Bendahmane ◽

Abdelkader Benyettou

Keyword(s):

Support Vector Machines ◽

Support Vector ◽

Term Weighting ◽

Web Documents ◽

Vector Machines
