Research on folding diversity in statistical learning methods for RNA secondary structure prediction

Yu Zhu; ZhaoYang Xie; YiZhou Li; Min Zhu; Yi-Ping Phoebe Chen

doi:10.7150/ijbs.24595

Research on folding diversity in statistical learning methods for RNA secondary structure prediction

International Journal of Biological Sciences ◽

10.7150/ijbs.24595 ◽

2018 ◽

Vol 14 (8) ◽

pp. 872-882 ◽

Cited By ~ 2

Author(s):

Yu Zhu ◽

ZhaoYang Xie ◽

YiZhou Li ◽

Min Zhu ◽

Yi-Ping Phoebe Chen

Keyword(s):

Secondary Structure ◽

Statistical Learning ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Learning Methods ◽

Rna Secondary Structure Prediction

Download Full-text

Deep Learning Method for RNA Secondary Structure Prediction with Pseudoknots Based on Large-Scale Data

Journal of Healthcare Engineering ◽

10.1155/2021/6699996 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Bowen Shen ◽

Hao Zhang ◽

Cong Li ◽

Tianheng Zhao ◽

Yuanning Liu

Keyword(s):

Deep Learning ◽

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Large Scale ◽

Secondary Structure Prediction ◽

Learning Methods ◽

Rna Secondary Structure Prediction ◽

Large Scale Data ◽

Scale Data

Traditional machine learning methods are widely used in the field of RNA secondary structure prediction and have achieved good results. However, with the emergence of large-scale data, deep learning methods have more advantages than traditional machine learning methods. As the number of network layers increases in deep learning, there will often be problems such as increased parameters and overfitting. We used two deep learning models, GoogLeNet and TCN, to predict RNA secondary results. And from the perspective of the depth and width of the network, improvements are made based on the neural network model, which can effectively improve the computational efficiency while extracting more feature information. We process the existing real RNA data through experiments, use deep learning models to extract useful features from a large amount of RNA sequence data and structure data, and then predict the extracted features to obtain each base’s pairing probability. The characteristics of RNA secondary structure and dynamic programming methods are used to process the base prediction results, and the structure with the largest sum of the probability of each base pairing is obtained, and this structure will be used as the optimal RNA secondary structure. We, respectively, evaluated GoogLeNet and TCN models based on 5sRNA, tRNA data, and tmRNA data, and compared them with other standard prediction algorithms. The sensitivity and specificity of the GoogLeNet model on the 5sRNA and tRNA data sets are about 16% higher than the best prediction results in other algorithms. The sensitivity and specificity of the GoogLeNet model on the tmRNA dataset are about 9% higher than the best prediction results in other algorithms. As deep learning algorithms’ performance is related to the size of the data set, as the scale of RNA data continues to expand, the prediction accuracy of deep learning methods for RNA secondary structure will continue to improve.

Download Full-text

Advancements in RNA Secondary Structure Prediction using Machine Learning Methods

2020 IEEE International Conference for Innovation in Technology (INOCON) ◽

10.1109/inocon50539.2020.9298293 ◽

2020 ◽

Author(s):

Shubham Mittal ◽

Yasha Hasija

Keyword(s):

Machine Learning ◽

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Learning Methods ◽

Rna Secondary Structure Prediction ◽

Machine Learning Methods

Download Full-text

Faculty Opinions recommendation of COFOLD: an RNA secondary structure prediction method that takes co-transcriptional folding into account.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718010599.793476797 ◽

2013 ◽

Author(s):

Scott Silverman

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Prediction Method ◽

Rna Secondary Structure Prediction ◽

Structure Prediction Method ◽

Secondary Structure Prediction Method

Download Full-text

A Discrete Hopfield Neural Network Based MIS Finding Algorithm for Stems Selecting and Its Application in RNA Secondary Structure Prediction

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2008.00051 ◽

2009 ◽

Vol 31 (1) ◽

pp. 51-58

Author(s):

Qi LIU ◽

Yin ZHANG ◽

Xiu-Zi YE ◽

Rong-Dong YU

Keyword(s):

Neural Network ◽

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Hopfield Neural Network ◽

Rna Secondary Structure Prediction

Download Full-text

A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more

RNA ◽

10.1261/rna.030049.111 ◽

2011 ◽

Vol 18 (2) ◽

pp. 193-212 ◽

Cited By ~ 50

Author(s):

E. Rivas ◽

R. Lang ◽

S. R. Eddy

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Probabilistic Models ◽

Nearest Neighbor ◽

Secondary Structure Prediction ◽

Rna Secondary Structure Prediction

Download Full-text

Evolutionary Algorithm for RNA Secondary Structure Prediction Based on Simulated SHAPE Data

PLoS ONE ◽

10.1371/journal.pone.0166965 ◽

2016 ◽

Vol 11 (11) ◽

pp. e0166965 ◽

Cited By ~ 4

Author(s):

Soheila Montaseri ◽

Mohammad Ganjtabesh ◽

Fatemeh Zare-Mirakabad

Keyword(s):

Secondary Structure ◽

Evolutionary Algorithm ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Rna Secondary Structure Prediction ◽

Shape Data

Download Full-text

RNA Secondary Structure Prediction and Gene Regulation by Small RNAs

Frontiers in Computational and Systems Biology - Computational Biology ◽

10.1007/978-1-84996-196-7_2 ◽

2010 ◽

pp. 19-37

Author(s):

Ye Ding

Keyword(s):

Gene Regulation ◽

Secondary Structure ◽

Small Rnas ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Rna Secondary Structure Prediction

Download Full-text

RNA Secondary Structure Prediction

Computing for Biologists ◽

10.1017/cbo9781107337510.018 ◽

2018 ◽

pp. 177-185

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Rna Secondary Structure Prediction

Download Full-text

Study of RNA Secondary Structure Prediction Algorithms

10.31979/etd.3ggp-5fwe ◽

2006 ◽

Author(s):

Lisa Yu

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Rna Secondary Structure Prediction ◽

Prediction Algorithms

Download Full-text

RNA secondary structure prediction using deep learning with thermodynamic integration

10.1101/2020.08.10.244442 ◽

2020 ◽

Author(s):

Kengo Sato ◽

Manato Akiyama ◽

Yasubumi Sakakibara

Keyword(s):

Deep Learning ◽

Secondary Structure ◽

Structure Prediction ◽

Rna Secondary Structure ◽

Secondary Structure Prediction ◽

Secondary Structures ◽

Thermodynamic Integration ◽

Rna Secondary Structure Prediction ◽

Rna Secondary Structures ◽

Non Coding Rnas

RNA secondary structure prediction is one of the key technologies for revealing the essential roles of functional non-coding RNAs. Although machine learning-based rich-parametrized models have achieved extremely high performance in terms of prediction accuracy, the risk of overfitting for such models has been reported. In this work, we propose a new algorithm for predicting RNA secondary structures that uses deep learning with thermodynamic integration, thereby enabling robust predictions. Similar to our previous work, the folding scores, which are computed by a deep neural network, are integrated with traditional thermodynamic parameters to enable robust predictions. We also propose thermodynamic regularization for training our model without overfitting it to the training data. Our algorithm (MXfold2) achieved the most robust and accurate predictions in computational experiments designed for newly discovered non-coding RNAs, with significant 2–10 % improvements over our previous algorithm (MXfold) and standard algorithms for predicting RNA secondary structures in terms of F-value.

Download Full-text