SOME INTRIGUING HIGH-THROUGHPUT DNA SEQUENCE VARIANTS PREDICTION OVER PROTEIN FUNCTIONALITY

2016 ◽  
Vol 78 (6-4) ◽  
Author(s):  
Atabak Kheirkhah ◽  
Salwani Mohd Daud ◽  
Noor Azurati Ahmad @ Salleh ◽  
Suriani Mohd Sam ◽  
Hafiza Abas ◽  
...  

This paper intends to review computational methods and high throughput automated tools for precisely prediction various functionalities of uncharacterized proteins based on their desired DNA sequence information alone. Then proposes a hybrid weighted network and Genetic Algorithm to improve prediction purpose. The main advantage of the method is the protein function and DNA sequence prediction can be computed precisely using best fitness parent in genetic algorithm. With the accomplishment of human genome sequencing, the number of sequence-known proteins has increased exponentially and the pace is much slower in determining their biological attributes. The gap between DNA sequence variants and their functionalities has become increasingly large. However, detection of sequences based on protein data bank has become benchmark for many researchers. As amount of DNA sequence data continues to increase, the fundamental problem stay at the front of genome analysis. In the course of developing these methods, the following matters were often needed to consider: benchmark dataset construction, gene sequence prediction, operating algorithm, anticipated accuracy, gene recommender and functional integrations. In this review, we are to discuss each of them, with a different focus on operational algorithms and how to increase the accuracy of DNA sequence variants prediction.

2012 ◽  
Vol 33 (4) ◽  
pp. 599-608 ◽  
Author(s):  
David R. Adams ◽  
Murat Sincan ◽  
Karin Fuentes Fajardo ◽  
James C. Mullikin ◽  
Tyler M. Pierson ◽  
...  

Author(s):  
Yue Cao ◽  
Yang Shen

Abstract Motivation Facing the increasing gap between high-throughput sequence data and limited functional insights, computational protein function annotation provides a high-throughput alternative to experimental approaches. However, current methods can have limited applicability while relying on protein data besides sequences, or lack generalizability to novel sequences, species and functions. Results To overcome aforementioned barriers in applicability and generalizability, we propose a novel deep learning model using only sequence information for proteins, named Transformer-based protein function Annotation through joint sequence–Label Embedding (TALE). For generalizability to novel sequences we use self attention-based transformers to capture global patterns in sequences. For generalizability to unseen or rarely seen functions (tail labels), we embed protein function labels (hierarchical GO terms on directed graphs) together with inputs/features (1D sequences) in a joint latent space. Combining TALE and a sequence similarity-based method, TALE+ outperformed competing methods when only sequence input is available. It even outperformed a state-of-the-art method using network information besides sequence, in two of the three gene ontologies. Furthermore, TALE and TALE+ showed superior generalizability to proteins of low similarity, new species, or rarely annotated functions compared to training data, revealing deep insights into the protein sequence–function relationship. Ablation studies elucidated contributions of algorithmic components toward the accuracy and the generalizability. Availability The data, source codes and models are available at https://github.com/Shen-Lab/TALE Supplementary information Supplementary data are available at Bioinformatics online.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Heleen Plaisier ◽  
Thomas R. Meagher ◽  
Daniel Barker

Abstract Objective Visualisation methods, primarily color-coded representation of sequence data, have been a predominant means of representation of DNA data. Algorithmic conversion of DNA sequence data to sound—sonification—represents an alternative means of representation that uses a different range of human sensory perception. We propose that sonification has value for public engagement with DNA sequence information because it has potential to be entertaining as well as informative. We conduct preliminary work to explore the potential of DNA sequence sonification in public engagement with bioinformatics. We apply a simple sonification technique for DNA, in which each DNA base is represented by a specific note. Additionally, a beat may be added to indicate codon boundaries or for musical effect. We report a brief analysis from public engagement events we conducted that featured this method of sonification. Results We report on use of DNA sequence sonification at two public events. Sonification has potential in public engagement with bioinformatics, both as a means of data representation and as a means to attract audience to a drop-in stand. We also discuss further directions for research on integration of sonification into bioinformatics public engagement and education.


2006 ◽  
Vol 11 (9) ◽  
pp. 837-846 ◽  
Author(s):  
S G Schwab ◽  
M Knapp ◽  
P Sklar ◽  
G N Eckstein ◽  
C Sewekow ◽  
...  

Zootaxa ◽  
2012 ◽  
Vol 3350 (1) ◽  
pp. 1 ◽  
Author(s):  
BARRY J. RICHARDSON ◽  
NICOLE L. GUNTER

The genus Servaea Simon 1887 is revised and redefined. Descriptions and identification keys are provided to the following sixspecies, of which three are described as new: Servaea incana (Karsch 1878), Servaea narraweena n. sp., Servaea melaina n.sp., Servaea spinibarbis Simon 1909, Servaea villosa (Keyserling 1881) and Servaea zabkai n. sp. The type species of thegenus, Servaea vestita (L. Koch 1879), is proposed here to be a junior synonym of Servaea incana. In addition to the diagnosesand descriptions, distributional and nucleotide sequence information are provided. DNA sequence data for the segment of COIused in other salticid studies was obtained for the five species for which suitable material was available. Intraspecific variationin S. villosa and S. incana were studied in more detail. Within-species divergence was S. melaina and S spinibarbis, had adjacent predicted distributions, one coastal on sandy soils and one inland on other soil types.


2013 ◽  
Vol 35 (1) ◽  
pp. 147-148 ◽  
Author(s):  
Frans P.M. Cremers ◽  
Johan T. den Dunnen ◽  
Muhammad Ajmal ◽  
Alamdar Hussain ◽  
Markus N. Preising ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document