Multifractal detrended cross-correlation analysis of coding and non-coding DNA sequences through chaos-game representation

2015 ◽  
Vol 436 ◽  
pp. 596-603 ◽  
Author(s):  
Mayukha Pal ◽  
B. Satish ◽  
K. Srinivas ◽  
P. Madhusudana Rao ◽  
P. Manimaran
Fractals ◽  
2006 ◽  
Vol 14 (01) ◽  
pp. 27-35 ◽  
Author(s):  
TOMOYA SUZUKI ◽  
TOHRU IKEGUCHI ◽  
MASUO SUZUKI

Iterative function systems are often used for investigating fractal structures. The method is also referred as Chaos Game Representation (CGR), and is applied for representing characteristic structures of DNA sequences visually. In this paper, we proposed an original way of plotting CGR to easily confirm the property of the temporal evaluation of a time series. We also showed existence of spurious characteristic structures of time series, if we carelessly applied the CGR to real time series. We revealed that the source of spurious identification came from non-uniformity of the frequency histograms of the time series, which is often the case of analyzing real time series. We also showed how to avoid such spurious identification by applying the method of surrogate data and introducing conditional probabilities of the time series.


2016 ◽  
Vol 2016 ◽  
pp. 1-7 ◽  
Author(s):  
Bhagwan N. Rekadwad ◽  
Juan M. Gonzalez ◽  
Chandrahasya N. Khobragade

A total of five highly related strains of an unidentified marine bacterium were analyzed through their short genome sequences (AM260709–AM260713). Genome-to-Genome Distance (GGDC) showed high similarity to Pseudoalteromonas haloplanktis (X67024). The generated unique Quick Response (QR) codes indicated no identity to other microbial species or gene sequences. Chaos Game Representation (CGR) showed the number of bases concentrated in the area. Guanine residues were highest in number followed by cytosine. Frequency of Chaos Game Representation (FCGR) indicated that CC and GG blocks have higher frequency in the sequence from the evaluated marine bacterium strains. Maximum GC content for the marine bacterium strains ranged 53-54%. The use of QR codes, CGR, FCGR, and GC dataset helped in identifying and interpreting short genome sequences from specific isolates. A phylogenetic tree was constructed with the bootstrap test (1000 replicates) using MEGA6 software. Principal Component Analysis (PCA) was carried out using EMBL-EBI MUSCLE program. Thus, generated genomic data are of great assistance for hierarchical classification in Bacterial Systematics which combined with phenotypic features represents a basic procedure for a polyphasic approach on unambiguous bacterial isolate taxonomic classification.


Sign in / Sign up

Export Citation Format

Share Document