scholarly journals Discovery of Large Disjoint Motif in Biological Network using Dynamic Expansion Tree

2018 ◽  
Author(s):  
Sabyasachi Patra ◽  
Anjali Mohapatra

AbstractNetwork motifs play an important role in structural analysis of biological networks. Identification of such network motifs leads to many important applications, such as: understanding the modularity and the large-scale structure of biological networks, classification of networks into super-families etc. However, identification of network motifs is challenging as it involved graph isomorphism which is computationally hard problem. Though this problem has been studied extensively in the literature using different computational approaches, we are far from encouraging results. Motivated by the challenges involved in this field we have proposed an efficient and scalable Motif discovery algorithm using a Dynamic Expansion Tree (MDET). In this algorithm embeddings corresponding to child node of expansion tree are obtained from the embeddings of parent node, either by adding a vertex with time complexity O(n) or by adding an edge with time complexity O(1) without involving any isomorphic check. The growth of Dynamic Expansion Tree (DET) depends on availability of patterns in the target network. DET reduces space complexity significantly and the memory limitation of static expansion tree can overcome. The proposed algorithm has been tested on Protein Protein Interaction (PPI) network obtained from MINT database. It is able to identify large motifs faster than most of the existing motif discovery algorithms.

PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6917 ◽  
Author(s):  
Sabyasachi Patra ◽  
Anjali Mohapatra

Network motifs play an important role in the structural analysis of biological networks. Identification of such network motifs leads to many important applications such as understanding the modularity and the large-scale structure of biological networks, classification of networks into super-families, and protein function annotation. However, identification of large network motifs is a challenging task as it involves the graph isomorphism problem. Although this problem has been studied extensively in the literature using different computational approaches, still there is a lot of scope for improvement. Motivated by the challenges involved in this field, an efficient and scalable network motif finding algorithm using a dynamic expansion tree is proposed. The novelty of the proposed algorithm is that it avoids computationally expensive graph isomorphism tests and overcomes the space limitation of the static expansion tree (SET) which makes it enable to find large motifs. In this algorithm, the embeddings corresponding to a child node of the expansion tree are obtained from the embeddings of a parent node, either by adding a vertex or by adding an edge. This process does not involve any graph isomorphism check. The time complexity of vertex addition and edge addition are O(n) and O(1), respectively. The growth of a dynamic expansion tree (DET) depends on the availability of patterns in the target network. Pruning of branches in the DET significantly reduces the space requirement of the SET. The proposed algorithm has been tested on a protein–protein interaction network obtained from the MINT database. The proposed algorithm is able to identify large network motifs faster than most of the existing motif finding algorithms.


2018 ◽  
Vol 16 (06) ◽  
pp. 1850024 ◽  
Author(s):  
Sabyasachi Patra ◽  
Anjali Mohapatra

Networks are powerful representation of topological features in biological systems like protein interaction and gene regulation. In order to understand the design principles of such complex networks, the concept of network motifs emerged. Network motifs are recurrent patterns with statistical significance that can be seen as basic building blocks of complex networks. Identification of network motifs leads to many important applications, such as understanding the modularity and the large-scale structure of biological networks, classification of networks into super-families, protein function annotation, etc. However, identification of network motifs is challenging as it involves graph isomorphism which is computationally hard. Though this problem has been studied extensively in the literature using different computational approaches, we are far from satisfactory results. Motivated by the challenges involved in this field, an efficient and scalable network Motif Discovery algorithm based on Expansion Tree (MODET) is proposed. Pattern growth approach is used in this proposed motif-centric algorithm. Each node of the expansion tree represents a non-isomorphic pattern. The embeddings corresponding to a child node of the expansion tree are obtained from the embeddings of the parent node through vertex addition and edge addition. Further, the proposed algorithm does not involve any graph isomorphism check and the time complexities of these processes are [Formula: see text] and [Formula: see text], respectively. The proposed algorithm has been tested on Protein–Protein Interaction (PPI) network obtained from the MINT database. The computational efficiency of the proposed algorithm outperforms most of the existing network motif discovery algorithms.


2018 ◽  
Author(s):  
Wang Tao ◽  
Yadong Wang ◽  
Jiajie Peng ◽  
Chen Jin

AbstractNetwork motifs are recurring significant patterns of inter-connections, which are recognized as fundamental units to study the higher-order organizations of networks. However, the principle of selecting representative network motifs for local motif based clustering remains largely unexplored. We present a scalable algorithm called FSM for network motif discovery. FSM accelerates the motif discovery process by effectively reducing the number of times to do subgraph isomorphism labeling. Multiple heuristic optimizations for subgraph enumeration and subgraph classification are also adopted in FSM to further improve its performance. Experimental results show that FSM is more efficient than the compared models on computational efficiency and memory usage. Furthermore, our experiments indicate that large and frequent network motifs may be more appropriate to be selected as the representative network motifs for discovering higher-order organizational structures in biological networks than small or low-frequency network motifs.


2009 ◽  
pp. 27-53
Author(s):  
A. Yu. Kudryavtsev

Diversity of plant communities in the nature reserve “Privolzhskaya Forest-Steppe”, Ostrovtsovsky area, is analyzed on the basis of the large-scale vegetation mapping data from 2000. The plant community classi­fication based on the Russian ecologic-phytocoenotic approach is carried out. 12 plant formations and 21 associations are distinguished according to dominant species and a combination of ecologic-phytocoenotic groups of species. A list of vegetation classification units as well as the characteristics of theshrub and woody communities are given in this paper.


1996 ◽  
pp. 64-67 ◽  
Author(s):  
Nguen Nghia Thin ◽  
Nguen Ba Thu ◽  
Tran Van Thuy

The tropical seasonal rainy evergreen broad-leaved forest vegetation of the Cucphoung National Park has been classified and the distribution of plant communities has been shown on the map using the relations of vegetation to geology, geomorphology and pedology. The method of vegetation mapping includes: 1) the identifying of vegetation types in the remote-sensed materials (aerial photographs and satellite images); 2) field work to compile the interpretation keys and to characterize all the communities of a study area; 3) compilation of the final vegetation map using the combined information. In the classification presented a number of different level vegetation units have been identified: formation classes (3), formation sub-classes (3), formation groups (3), formations (4), subformations (10) and communities (19). Communities have been taken as mapping units. So in the vegetation map of the National Park 19 vegetation categories has been shown altogether, among them 13 are natural primary communities, and 6 are the secondary, anthropogenic ones. The secondary succession goes through 3 main stages: grassland herbaceous xerophytic vegetation, xerophytic scrub, dense forest.


2020 ◽  
Vol 27 (4) ◽  
pp. 265-278 ◽  
Author(s):  
Ying Han ◽  
Liang Cheng ◽  
Weiju Sun

The interactions among proteins and genes are extremely important for cellular functions. Molecular interactions at protein or gene levels can be used to construct interaction networks in which the interacting species are categorized based on direct interactions or functional similarities. Compared with the limited experimental techniques, various computational tools make it possible to analyze, filter, and combine the interaction data to get comprehensive information about the biological pathways. By the efficient way of integrating experimental findings in discovering PPIs and computational techniques for prediction, the researchers have been able to gain many valuable data on PPIs, including some advanced databases. Moreover, many useful tools and visualization programs enable the researchers to establish, annotate, and analyze biological networks. We here review and list the computational methods, databases, and tools for protein−protein interaction prediction.


2020 ◽  
Author(s):  
Thomas Gaisl ◽  
Naser Musli ◽  
Patrick Baumgartner ◽  
Marc Meier ◽  
Silvana K Rampini ◽  
...  

BACKGROUND The health aspects, disease frequencies, and specific health interests of prisoners and refugees are poorly understood. Importantly, access to the health care system is limited for this vulnerable population. There has been no systematic investigation to understand the health issues of inmates in Switzerland. Furthermore, little is known on how recent migration flows in Europe may have affected the health conditions of inmates. OBJECTIVE The Swiss Prison Study (SWIPS) is a large-scale observational study with the aim of establishing a public health registry in northern-central Switzerland. The primary objective is to establish a central database to assess disease prevalence (ie, International Classification of Diseases-10 codes [German modification]) among prisoners. The secondary objectives include the following: (1) to compare the 2015 versus 2020 disease prevalence among inmates against a representative sample from the local resident population, (2) to assess longitudinal changes in disease prevalence from 2015 to 2020 by using cross-sectional medical records from all inmates at the Police Prison Zurich, Switzerland, and (3) to identify unrecognized health problems to prepare successful public health strategies. METHODS Demographic and health-related data such as age, sex, country of origin, duration of imprisonment, medication (including the drug name, brand, dosage, and release), and medical history (including the International Classification of Diseases-10 codes [German modification] for all diagnoses and external results that are part of the medical history in the prison) have been deposited in a central register over a span of 5 years (January 2015 to August 2020). The final cohort is expected to comprise approximately 50,000 to 60,000 prisoners from the Police Prison Zurich, Switzerland. RESULTS This study was approved on August 5, 2019 by the ethical committee of the Canton of Zurich with the registration code KEK-ZH No. 2019-01055 and funded in August 2020 by the “Walter and Gertrud Siegenthaler” foundation and the “Theodor and Ida Herzog-Egli” foundation. This study is registered with the International Standard Randomized Controlled Trial Number registry. Data collection started in August 2019 and results are expected to be published in 2021. Findings will be disseminated through scientific papers as well as presentations and public events. CONCLUSIONS This study will construct a valuable database of information regarding the health of inmates and refugees in Swiss prisons and will act as groundwork for future interventions in this vulnerable population. CLINICALTRIAL ISRCTN registry ISRCTN11714665; http://www.isrctn.com/ISRCTN11714665 INTERNATIONAL REGISTERED REPORT DERR1-10.2196/23973


Author(s):  
Mathieu Turgeon-Pelchat ◽  
Samuel Foucher ◽  
Yacine Bouroubi

Sign in / Sign up

Export Citation Format

Share Document