scholarly journals Identifying frequent patterns in biochemical reaction networks - a workflow

Author(s):  
Fabienne Lambusch ◽  
Dagmar Waltemath ◽  
Olaf Wolkenhauer ◽  
Kurt Sandkuhl ◽  
Christian Rosenke ◽  
...  

Computational models in biology encode molecular and cell biological processes. These models often can be represented as biochemical reaction networks. Studying such networks, one is mostly interested in systems that share similar reactions and mechanisms. Typical goals of an investigation include understanding of the parts of a model, identification of reoccurring patterns, and recognition of biologically relevant motifs. The large number and size of available models, however, require automated methods to support researchers in achieving their goals. Specifically for the problem of finding patterns in large networks only partial solutions exist. We propose a workflow that identifies frequent structural patterns in biochemical reaction networks encoded in the Systems Biology Markup Language. The workflow utilises a subgraph mining algorithm to detect frequent network patterns. Once patterns are identified, the textual pattern description can automatically be converted into a graphical representation.Furthermore, information about the distribution of patterns among the selected set of models can be retrieved.The workflow was validated with 575 models from the curated branch of BioModels. In this paper, we highlight interesting and frequent structural patterns. Further, we provide exemplary patterns that incorporate terms from the Systems Biology Ontology. Our workflow can be applied to a custom set of models or to models already existing in our graph database MaSyMoS. The occurrences of frequent patterns may give insight into the encoding of central biological processes, evaluate postulated biological motifs, or serve as a similarity measure for models that share common structures. Availability: https://github.com/FabienneL/BioNet-Mining Contact: [email protected]

2017 ◽  
Author(s):  
Fabienne Lambusch ◽  
Dagmar Waltemath ◽  
Olaf Wolkenhauer ◽  
Kurt Sandkuhl ◽  
Christian Rosenke ◽  
...  

Computational models in biology encode molecular and cell biological processes. These models often can be represented as biochemical reaction networks. Studying such networks, one is mostly interested in systems that share similar reactions and mechanisms. Typical goals of an investigation include understanding of the parts of a model, identification of reoccurring patterns, and recognition of biologically relevant motifs. The large number and size of available models, however, require automated methods to support researchers in achieving their goals. Specifically for the problem of finding patterns in large networks only partial solutions exist. We propose a workflow that identifies frequent structural patterns in biochemical reaction networks encoded in the Systems Biology Markup Language. The workflow utilises a subgraph mining algorithm to detect frequent network patterns. Once patterns are identified, the textual pattern description can automatically be converted into a graphical representation.Furthermore, information about the distribution of patterns among the selected set of models can be retrieved.The workflow was validated with 575 models from the curated branch of BioModels. In this paper, we highlight interesting and frequent structural patterns. Further, we provide exemplary patterns that incorporate terms from the Systems Biology Ontology. Our workflow can be applied to a custom set of models or to models already existing in our graph database MaSyMoS. The occurrences of frequent patterns may give insight into the encoding of central biological processes, evaluate postulated biological motifs, or serve as a similarity measure for models that share common structures. Availability: https://github.com/FabienneL/BioNet-Mining[p] Contact: [email protected]


2018 ◽  
Author(s):  
Fabienne Lambusch ◽  
Dagmar Waltemath ◽  
Olaf Wolkenhauer ◽  
Kurt Sandkuhl ◽  
Christian Rosenke ◽  
...  

Computational models in biology encode molecular and cell biological processes. These models often can be represented as biochemical reaction networks. Studying such networks, one is mostly interested in systems that share similar reactions and mechanisms. Typical goals of an investigation include understanding of the parts of a model, identification of reoccurring patterns, and recognition of biologically relevant motifs. The large number and size of available models, however, require automated methods to support researchers in achieving their goals. Specifically for the problem of finding patterns in large networks only partial solutions exist. We propose a workflow that identifies frequent structural patterns in biochemical reaction networks encoded in the Systems Biology Markup Language. The workflow utilises a subgraph mining algorithm to detect frequent network patterns. Once patterns are identified, the textual pattern description can automatically be converted into a graphical representation.Furthermore, information about the distribution of patterns among the selected set of models can be retrieved.The workflow was validated with 575 models from the curated branch of BioModels. In this paper, we highlight interesting and frequent structural patterns. Further, we provide exemplary patterns that incorporate terms from the Systems Biology Ontology. Our workflow can be applied to a custom set of models or to models already existing in our graph database MaSyMoS. The occurrences of frequent patterns may give insight into the encoding of central biological processes, evaluate postulated biological motifs, or serve as a similarity measure for models that share common structures. Availability: https://github.com/FabienneL/BioNet-Mining Contact: [email protected]


2015 ◽  
Author(s):  
Ron Henkel ◽  
Fabienne Lambusch ◽  
Dagmar Waltemath

Biological questions today are often answered with the help of simulation models. Many of these models encode biological processes as biochemical reaction networks. The increasing amount of published models and the growing size of encoded reaction networks demand methods to analyse models. Specifically, researchers need to identify reoccurring and biologically relevant patterns. However, pattern recognition in large networks is a hard problem, and only partial solutions for very specific biological networks exist until now. In addition, while such patterns where already postulated, identifying them manually is barley feasible given a large set of complex models. This paper examines automatic methods to find reoccurring patterns structural similarities in models represented as bipartite graphs. An approach is presented to find the most frequent structures within the models. Appropriate patterns were found, which occur in a major part of the 575 input models. The occurrences of the resulting structures can provide insight into the encoding of certain biological processes, evaluate the postulated structures and serve as a reasonable similarity measure for grouping models that share many common structures.


2016 ◽  
Author(s):  
Ron Henkel ◽  
Fabienne Lambusch ◽  
Olaf Wolkenhauer ◽  
Kurt Sandkuhl ◽  
Christian Rosenke ◽  
...  

Computational models in biology encode molecular and cell biological processes. Many of them can be represented as biochemical reaction networks. Studying such networks, one is often interested in systems that share similar reactions and mechanisms. Typical goals are to understand the parts of a model, to identify reoccurring patterns, and to find biologically relevant motifs. The large number of models are available for such a search, but also the large size of models require automated methods.Specifically the generic problem of finding patterns in large networks is computationally hard. As a consequence, only partial solutions for a structural analysis of models exist. Here we introduce a tool chain that identifies reoccurring patterns in biochemical reaction networks. We started this work with an evaluation of algorithms for the identification of frequent subgraphs. Then, we created graph representations of existing SBML models and ran the most suitable algorithm on the data. The result was a list of reaction patterns together with statistics about the occurrence of each pattern in the data set. The approach was validated with 575 SBML models from the curated branch of BioModels. We analysed how the resulting patterns confirm with expectations from the literature and from previous model statistics. In the future, the identified patterns can serve as a tool to measure the similarity of models.


2006 ◽  
pp. 127-148 ◽  
Author(s):  
Frank J. Bruggeman ◽  
Barbara M. Bakker ◽  
Jorrit J. Hornberg ◽  
Hans V. Westerhoff

Database ◽  
2018 ◽  
Vol 2018 ◽  
Author(s):  
Fabienne Lambusch ◽  
Dagmar Waltemath ◽  
Olaf Wolkenhauer ◽  
Kurt Sandkuhl ◽  
Christian Rosenke ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document