Identifying lncRNA-mediated regulatory modules via ChIA-PET network analysis
AbstractBackgroundAlthough several studies have provided insights into the role of long non-coding RNAs (lncRNAs), the majority of them has unknown function. Recent evidence has shown the importance of both lncR-NAs and chromatin interactions in transcriptional regulation. Although network-based methods, mainly exploiting gene-lncRNA co-expression, have been applied to characterize lncRNA of unknown function by means of ‘guilt-by-association’ strategies, no method exists which combines co-expression analysis with 3D chromatin interaction data.ResultsTo better understand the function of chromatin interactions in the context of lncRNA-mediated gene regulation, we have developed a multi-step graph analysis approach to examine the RNA polymerase II ChIA-PET chromatin interaction network in the K562 human cell line. We have annotated the network with gene and lncRNA coordinates, and chromatin states from the ENCODE project. We used centrality measures, as well as an adaptation of our previously developed Markov State Models (MSM) clustering method, to gain a better understanding of lncRNAs in transcriptional regulation. The novelty of our approach resides into the detection of fuzzy regulatory modules based on network properties and their optimization based on co-expression analysis between genes and gene-lncRNA pairs. This results in our method returning morebona fideregulatory modules than other state-of-the art approaches for clustering on graphs.ConclusionsInterestingly, we find that lncRNA network hubs tend to be significantly enriched in disease association, positional conservation and enhancer-like functions. We validated regulatory functions for well known lncRNAs, such as MALAT1 and the enhancer-like lncRNA FALEC. In addition, by investigating the modular structure of bigger components we show that we can propose regulatory functional mechanisms for uncharacterized lncRNAs, such FLJ37453, RP11442N24 B.1 and LINC00910.