PageRank-flux On Graphlet-Guided-Network(PRO-GGNet): A Method for Pathway Reconstruction and Multi-Omic Data Integration

2022-10
Arıcı, Kaan
Tunçbağ, Nurcan
The recent advancement of omic technologies provides snapshots of cells, tissues, or patients identifying prominent genes, proteins, metabolites, and small molecules. However, accumulated big data on various omic data types may inherently make diseases or perturbations incomprehensible. Network inference or reconstruction methods map a set of significantly altered proteins/genes/metabolites to a given reference network that is composed of already known relations or interactions. Followingly, the signals from these significant hits are propagated so as to identify biomarkers, drivers, pathways or disease modules consisting of a relatively small number of genes. Including Steiner trees/forest, random walk, or heat diffusion, network inference or reconstruction approaches are mainly based on global and local topological features of networks considering graph theory. However, the problem of identifying interacting or related genes in an extensive reference network and assessing associations between multiple sparse omics data is one of the critical problems in system biology. A network motif is a significantly frequent subgraph pattern in a given network. In this study, using graphlets (small non-isomorphic subgraphs) as network motifs, we reduce the dimension of the reference network dramatically and focus on the functionally important regions of the network. However, identifying network motifs and saving their knowledge in an extensive network requires high computational costs, limiting the usage of graphlet motifs for real problems. Our approach, PageRankflux On Graphlet-Guided-Network (PRO-GGNet), focuses on the motifs of a given gene-set in an extensive network and constructs the graphlet-guided network to reduce the computational cost followed by applying PageRank algorithm. Next we compared the performance of PRO-GGNet , with OmicsIntegrator and PathLinker that use prize collecting Steiner forest solution and shortest path extraction methods, respectively. PRO-GGNet outperformed the tools in testing the inference of Immune and Cancer Signaling Pathways in NetPath by providing the highest precision, recall and F1- scores.

Suggestions

Inference of large-scale networks via statistical approaches
Ayyıldız Demirci, Ezgi; Purutçuoğlu Gazi, Vilda; Department of Statistics (2019)
In system biology, the interactions between components such as genes, proteins, can be represented by a network. To understand the molecular mechanism of complex biological systems, construction of their networks plays a crucial role. However, estimation of these networks is a challenging problem because of their high dimensional and sparse structures. The Gaussian graphical model (GGM) is widely used approach to construct the undirected networks. GGM define the interactions between species by using the con...
Abiotic stress tolerance and growth responses of transgenic potato (Solanum tuberosum L. cv. Kennebec) plants expressing rice Osmyb4 gene
AYDIN, GÜLSÜM; Yucel, Meral; Öktem, Hüseyin Avni (2012-09-23)
MYB transcription factors are involved in diverse biochemical and physiological processes such as regulation of secondary metabolism, meristem formation, cell morphogenesis and floral and seed development. They are also involved in certain defence and stress responses and in hormone signalling. In the present study, we developed transgenic potato (Solanum tuberosum L. cv. Kennebec) expressing Oryza sativa myb4 gene, encoding MYB4 transcription factor, driven by either CaMV35S constitutive promoter or cold i...
Inference of Gene Regulatory Networks Via Multiple Data Sources and a Recommendation Method
Ozsoy, Makbule Gulcin; Polat, Faruk; Alhajj, Reda (2015-11-12)
Gene regulatory networks (GRNs) are composed of biological components, including genes, proteins and metabolites, and their interactions. In general, computational methods are used to infer the connections among these components. However, computational methods should take into account the general features of the GRNs, which are sparseness, scale-free topology, modularity and structure of the inferred networks. In this work, observing the common aspects between recommendation systems and GRNs, we decided to ...
Genome-wide sequence analysis of human splice acceptor regions for motif discovery
Karaduman Bahçe, Gülşah; Aydın Son, Yeşim; Department of Medical Informatics (2020-12-23)
For eukaryotic cells, alternative splicing of genes is a vital mechanism that drives protein diversity. Splicing signals on the genomic sequence controls the regulatory factors that orchestrate the alternative splicing. 3’ and 5’ splice sites and common branchpoint sequences are the primary splicing signals, and changes in these signals can be disease- causing. Nevertheless, an extensive genome-wide analysis of the sequences around these signals is lacking. In this study, we focused on the genome-wide motif...
MicroRNAs show a wide diversity of expression profiles in the developing and mature central nervous system
Erson Bensan, Ayşe Elif (2008-10-01)
Since the discovery of microRNAs (miRNAs) in Caenorhabditis elegans, mounting evidence illustrates the important regulatory roles for miRNAs in various developmental, differentiation, cell proliferation, and apoptosis pathways of diverse organisms. We are just beginning to elucidate novel aspects of RNA mediated gene regulation and to understand how heavily various molecular pathways rely on miRNAs for their normal function. miRNAs are small non-protein-coding transcripts that regulate gene expression post-...
Citation Formats
K. Arıcı and N. Tunçbağ, “PageRank-flux On Graphlet-Guided-Network(PRO-GGNet): A Method for Pathway Reconstruction and Multi-Omic Data Integration,” Erdemli, Mersin, TÜRKİYE, 2022, p. 3127, Accessed: 00, 2023. [Online]. Available: https://hibit2022.ims.metu.edu.tr/.