CONSTRUCTION AND ANALYSIS OF TISSUE/DISEASE SPECIFIC PROTEIN-PROTEIN INTERACTION NETWORKS BY INTEGRATING LARGE SCALE TRANSCRIPTOME DATA WITH GENOME SCALE PROTEIN-PROTEIN INTERACTION NETWORKS

Download
2022-7-25
SÖNMEZ, ARZU BURÇAK
Analysis of integrated genome-scale networks is a challenging problem due to heterogeneity of high-throughput data. There are several topological measures, such as graphlet counts and degree distributions, for characterization of biological networks. In this dissertation, we present methods for counting graphlet patterns in integrated genome-scale networks which are modeled as labeled multidigraphs. We have obtained physical, regulatory, and metabolic interactions between H. sapiens proteins from the Pathway Commons database. For the first part of the dissertation, the integrated network is filtered for tissue/disease specific proteins by using a large-scale human transcriptional profiling study, resulting in several tissue and disease specific sub-networks. We have applied and extended the idea of graphlet counting in undirected protein-protein interaction (PPI) networks to directed multi-labeled networks and represented each network as a vector of graphlet counts. Graphlet counts are assessed for statistical significance by comparison against a set of randomized networks. We present our results on analysis of differential graphlets between different conditions and on the utility of graphlet count vectors for clustering multiple condition specific networks. Our results show that there are numerous statistically significant graphlets in integrated biological networks and the graphlet signature vector can be used as an effective representation of a multilabeled network for clustering and systems level analysis of tissue/disease specific networks. The second part of the dissertation provides methods for comparing the network nodes based on the counts of 3- and 4-node multilabeled graphlets starting from these nodes. Then, we use different types of cancer pathways to retrieve the integrated interactions occurring between them from Pathway Commons dataset. These interactions are further investigated for identifying recurring 3- and 4-node graphlet patterns and Pathway Commons dataset and on different cancer pathways.

Suggestions

Comparison of tissue/disease specific integrated networks using directed graphlet signatures
Sönmez, Arzu Burçak; Can, Tolga (2017-03-22)
Background: Analysis of integrated genome-scale networks is a challenging problem due to heterogeneity of high-throughput data. There are several topological measures, such as graphlet counts, for characterization of biological networks.
A systematic approach to the integration of overlapping partitions in service-oriented data grids
Sunercan, H. Kevser; Alpdemir, M. Nedim; Çiçekli, Fehime Nihan (Elsevier BV, 2011-06-01)
This paper aims to provide a service-oriented data integration solution over data Grids for cases where distributed data sources are partitioned with overlapping sections of various proportions. This is an interesting variation which combines both replicated and partitioned data within the same data management framework. Thus, the data management infrastructure has to deal with specific challenges regarding the identification, access and aggregation of partitioned data with varying proportions of overlappin...
Investigation and comparison of the preprocessing algorithms for microarrayanalysis for robust gene expression calculation and performance analysis of technical replicates
İLK, HAKKI GÖKHAN; İlk Dağ, Özlem; KONU KARAKAYALI, ÖZLEN; ÖZDAĞ, Hilal (2006-04-19)
Preprocessing of microarray data involves the necessary steps of background correction, normalization and summarization of the raw intensity data obtained from cDNA or oligo-arrays before statistical analysis. Several algorithms, namely RMA, dChip, and MAS5 exist for the preprocessing of Affymetrix microarray data. Previous studies have identified RMA as one of most accurate algorithms while MAS5 was characterized with lower accuracy and sensitivity levels. In this study, performance of different preprocess...
Investigation of Stationarity for Graph Time Series Data Sets
Güneyi, Eylem Tuğçe; Vural, Elif (2021-01-07)
Graphs permit the analysis of the relationships in complex data sets effectively. Stationarity is a feature that facilitates the analysis and processing of random time signals. Since graphs have an irregular structure, the definition of classical stationarity does not apply to graphs. In this study, we study how stationarity is defined for graph random processes and examine the validity of the stationarity assumption with experiments on synthetic and real data sets.
Network density estimators and density-aware wireless networks
Eroğlu, Alperen; Onur, Ertan; Department of Computer Engineering (2020-11)
New network architectures and communication technologies continue to emerge to meet rapidly increasing and changing user demands requiring continuous connectivity and high data rate transmissions. These ubiquitous infrastructures result in a paradigm shift in mobile communications with the advent of mobile robots equipped with sensors, unmanned aerial vehicles, and mobile small-cells, which makes the future networks highly dynamic. This dynamism poses unpredictable variations in the network density causing ...
Citation Formats
A. B. SÖNMEZ, “CONSTRUCTION AND ANALYSIS OF TISSUE/DISEASE SPECIFIC PROTEIN-PROTEIN INTERACTION NETWORKS BY INTEGRATING LARGE SCALE TRANSCRIPTOME DATA WITH GENOME SCALE PROTEIN-PROTEIN INTERACTION NETWORKS,” Ph.D. - Doctoral Program, Middle East Technical University, 2022.