CROssBAR: Comprehensive Resource of Biomedical Relations with Network Representations and Deep Learning

2019-07-21
Joshi, Vishal
Doğan, Tunca
Atalay, Rengül
Martin, Maria-Jesus
Saidi, Rabie
Zellner, Hermann
Volynkin, Vladimir
Sinoplu, Esra
Atas, Heval
Nightingale, Andrew
Rifaioğlu, Ahmet Süreyya
Atalay, Mehmet Volkan
Biomedical information is scattered across different biological data resources, which are biologically related but only loosely linked to each other in terms of data connections. This hinders the applications of integrative systems biology applications on data. We aim to develop a comprehensive resource, CROssBAR, to address these shortcomings by establishing relationships between relevant biological data sources to present a well-connected database, focusing on the fields of drug discovery and precision medicine. CROssBAR will contain 3 modules: (1) novel computational methods using graph theory and deep learning algorithms, to reveal unknown drug-target interactions and gene/protein-disease associations; (2) multi-partite biological networks where nodes will represent compounds/drugs, genes/proteins, pathways/systems and diseases, the edges will represent known and predicted pairwise relations in-between; and (3) an open access database and web-service to provide access to the resultant networks with its components. We have developed data pipelines for the heavy lifting of data from different data sources like UniProt, ChEMBL, PubChem, Drugbank and EFO persisting only specific data attributes for biomedical entity networks. The database is hosted in self-sufficient collections in MongoDB. The CROssBAR resource should help researchers in the interpretation of biomedical data by observing biological entities together with their relations.

Suggestions

Network structure based pathway enrichment system to analyze pathway activities
Işık, Zerrin; Atalay, Mehmet Volkan; Atalay, Rengül; Department of Computer Engineering (2011)
Current approaches integrating large scale data and information from a variety of sources to reveal molecular basis of cellular events do not adequately benefit from pathway information. Here, we portray a network structure based pathway enrichment system that fuses and exploits model and data: signalling pathways are taken as the biological models while microarray and ChIP-seq data are the sample input data sources among many other alternatives. Our model- and data-driven hybrid system allows to quantitati...
Modeling diseases with multiple disease characteristics: comparison of models and estimation methods
Erdem, Münire Tuğba; Kalaylıoğlu Akyıldız, Zeynep Işıl; Department of Statistics (2011)
Epidemiological data with disease characteristic information can be modelled in several ways. One way is taking each disease characteristic as a response and constructing binary or polytomous logistic regression model. Second way is using a new response which consists of disease subtypes created by cross-classification of disease characteristic levels, and then constructing polytomous logistic regression model. The former may be disadvantageous since any possible covariation between disease characteristics ...
Short Time Series Microarray Data Analysis and Biological Annotation
Sökmen, Zerrin; Atalay, Mehmet Volkan; Atalay, Rengül (2008-01-01)
Significant gene list is the result of microarray data analysis should be explained for the purpose of biological functions. The aim of this study is to extract the biologically related gene clusters over the short time series microarray gene data by applying unsupervised methods and automatically perform biological annotation of those clusters. In the first step of the study, short time series microarray expression data is clustered according to similar expression profiles. After that, several biological d...
Inference of Gene Regulatory Networks Via Multiple Data Sources and a Recommendation Method
Ozsoy, Makbule Gulcin; Polat, Faruk; Alhajj, Reda (2015-11-12)
Gene regulatory networks (GRNs) are composed of biological components, including genes, proteins and metabolites, and their interactions. In general, computational methods are used to infer the connections among these components. However, computational methods should take into account the general features of the GRNs, which are sparseness, scale-free topology, modularity and structure of the inferred networks. In this work, observing the common aspects between recommendation systems and GRNs, we decided to ...
HYPERSPECTRAL CLASSIFICATION USING STACKED AUTOENCODERS WITH DEEP LEARNING
Özdemir, Ataman; Cetin, C. Yasemin Yardimci (2014-06-27)
In this study, stacked autoencoders which are widely utilized in deep learning research are applied to remote sensing domain for hyperspectral classification. High dimensional hyperspectral data is an excellent candidate for deep learning methods. However, there are no works in literature that focuses on such deep learning approaches for hyperspectral imagery. This study aims to fill this gap by utilizing stacked autoencoders. Experiments are conducted on the Pavia University scene. Using stacked autoencode...
Citation Formats
V. Joshi et al., “CROssBAR: Comprehensive Resource of Biomedical Relations with Network Representations and Deep Learning,” 2019, Accessed: 00, 2021. [Online]. Available: https://www.iscb.org/cms_addon/conferences/ismbeccb2019/bioontologies.php.