Show/Hide Menu
Hide/Show Apps
anonymousUser
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Açık Bilim Politikası
Açık Bilim Politikası
Frequently Asked Questions
Frequently Asked Questions
Browse
Browse
By Issue Date
By Issue Date
Authors
Authors
Titles
Titles
Subjects
Subjects
Communities & Collections
Communities & Collections
The Use of Informed Priors in Biclustering of Gene Expression with the Hierarchical Dirichlet Process.
Date
2019-02-26
Author
Tercan, Bahar
Acar, Aybar Can
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
6
views
0
downloads
We motivate and describe the application of Hierarchical Dirichlet Process (HDP) models to the "soft" biclustering of gene expression data, in which we obtain modules (biclusters) where the affiliation of genes and samples with the modules are weighted, instead of being hard memberships. As a distinct contribution, we propose a method which HDP is informed with prior beliefs, significantly increasing the quality of the biclustering in terms of both the correctness of the number of modules inferred, and the precision of these modules, especially when evidence is sparse. We outline two such informed priors; one based on co-expression relationships inherent in the data, the other based on an externally provided regulatory network. We validate these results and compare the performance of our approach to Weighted Gene Correlation Network Analysis (WGCNA), another model that features weighted modules. We have, to this end, performed experiments on semi-synthetic data. The results show that HDP, with the addition of a well-informed prior, is able to capture the correct number of modules with increased accuracy. Furthermore, the model becomes robust to changes in the strength of the prior. We conclude by discussing these results and the benefits provided by our approach for gene expression analysis and network validation.
Subject Keywords
Gene expression
,
Analytical models
,
Data models
,
Cancer
,
Biological system modeling
,
Bioinformatics
,
Probabilistic logic
,
Transcriptomics
,
Gene expression analysis
,
Mixed-membership models
,
Bayesian non-parametrics
,
Hierarchical Dirichlet process (HDP)
,
Systems biology
,
Bioinformatics
,
Probabilistic algorithms
URI
https://hdl.handle.net/11511/31264
Journal
IEEE/ACM transactions on computational biology and bioinformatics
DOI
https://doi.org/10.1109/tcbb.2019.2901676
Collections
Graduate School of Informatics, Article