HPO2GO: prediction of human phenotype ontology term associations for proteins using cross ontology annotation co-occurrences

2018-8-2
Doğan, Tunca
Analysing the relationships between biomolecules and the genetic diseases is a highly active area of research, where the aim is to identify the genes and their products that cause a particular disease due to functional changes originated from mutations. Biological ontologies are frequently employed in these studies, which provides researchers with extensive opportunities for knowledge discovery through computational data analysis. In this study, a novel approach is proposed for the identification of relationships between biomedical entities by automatically mapping phenotypic abnormality defining HPO terms with biomolecular function defining GO terms, where each association indicates the occurrence of the abnormality due to the loss of the biomolecular function expressed by the corresponding GO term. The proposed HPO2GO mappings were extracted by calculating the frequency of the co-annotations of the terms on the same genes/proteins, using already existing curated HPO and GO annotation sets. This was followed by the filtering of the unreliable mappings that could be observed due to chance, by statistical resampling of the co-occurrence similarity distributions. Furthermore, the biological relevance of the finalized mappings were discussed over selected cases, using the literature. The resulting HPO2GO mappings can be employed in different settings to predict and to analyse novel gene/protein—ontology term—disease relations. As an application of the proposed approach, HPO term—protein associations (i.e., HPO2protein) were predicted. In order to test the predictive performance of the method on a quantitative basis, and to compare it with the state-of-the-art, CAFA2 challenge HPO prediction target protein set was employed. The results of the benchmark indicated the potential of the proposed approach, as HPO2GO performance was among the best (<jats:italic>Fmax</jats:italic> = 0.35). The automated cross ontology mapping approach developed in this work may be extended to other ontologies as well, to identify unexplored relation patterns at the systemic level. The datasets, results and the source code of HPO2GO are available for download at: <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/cansyl/HPO2GO">https://github.com/cansyl/HPO2GO

Suggestions

GOPred: GO Molecular Function Prediction by Combined Classifiers
Sarac, Oemer Sinan; Atalay, Mehmet Volkan; Atalay, Rengül (Public Library of Science (PLoS), 2010-08-31)
Functional protein annotation is an important matter for in vivo and in silico biology. Several computational methods have been proposed that make use of a wide range of features such as motifs, domains, homology, structure and physicochemical properties. There is no single method that performs best in all functional classification problems because information obtained using any of these features depends on the function to be assigned to the protein. In this study, we portray a novel approach that combines ...
Epigenetic Mechanisms Underlying the Dynamic Expression of Cancer-Testis Genes, PAGE2, -2B and SPANX-B, during Mesenchymal-to-Epithelial Transition
Yilmaz-Ozcan, Sinem; Sade, Asli; Kucukkaraduman, Baris; Kaygusuz, Yasemin; Senses, Kerem Mert; Banerjee, Sreeparna; GÜRE, ALİ OSMAY (Public Library of Science (PLoS), 2014-09-17)
Cancer-testis (CT) genes are expressed in various cancers but not in normal tissues other than in cells of the germline. Although DNA demethylation of promoter-proximal CpGs of CT genes is linked to their expression in cancer, the mechanisms leading to demethylation are unknown. To elucidate such mechanisms we chose to study the Caco-2 colorectal cancer cell line during the course of its spontaneous differentiation in vitro, as we found CT genes, in particular PAGE2, -2B and SPANX-B, to be up-regulated duri...
Autoinflammation in addition to combined immunodeficiency: SLC29A3 gene defect
Cagdas, Deniz; Surucu, Naz; TAN, ÇAĞMAN; ÖZGÜL, RIZA KÖKSAL; Akkaya-Ulum, Yeliz Z.; Aydinoglu, Ayse Tulay; Aytac, Selin; GÜMRÜK, FATMA; Balci-Hayta, Burcu; Balci-Peynircioglu, Banu; ÖZEN, SEZA; Gürsel, Mayda; Tezcan, Ilhan (Elsevier BV, 2020-05-01)
Introduction: H Syndrome is an autosomal recessive (AR) disease caused by defects in SLCA29A3 gene. This gene encodes the equilibrative nucleoside transporter, the protein which is highly expressed in spleen, lymph node and bone marrow. Autoinflammation and autoimmunity accompanies H Syndrome (HS).
Novel BRCA2 pathogenic genotype and breast cancer phenotype discordance in monozygotic triplets
Duzkale, Neslihan; EYERCİ, NİLNUR; Oksuzoglu, Berna; Teker, Taner; Kandemir, Olcay (Elsevier BV, 2020-04-01)
BRCA1/2 genes with high-penetrance are tumor suppressor and tumor susceptibility genes that play important roles in the homologous recombination mechanism in DNA repair and increase breast cancer risk. Variants in BRCA1 or BRCA2 are the main causes of familial and early-onset breast cancer. This study investigated pathogenic variant belonging to the BRCA2 gene splice region in monozygotic triplets. A 44-year-old woman was diagnosed with breast cancer when she was 32 years old. Her monozygotic sister had a h...
ImaGene: a convolutional neural network to quantify natural selection from genomic data
Torada, Luis; Lorenzon, Lucrezia; Beddis, Alice; Isildak, Ulas; Pattini, Linda; Mathieson, Sara; Fumagalli, Matteo (Springer Science and Business Media LLC, 2019-11-22)
Background: The genetic bases of many complex phenotypes are still largely unknown, mostly due to the polygenic nature of the traits and the small effect of each associated mutation. An alternative approach to classic association studies to determining such genetic bases is an evolutionary framework. As sites targeted by natural selection are likely to harbor important functionalities for the carrier, the identification of selection signatures in the genome has the potential to unveil the genetic mechanisms...
Citation Formats
T. Doğan, “HPO2GO: prediction of human phenotype ontology term associations for proteins using cross ontology annotation co-occurrences,” PeerJ, 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/51628.