Automated Negative Gene Ontology Based Functional Predictions for Proteins with UniGOPred

2018-07-07
Doğan, Tunca
Rifaioğlu, Ahmet Süreyya
Saidi, Rabi
Martin, Maria Jesus
Atalay, Mehmet Volkan
Atalay, Rengül
Functional annotation of biomolecules in the gene and protein databases is mostly incomplete. This is especially valid for multi-domain proteins. There is a grey area in the protein function data resources, where the truly negative functions and the ones possessed by the protein but have not been discovered or documented yet (i.e. false negatives), reside together. In many cases the information about the functions absent from the target biomolecule can be as important as the assigned functions. It’s possible to resolve a portion of this grey area by predicting the functions that the target proteins most probably do not possess. In this study, we present an approach to produce negative functional annotations for protein sequences, along with regular positive associations. Using this approach, we have developed an automated function prediction tool "UniGOPred". The negative prediction performance (recall) was measured as 0.82 for both MF and BP, and 0.66 for CC GO terms (with prediction scores ≤ 0.3), in cross-validation. To the best of our knowledge, the ability of a protein function prediction method to predict negative functions using sequence features is investigated here for the first time.
26th Conference on Intelligent Systems for Molecular Biology (2018)

Suggestions

Inference of Gene Regulatory Networks Via Multiple Data Sources and a Recommendation Method
Ozsoy, Makbule Gulcin; Polat, Faruk; Alhajj, Reda (2015-11-12)
Gene regulatory networks (GRNs) are composed of biological components, including genes, proteins and metabolites, and their interactions. In general, computational methods are used to infer the connections among these components. However, computational methods should take into account the general features of the GRNs, which are sparseness, scale-free topology, modularity and structure of the inferred networks. In this work, observing the common aspects between recommendation systems and GRNs, we decided to ...
Comparing Clustering Techniques for Real Microarray Data
Purutçuoğlu Gazi, Vilda (2012-08-29)
The clustering of genes detected as significant or differentially expressed provides useful information to biologists about functions and functional relationship of genes. There are variant types of clustering methods that can be applied in genomic data. These are mainly divided into the two groups, namely, hierarchical and partitional methods. In this paper, as the novelty, we perform a detailed clustering analysis for the recently collected boron microarray dataset to investigate biologically more interes...
Using Adaptive Neuro-Fuzzy Inference System for Classification of Microarray Gene Expression Cancer Profiles
Haznedar, Bülent; Arslan, Mustafa Turan; Kalınlı, Adem (2018-05-01)
Microarray is a technology that enables simultaneously analysis of thousands of genes in DNA structure depending on the advances in biochemistry. With this technology, it has become possible to diagnose and treat heredity diseases by analyzing thousands of gene expression levels. This study proposes an artificial intelligence method, Adaptive neuro-fuzzy inference system (ANFIS), to classify cancer gene expression profiles. The findings obtained with the proposed ANFIS approach are compared with the results...
Integer linear programming based solutions for construction of biological networks
Eren Özsoy, Öykü; Can, Tolga; Department of Health Informatics (2014)
Inference of gene regulatory or signaling networks from perturbation experiments and gene expression assays is one of the challenging problems in bioinformatics. Recently, the inference problem has been formulated as a reference network editing problem and it has been show that finding the minimum number of edit operations on a reference network in order to comply with perturbation experiments is an NP-complete problem. In this dissertation, we propose linear programming based solutions for reconstruction o...
Optimization of nucleic acid delivery via cationic polymers for genome engineering
Öktem, Ayşegül; Erson Bensan, Ayşe Elif; Department of Molecular Biology and Genetics (2019)
One of the most challenging aspects of genome engineering is the delivery of genome editing components such as plasmids, oligonucleotides, RNA and protein. In this work, in-house synthetized cationic polymer poly (2-hydroxypropylene imine) (pHP) was tested in order to achieve substantial delivery efficiency while preserving high culture viability. Applicability of this cationic polymer mediated nucleic acid delivery method for both plant and mammalian cells were demonstrated. Several parameters of plasmid a...
Citation Formats
T. Doğan, A. S. Rifaioğlu, R. Saidi, M. J. Martin, M. V. Atalay, and R. Atalay, “Automated Negative Gene Ontology Based Functional Predictions for Proteins with UniGOPred,” presented at the 26th Conference on Intelligent Systems for Molecular Biology (2018), Chicago, Amerika Birleşik Devletleri, 2018, Accessed: 00, 2021. [Online]. Available: https://www.iscb.org/cms_addon/conferences/ismb2018/function.php.