Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Prediction of enzymatic properties of protein sequences based on the enzyme commission nomenclature
Download
index.pdf
Date
2017
Author
Dalkıran, Alperen
Metadata
Show full item record
Item Usage Stats
837
views
67
downloads
Cite This
The volume of expert manual annotation of biomolecules is steady due to high costs associated with it, although the number of sequenced genomes continues to grow exponentially. Computational methods have been proposed in order to predict the attributes of gene products. The prediction of Enzyme Commission (EC) numbers is a challenging issue in this area. Enzymes have crucial roles in metabolic pathways, therefore they are widely employed in biotechnological and biomedical pplications. EC numbers are numerical representations of enzymatic functions based on chemical reactions that they catalyze. Due to the cost and labor extensiveness of in vitro experiments EC classification annotation of catalytically active proteins are limited. Therefore, computational tools have been proposed to classify these proteins to annotate them with EC nomenclature. However, the performance of existing tools indicates that EC number prediction still requires improvement. Here, we present an EC number prediction tool, ECPred, to obtain predictions for large-scale protein sets. In ECPred, we employed hierarchical data preparation and evaluation steps by utilizing the functional relations among the four levels of EC annotation system. The main features that distinguish our approach from existing studies are the use of a combination of independent classifiers, and novel data preparation and evaluation methods. Totally, 858 EC classifiers are trained which consists of 6 main, 55 subfamily, 163 sub-subfamily and 634 substrate EC class classifiers. The average F-score value of 0.99 is obtained for all EC classes using the validation datasets. Enzyme or non-enzyme classification is incorporated into ECPred along with a hierarchical prediction approach. To the best of our knowledge, this is the first study that predicts the enzymatic function of proteins starting from Level 0 (enzyme/non-enzyme) going up to Level 4 (substrate class). Finally, ECPred is compared with other similar tools on independent test sets and ECPred obtained better results than existing tools, however, the results show that there is still room for improvement.
Subject Keywords
Enzymes.
,
Biomolecules.
,
Machine learning.
,
Sequences (Mathematics).
URI
http://etd.lib.metu.edu.tr/upload/12621380/index.pdf
https://hdl.handle.net/11511/26707
Collections
Graduate School of Natural and Applied Sciences, Thesis
Suggestions
OpenMETU
Core
Characterization of Metamaterials using a New Design and Measurement Technique for Microstrip Circuit Applications
Nesimoglu, Tayfun; Sabah, Cumali (2014-12-14)
Characterization of a metamaterial based on an S-Shaped resonator and a feeding transmission line using a new measurement technique is studied in this paper. First, the metamaterial is designed and simulated using conventional methods. Then, it is fabricated on microstrip and excited by using a microstrip transmission feedline. Measurements were carried out using a vector network analyzer by (SubMiniature version A) SMA coaxial terminations without needing any waveguide transmission lines. For both simulate...
Prediction of the effects of single amino acid variations on protein functionality with structural and annotation centric modeling
Cankara, Fatma; Tunçbağ, Nurcan; Department of Bioinformatics (2020)
Whole-genome and exome sequencing studies have indicated that genomic variations may cause deleterious effects on protein functionality via various mechanisms. Single nucleotide variations that alter the protein sequence, and thus, the structure and the function, namely non-synonymous SNPs (nsSNP), are associated with many genetic diseases in human. The current rate of manually annotating the reported nsSNPs cannot catch up with the rate of producing new sequencing data. To aid this process, automated compu...
Preparation and Characterization of Amino-Functionalized Zeolite/SiO2 Materials for Trypsin-Chymotrypsin Co-immobilization
Dogan, Demet; Sezer, Selda; Ulu, Ahmet; KÖYTEPE, SÜLEYMAN; ATEŞ, BURHAN (2021-08-01)
Inorganic supports have attracted increased attention in enzyme immobilization since they not only improve enzyme stability but also reduce the final cost of enzymatic reactions. Herein, we explored the suitability of the amino-functionalized zeolite/SiO2 materials to co-immobilize trypsin-chymotrypsin mixture. For this purpose, the trypsin-chymotrypsin mixture was co-immobilized on the amino-functionalized zeolite/SiO2 materials and the immobilization yield was 80.7 +/- 7.6%. The pre-support and its modifi...
Estimation of Anaerobic Debromination Rate Constants of PBDE Pathways Using an Anaerobic Dehalogenation Model
KARAKAS, Filiz; İmamoğlu, İpek (2017-04-01)
This study aims to estimate anaerobic debromination rate constants (k(m)) of PBDE pathways using previously reported laboratory soil data. k(m) values of pathways are estimated by modifying a previously developed model as Anaerobic Dehalogenation Model. Debromination activities published in the literature in terms of bromine substitutions as well as specific microorganisms and their combinations are used for identification of pathways. The range of estimated k(m) values is between 0.0003 and 0.0241 d(-1). T...
Discovering functional interaction patterns in protein-protein interaction networks
Turanalp, Mehmet E.; Can, Tolga (Springer Science and Business Media LLC, 2008-06-11)
Background: In recent years, a considerable amount of research effort has been directed to the analysis of biological networks with the availability of genome-scale networks of genes and/or proteins of an increasing number of organisms. A protein-protein interaction (PPI) network is a particular biological network which represents physical interactions between pairs of proteins of an organism. Major research on PPI networks has focused on understanding the topological organization of PPI networks, evolution...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Dalkıran, “Prediction of enzymatic properties of protein sequences based on the enzyme commission nomenclature,” M.S. - Master of Science, Middle East Technical University, 2017.