Prediction of enzymatic properties of protein sequences based on the enzyme commission nomenclature

Download

index.pdf

Date

2017

Author

Dalkıran, Alperen

Metadata

Show full item record

Item Usage Stats

852
views

74
downloads

The volume of expert manual annotation of biomolecules is steady due to high costs associated with it, although the number of sequenced genomes continues to grow exponentially. Computational methods have been proposed in order to predict the attributes of gene products. The prediction of Enzyme Commission (EC) numbers is a challenging issue in this area. Enzymes have crucial roles in metabolic pathways, therefore they are widely employed in biotechnological and biomedical pplications. EC numbers are numerical representations of enzymatic functions based on chemical reactions that they catalyze. Due to the cost and labor extensiveness of in vitro experiments EC classification annotation of catalytically active proteins are limited. Therefore, computational tools have been proposed to classify these proteins to annotate them with EC nomenclature. However, the performance of existing tools indicates that EC number prediction still requires improvement. Here, we present an EC number prediction tool, ECPred, to obtain predictions for large-scale protein sets. In ECPred, we employed hierarchical data preparation and evaluation steps by utilizing the functional relations among the four levels of EC annotation system. The main features that distinguish our approach from existing studies are the use of a combination of independent classifiers, and novel data preparation and evaluation methods. Totally, 858 EC classifiers are trained which consists of 6 main, 55 subfamily, 163 sub-subfamily and 634 substrate EC class classifiers. The average F-score value of 0.99 is obtained for all EC classes using the validation datasets. Enzyme or non-enzyme classification is incorporated into ECPred along with a hierarchical prediction approach. To the best of our knowledge, this is the first study that predicts the enzymatic function of proteins starting from Level 0 (enzyme/non-enzyme) going up to Level 4 (substrate class). Finally, ECPred is compared with other similar tools on independent test sets and ECPred obtained better results than existing tools, however, the results show that there is still room for improvement.

Subject Keywords

Enzymes., Biomolecules., Machine learning., Sequences (Mathematics).

URI

http://etd.lib.metu.edu.tr/upload/12621380/index.pdf
https://hdl.handle.net/11511/26707

Collections

Graduate School of Natural and Applied Sciences, Thesis

Suggestions

OpenMETU
Core

Characterization of Metamaterials using a New Design and Measurement Technique for Microstrip Circuit Applications Nesimoglu, Tayfun; Sabah, Cumali (2014-12-14) Characterization of a metamaterial based on an S-Shaped resonator and a feeding transmission line using a new measurement technique is studied in this paper. First, the metamaterial is designed and simulated using conventional methods. Then, it is fabricated on microstrip and excited by using a microstrip transmission feedline. Measurements were carried out using a vector network analyzer by (SubMiniature version A) SMA coaxial terminations without needing any waveguide transmission lines. For both simulate...
Prediction of the effects of single amino acid variations on protein functionality with structural and annotation centric modeling Cankara, Fatma; Tunçbağ, Nurcan; Department of Bioinformatics (2020) Whole-genome and exome sequencing studies have indicated that genomic variations may cause deleterious effects on protein functionality via various mechanisms. Single nucleotide variations that alter the protein sequence, and thus, the structure and the function, namely non-synonymous SNPs (nsSNP), are associated with many genetic diseases in human. The current rate of manually annotating the reported nsSNPs cannot catch up with the rate of producing new sequencing data. To aid this process, automated compu...
Preparation and Characterization of Amino-Functionalized Zeolite/SiO2 Materials for Trypsin-Chymotrypsin Co-immobilization Dogan, Demet; Sezer, Selda; Ulu, Ahmet; KÖYTEPE, SÜLEYMAN; ATEŞ, BURHAN (2021-08-01) Inorganic supports have attracted increased attention in enzyme immobilization since they not only improve enzyme stability but also reduce the final cost of enzymatic reactions. Herein, we explored the suitability of the amino-functionalized zeolite/SiO2 materials to co-immobilize trypsin-chymotrypsin mixture. For this purpose, the trypsin-chymotrypsin mixture was co-immobilized on the amino-functionalized zeolite/SiO2 materials and the immobilization yield was 80.7 +/- 7.6%. The pre-support and its modifi...
Estimation of Anaerobic Debromination Rate Constants of PBDE Pathways Using an Anaerobic Dehalogenation Model KARAKAS, Filiz; İmamoğlu, İpek (2017-04-01) This study aims to estimate anaerobic debromination rate constants (k(m)) of PBDE pathways using previously reported laboratory soil data. k(m) values of pathways are estimated by modifying a previously developed model as Anaerobic Dehalogenation Model. Debromination activities published in the literature in terms of bromine substitutions as well as specific microorganisms and their combinations are used for identification of pathways. The range of estimated k(m) values is between 0.0003 and 0.0241 d(-1). T...
Discovering functional interaction patterns in protein-protein interaction networks Turanalp, Mehmet E.; Can, Tolga (Springer Science and Business Media LLC, 2008-06-11) Background: In recent years, a considerable amount of research effort has been directed to the analysis of biological networks with the availability of genome-scale networks of genes and/or proteins of an increasing number of organisms. A protein-protein interaction (PPI) network is a particular biological network which represents physical interactions between pairs of proteins of an organism. Major research on PPI networks has focused on understanding the topological organization of PPI networks, evolution...

Citation Formats

A. Dalkıran, “Prediction of enzymatic properties of protein sequences based on the enzyme commission nomenclature,” M.S. - Master of Science, Middle East Technical University, 2017.