Prediction of protein subcellular localization based on primary sequence data

2003-01-01
This paper describes a system called prediction of protein subcellular localization (P2SL) that predicts the subcellular localization of proteins in eukaryotic organisms based on the amino acid content of primary sequences using amino acid order. Our approach for prediction is to find the most frequent motifs for each protein (class) based on clustering and then to use these most frequent motifs as features for classification. This approach allows a classification independent of the length of the sequence. Another important property of the approach is to provide a means to perform reverse analysis and analysis to extract rules. In addition to these and more importantly, we describe the use of a new encoding scheme for the amino acids that conserves biological function based on point of accepted mutations (PAM) substitution matrix. We present preliminary results of our system on a two class (dichotomy) classifier. However, it can be extended to multiple classes with some modifications.
COMPUTER AND INFORMATION SCIENCES - ISCIS 2003

Suggestions

Prediction of protein subcellular localization based on primary sequence data
Özarar, Mert; Atalay, Mehmet Volkan; Department of Computer Engineering (2003)
Subcellular localization is crucial for determining the functions of proteins. A system called prediction of protein subcellular localization (P2SL) that predicts the subcellular localization of proteins in eukaryotic organisms based on the amino acid content of primary sequences using amino acid order is designed. The approach for prediction is to nd the most frequent motifs for each protein in a given class based on clustering via self organizing maps and then to use these most frequent motifs as features...
Predicting Protein-Protein Interactions from the Molecular to the Proteome Level
Keskin, Ozlem; Tunçbağ, Nurcan; Gursoy, Attila (2016-04-27)
Identification of protein protein interactions (PPIs) is at the center of molecular biology considering the unquestionable role of proteins in cells. Combinatorial interactions result in a repertoire of multiple functions; hence, knowledge of PPI and binding regions naturally serve to functional proteomics and drug discovery. Given experimental limitations to find all interactions in a proteome, computational prediction/modeling of protein interactions is a prerequisite to proceed on the way to complete int...
Prediction of protein subcellular localization based on primary sequence data
Ozarar, M; Atalay, Mehmet Volkan; Atalay, Rengül (2004-04-30)
Subcellular localization is crucial for determining the functions of proteins. A system called prediction of protein subcellular localization (P2SL) that predicts the subcellular localization of proteins in eukaryotic organisms based on the amino acid content of primary sequences using amino acid order is designed. The approach for prediction is to find the most frequent motifs for each protein in a given class based on clustering via self organizing maps and then to use these most frequent motifs as featur...
Analysis of motifs in microRNA-transcription factor gene regulatory networks
Sürün, Bilge; Acar, Aybar Can; Purutçuoğlu Gazi, Vilda; Department of Bioinformatics (2014)
MicroRNAs are small non-coding RNA molecules which contain 21-25 nucleotides, and function in post transcriptional regulation by inhibiting the translation of mRNA targets. miRNAs typically affect gene regulation by forming composite feed forward circuits (cFFCs) which also comprise a transcription factor (TF) and a target gene. By analyzing these cFFCs, the contribution of miRNAs in altering TF networks can be revealed. These contributions could either be the de-escalation of the target gene repertoire or ...
Prediction of the effects of single amino acid variations on protein functionality with structural and annotation centric modeling
Cankara, Fatma; Tunçbağ, Nurcan; Department of Bioinformatics (2020)
Whole-genome and exome sequencing studies have indicated that genomic variations may cause deleterious effects on protein functionality via various mechanisms. Single nucleotide variations that alter the protein sequence, and thus, the structure and the function, namely non-synonymous SNPs (nsSNP), are associated with many genetic diseases in human. The current rate of manually annotating the reported nsSNPs cannot catch up with the rate of producing new sequencing data. To aid this process, automated compu...
Citation Formats
M. Ozarar, M. V. Atalay, and R. Atalay, “Prediction of protein subcellular localization based on primary sequence data,” COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, pp. 611–618, 2003, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/52956.