Pattern search in pathogenic bacterial proteins for localization and secretory systems

Download
2015
Özcan, Orhan
Computational prediction of bacterial protein localization (BPL) is a very useful tool which provides clues about protein function. For pathogenic proteins in particular, detection of their subcellular location and their secretory pathways have great implications for vaccine and drug design. Cell surface and/or secreted proteins of microbes can also be used as biomarkers for sensor applications. At present, there are numerous BPL prediction algorithms and programs available, however, most of them give false positive results in order to maximize the number of positive predictions. Moreover, state of the art algorithms, specifically PSORT, successfully identify protein localization for every organism from any given sequence information but they usually fail in pathogenic sequences. Because the most of the pathogenic proteins are surface-localized, there is an imminent need for pathogen-specific secretion motif search algorithms as well. These motifs would also provide information on bacterial protein localization. In the present work, we built databases of pathogenic sequences and searched for selected 5 to 18 amino acid long motifs as a new approach, namely Pathogenic Sequence Motif Search (PSMS). The algorithm is based on a total of 52 distinct secretion-associated patterns covering 6 different secretory pathways for the prediction of surface and secreted proteins. The datasets for each of the following groups of proteins were next established for our validation studies which involved the tests for the success rate of these 52 patterns: Secreted, immunoreactive and patented vaccine, cytoplasmic and orphan-secreted with 3241, 1740, 2582 and 2533 members, respectively. A total of 3241 proteins in secreted proteins dataset represented TISSS, T2SS, T3SS, T4SS, T5SS and T6SS systems of secretion with 954, 668, 381, 770, 221 and 274 protein sequences, respectively. Cytoplasmic protein dataset, on the other hand, was used to exclude certain candidate patterns. 43 out of 52 patterns were truly secretion-related, pointing directly to a specific secretion system. Rest 9 patterns were found in secreted proteins though not related to a specific secretion system. Additionaly, LC-MS data formerly obtained in our laboratories from Bordetella pertussis surface proteome and secretome analyses were also included in the secreted protein sequence dataset. The selected patterns were demonstrated for instance in 503 out of a total of 1740 proteins in the immunoreactive protein dataset. With the help of our patterns, 75 proteins which were formerly predicted to have an intracellular localization and mistakenly ruled out as potential drug targets/vaccine candidates were successfully predicted as surface- associated/secreted ones. Besides the development of PSMS program predicting pathogenic sequences with high accuracy, the separate databases constructed in this work with respect to immunoreactivity and distinct secretory pathways are expected to constitute valuable bioinformatics resources for researchers of the field.

Suggestions

Combinatorial Tau pseudophosphorylation: markedly different regulatory effects on microtubule assembly and dynamic instability than the sum of the individual parts.
Kiriş, Erkan; Sargin, ME; Gaylord, MR; Altinok, A; Rose, K; Manjunath, BS; Jordan, MA; Wilson, L; Feinstein, SC (American Society for Biochemistry & Molecular Biology (ASBMB), 2011-04-22)
Tau is a multiply phosphorylated protein that is essential for the development and maintenance of the nervous system. Errors in Tau action are associated with Alzheimer disease and related dementias. A huge literature has led to the widely held notion that aberrant Tau hyperphosphorylation is central to these disorders. Unfortunately, our mechanistic understanding of the functional effects of combinatorial Tau phosphorylation remains minimal. Here, we generated four singly pseudophosphorylated Tau proteins ...
Endogenous signal peptides in recombinant protein production by Pichia pastoris: From in-silico analysis to fermentation
Massahi, Aslan; Çalık, Pınar (2016-11-07)
For extracellular recombinant protein production, the efficiency of five endogenous secretion signal peptides (SPs) of Pichia pastoris, SP13 (MLSTILNIFILLLFIQASLQ), SP23 (MKILSALLLLFTLAFA), SP24 (MKVSTTKFLAVFLLVRLVCA), SP26 (MWSLFISGLLIFYPLVLG), SP34 (MRPVLSLLLLLASSVLA), selected based on their D-score which quantifies the signal peptide-ness of a given sequence segment, was investigated using recombinant human growth hormone (rhGH) as the model protein. The expression was conducted under glyceraldehyde-3-p...
Structural Insights into Alternate Aggregated Prion Protein Forms
POLANO, maurizio; Bek, Alpan; BENETTİ, federico; lazzarino, marco; LEGNAME, giuseppe (Elsevier BV, 2009-11-13)
The conversion of the cellular form of the prion protein (PrPC) to an abnormal, alternatively folded isoform (PrPSc) is the central event in prion diseases or transmissible spongiform encephalopathies. Recent studies have demonstrated de novo generation of murine prions from recombinant prion protein (recPrP) after inoculation into transgenic and wild-type mice. These so-called synthetic prions lead to novel prion diseases with unique neuropathological and biochemical features. Moreover, the use of recPrP i...
Investigation of structural and functional properties of OMPG mutants by FTIR spectroscopy
Yılmaz, İrem; Bek, Alpan; Korkmaz Özkan, Filiz; Department of Physics (2017)
The subject of the current study is a bacterial protein OmpG from Escherichia coli. OmpG-16S and OmpG-16SL are the mutants of OmpG that are genetically engineered for potential biotechnological applications. In this study, structural and functional similarities and/or differences of these mutants with respect to the wild type OmpG were revealed using IR spectroscopy. Also, a new spectrometer attachment enabling in situ 1H/2H exchange was developed. For the collection of mass amount of spectral data from eac...
DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks
Rifaioğlu, Ahmet Süreyya; Martin, Maria Jesus; Atalay, Rengül; Atalay, Mehmet Volkan (2019-05-14)
Automated protein function prediction is critical for the annotation of uncharacterized protein sequences, where accurate prediction methods are still required. Recently, deep learning based methods have outperformed conventional algorithms in computer vision and natural language processing due to the prevention of overfitting and efficient training. Here, we propose DEEPred, a hierarchical stack of multi-task feed-forward deep neural networks, as a solution to Gene Ontology (GO) based protein function pred...
Citation Formats
O. Özcan, “Pattern search in pathogenic bacterial proteins for localization and secretory systems,” Ph.D. - Doctoral Program, Middle East Technical University, 2015.