creators_name: Rashid, Mamoon creators_name: Ramasamy, Sumathy creators_name: Raghava, G.P.S. type: article datestamp: 2011-12-08 18:57:26 lastmod: 2011-12-08 18:57:26 metadata_visibility: show title: A simple approach for predicting protein-protein interactions. ispublished: pub subjects: QH301 full_text_status: none keywords: Protein interaction, protein sequence, support vector machine, interactome, protein interaction prediction, Position Specific Scoring Matrix, note: Copyright of this article belongs to Bentham Science abstract: The availability of an increased number of fully sequenced genomes demands functional interpretation of the genomic information. Despite high throughput experimental techniques and in silico methods of predicting protein-protein interaction (PPI); the interactome of most organisms is far from completion. Thus, predicting the interactome of an organism is one of the major challenges in the post-genomic era. This manuscript describes Support Vector Machine (SVM) based models that have been developed for discriminating interacting and non-interacting pairs of proteins from their amino acid sequence. We have developed SVM models using various types of sequence compositions e.g. amino acid, dipeptide, biochemical property, split amino acid and pseudo amino acid composition. We also developed SVM models using evolutionary information in the form of Position Specific Scoring Matrix (PSSM) composition. We achieved maximum Matthews's correlation coefficient (MCC) of 1.00, 0.52 and 0.74 for Escherichia coli, Saccharomyces cerevisiae, and Helicobacter pylori, using dipeptide based SVM model at default threshold. It was observed that the performance of a prediction model depends on the dataset used for training and testing. In case of E. coli MCC decreased from 1.0 to 0.67 when evaluated on a new dataset. In order to understand PPI in different cellular environment, we developed species-specific and general models. It was observed that species-specific models are more accurate than general models. We conclude that the primary amino acid sequence based descriptors could be used to differentiate interacting from non-interacting protein pairs. Some amino acids tend to be favored in interacting pairs than non-interacting ones. Finally, a web server has been developed for predicting protein-protein interactions. date: 2010-11 date_type: published publication: Current protein & peptide science volume: 11 number: 7 publisher: Bentham Science pagerange: 589-600 refereed: TRUE issn: 1875-5550 official_url: http://www.benthamdirect.org/pages/content.php?CPPS/2010/00000011/00000007/0009K.SGM related_url_url: http://www.benthamdirect.org/pages/content.php?CPPS/2010/00000011/00000007/0009K.SGM related_url_type: pub citation: Rashid, Mamoon and Ramasamy, Sumathy and Raghava, G.P.S. (2010) A simple approach for predicting protein-protein interactions. Current protein & peptide science, 11 (7). pp. 589-600. ISSN 1875-5550