Reference: Chaudhary et., al., (2016) A Web Server and Mobile App for Computing Hemolytic Potency of Peptides. Sci Rep. 2016.


Algorithm of HemoPI

Datasets-

The datasets were derived mainly from Hemolytik database. We have developed two following datasets:

Main dataset-

This comprises of 552 experimentally validated highly hemolytic peptide sequences from Hemolytik Database as positive dataset and same number of peptide sequences randomly generated fron SwissProt as negative dataset as shown in following figure.
algo1.png

Alternative dataset-

In this dataset (alternate dataset), 552 experimentally validated highly hemolytic peptides were taken as positive examples and 462 experimentally validated non-hemolytic/poor hemolytic peptides (i.e. peptides having IC50>200uM, <10% hemolysis at 100 or >100uM concentration and MHC >500uM) were taken as negative example as shown in following figure.
algo2.png
The algorithm of HemoPI is based on the following models:

Motif based model

A number of motifs were observed in hemolytic peptides. Therefore, we used this information for differentiating between hemolytic peptides and non-hemolytic peptides. These motifs were used for distinguishing hemolytic peptides from non-hemolytic peptides.

Hybrid approach (Motif and SVM)

To increase the robustness of prediction, we used the hybrid model by integrating motif based and SVM based methods. If the user's query peptide has the any one of the motifs as those from our datasets, we assign peptide type to it based on motif that it contains. SVM model of prediction is used in case no motif is found.

References for the Physicochemical properties calculated for the peptides

HydrophobicityEISD840101
HydrophilicityHOPT810101
Steric HinderanceCHAM810101
Net HydrogenFAUJ880109
SolvationEISD860101
ChargeKLEP840101
HydropathyKYTJ820101
AmphiphilicityMITS020101
WeightFASG760101

Rules for hemolytic peptides used in HemoPI-3 dataset

Amount of HemolysisConcentration of peptide tested
≥ 5%≤10µM
≥ 10%≤ 20µM
≥ 15%≤ 50µM
≥ 20%≤ 100µM
≥ 30%≤ 200µM
≥ 50%≤ 300µM
MHC≤ 50µM
Removed peptides with hemolysis between 0-5%, which have been tested at concentration < 5µM
Borderline cases, which do not follow these rules have been removed.

Rules for low (non-hemolytic) peptides used in HemoPI-3 dataset

Amount of HemolysisConcentration of peptide tested
≤ 2%≥ 10µM
≤5%≥ 20µM
≤ 10%≥ 50µM
≤ 15%≥ 100µM
≤ 20%≥ 200 µM
≤ 30%≥ 300µM
≤ 50%≥ 500µM
MHC≥ 100µM

Help about HemoPI
This section helps the user to get the details of all the modules with explanatory text and supportive figures. User can directly go to the particular section by clicking on the names of the modules below.


Hemolytic Potency Virtual Screening
Protein Mapping Q. Matrices


Hemolytic Potency

This module helps the users to predict the hemolytic potency of peptides and design peptides with the option to increase/decrease hemolytic potency of the peptide.Further mutants are generated of the original peptide and also physiochemical properties are displayed according to the user's choice.
RESULT on clicking Run Analysis! button

1.To predict hemolytic potency type or paste peptide sequence.

2.Choose the prediction model

3.Select the physiochemical properties to be predicted.

4.Click the Run Analysis button to submit query.

5.Displays the prediction results for query peptide.

6.Displays the probablity score of the peptide and its mutants.

7.Shows all possible single position mutants along with their probability score and physicochemical properties.





Virtual Screening

This tool helps the users to submit multiple peptides as a single query in single fasta format.This module will help the user to choose best peptide from batch of peptides.Mutants of each peptide and their physiochemical properties are displayed accordingly.
RESULT on clicking Run Analysis! button
RESULT on clicking on any of the sequences

1.Type or paste peptide sequences and choose the prediction model.

2.Select the physiochemical properties to be predicted.

3.Click the Run Analysis button to submit query.

4.Displays the original peptides.

5.Displays the sortable physiochemical properties.

6.Displays the original peptide.

7.Displays the physiochemical properties.

8.Red colour displays the position of mutation.





Protein Mapping

This module helps the users to scan the hemolytic region from the protein sequence.The user can select the peptide fragment length and then all possible overlapping peptides are generated and further their mutants along with svm score are displayed.Here results are displayed in two forms a)Tabular form b)Graphical form.

RESULTS in Tabular Form

RESULTS in Graphical Form

1.Type or paste protein sequence.

2.Select peptide fragment length for which all possible overlapping peptides will be generated and select the prediction model.

3.Select the physiochemical properties to be predicted.

4.Displays the original query peptide.

5.Displays the overlapping peptides

6.Original sequence of whole protein .

7.Graphical representation of results for whole protein.

8.Click to see graphical results of overlapping peptide fragment





Quantitative Matrices

This tool represents quantitative matrix generated on the basis of relative frequency of each amino acid at every position (1-30).Quantitative matrix is displayed for each data set, with monopeptide , dipeptide amino acid composition and physiochemical properties.

1.Select the type of quantitative matrix to be displayed

2.List of amino acid

3.Position of each residue