Datasets used |
Our data can be divided as:
|
Amino acid preference analysis |
Amino acid composition plot of different therapeutic peptides with toxin peptides
|
Prediction Approach |
In the present study, SVM classifier was used from freely available SVM_light package. This package is powerful as well as user-friendly where we can adjust the parameters and kernel functions like Linear, Polynomial, RBF and Sigmoid. |
Input features for SVM |
In this study we have been used various features as SVM input for the prediction of toxic peptides. |
1. Amino Acid Composition: Amino Acid Composition is the fraction of each amino acid present in a peptide. There are 20 vectors generated in which one corresponds to one amino acid and these vectors used for as SVM input.
|
Hybrid Method |
We observed that there are number of motifs present in the toxic peptides. So, we have used this motif information for the prediction of toxic peptides. Motifs in toxic peptides were searched by the MEME software and then query sequences were hit with the toxic peptide motif list by MAST software. If hit was found against a peptide, its SVM score is increased by 5. So, it will be predicted as toxic peptide irrespective of SVM threshold. This approach increases the reliability of our prediction method. |
Quantitative Matrix |
Quantitative matrix was generated for each residue on the basis of contribution of eve\ry residue on each position.The quantitative matrix was generated on the basis of probability or frequency of amino acid at \particular position.The performance was evaluated by using 5 fold cross validation technique. |
Evaluation or Performance |
Five-fold cross validation technique has been used. Four sets are used for training and remaining one in used for testing, i\n this way the process repeats five times. Evaluation of performance of different SVM modules has been done by calculating a\ccuracy and Matthew's correlation coefficient (MCC). |