Here are the links to download the datasets used in the development of AntiCP:
AntiCP Dataset | ||
---|---|---|
Dataset | Description | Download |
Main Dataset: | This dataset contains 225 experimentally validated anticancer peptides (positive examples) and 2250 random or potential non-anticancer peptides (negative examples). | P N |
Alternate Dataset: | This dataset contains 225 experimentally validated anticancer peptides and 1372 non-anticancer peptides (AMPs without anticancer activity, negative examples). | P N |
Balanced datasets:1 | It is well known that classification techniques, particularly machine learning techniques, perform best on balanced datasets. Thus, we generated balanced datasets for both the main and alternate datasets. Our main balanced dataset contains 225 anticancer and 225 non-anticancer or random peptides (randomly obtained from the 2250 Swissprot peptides). Similarly, the alternate balanced dataset1 contains 225 anticancer and 225 non-anticancer peptides (randomly obtained from the 1372 AMPs). | P N |
Independent:1 | It contains 50 known AntiCPs collected from the literature, which were not included in training, and 50 randomly generated Non-AntiCPs. | P N |
Independent:2 | It contains 50 known AntiCPs collected from the literature, which were not included in training, and 50 non-anticancer peptides or AMPs. | P N |
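
The balanced datasets described in the table are built by randomly drawing as many negative peptides as there are positive ones. The Python sketch below shows one way such a draw could be done. It is an illustration only, not the procedure actually used for AntiCP: the function name `make_balanced`, the fixed seed, and the placeholder peptide labels are all assumptions, and the format of the downloadable P and N files is not assumed here.

```python
import random


def make_balanced(positives, negatives, seed=0):
    """Pair every positive with an equal-sized random sample of negatives.

    Hypothetical helper: sampling is without replacement, and the seed
    is fixed only so that the draw is reproducible.
    """
    if len(negatives) < len(positives):
        raise ValueError("need at least as many negatives as positives")
    rng = random.Random(seed)
    sampled = rng.sample(negatives, len(positives))
    return list(positives), sampled


# Placeholder labels standing in for real peptide sequences (assumption).
positives = [f"POS{i}" for i in range(225)]
negatives = [f"NEG{i}" for i in range(2250)]

bal_pos, bal_neg = make_balanced(positives, negatives)
print(len(bal_pos), len(bal_neg))
```

The same call with a 1372-element negative list would correspond to the alternate balanced dataset. Sampling without replacement keeps every negative peptide in the balanced set distinct.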