Here are the links to download datasets used in development of AntiCP:



AntiCP Dataset

DatasetDescriptionDownload

Main Dataset:

This dataset contain 225 experimentally validated anticancer (positive examples) and 2250 random or potential
non-anticancer peptides (negative examples).
P    N

Alternate Dataset:

This dataset contain 225 experimentally validated anticancer peptides and 1372 non-anticancer (AMPs without
anticancer activities, negative examples.
P    N

Balanced datasets:1

It is a well known fact that classification techniques, particularly machine learning techniques performed best on balanced datasets. Thus, we
generated balanced datasets for both main and alternate datasets. Our main balanced dataset contain 225 anticancer and 225 non-anticancer
or random peptides (randomly obtained 2250 Swissprot peptides).Similarly, alternate balanced dataset1 contain 225 anticancer and 225
non-anticancer (randomly obtained from 1372 AMPs).
P    N

Balanced datasets:2

It is a well known fact that classification techniques, particularly machine learning techniques performed best on balanced datasets. Thus, we
generated balanced datasets for both main and alternate datasets. Our main balanced dataset contain 225 anticancer and 225 non-anticancer
or random peptides (randomly obtained 2250 Swissprot peptides).Similarly, alternate balanced dataset2 contain 225 anticancer and 225
AMPs (randomly obtained from 1372 AMPs).
P    N

Independent:1

It contains 50 known AntiCPs collected from the literature, which were not included in the training and 50 randomly
generated Non-AntiCPs
P    N

Independent:2

It contains 50 known AntiCPs collected from the literature, which were not included in the training and 50 non-anticancer or AMPs P    N