Download PEP2D Dataset
The main PEP2D dataset consists of 3107 unique peptides and was created with the following protocol. A total of 5778 peptide chains with sequence lengths between 5 and 10 residues were extracted from the PDB, covering both X-ray and NMR structures. After removing chains in which unknown amino acids (X) made up more than 10% of the composition, 3107 unique peptides remained; this set is termed the PEP2D dataset. Download
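The filtering protocol described above can be sketched roughly as follows. This is a minimal illustration, not the authors' actual pipeline: the function name `filter_peptides` and its parameters are hypothetical, the length range is assumed to be inclusive, and duplicates are assumed to be removed by exact sequence match after the other filters.

```python
def filter_peptides(chains, min_len=5, max_len=10, max_unknown_fraction=0.10):
    """Return unique peptide sequences that pass the length and unknown-residue filters.

    Assumptions (not stated in the source): the length bounds are inclusive,
    and uniqueness means an exact match of the upper-cased sequence.
    """
    kept = []
    seen = set()
    for seq in chains:
        seq = seq.strip().upper()
        # Keep only chains whose length falls in the assumed inclusive range.
        if not (min_len <= len(seq) <= max_len):
            continue
        # Drop chains in which unknown residues (X) exceed the allowed fraction.
        if seq.count("X") / len(seq) > max_unknown_fraction:
            continue
        # Keep the first occurrence of each sequence, preserving input order.
        if seq in seen:
            continue
        seen.add(seq)
        kept.append(seq)
    return kept


if __name__ == "__main__":
    example_chains = ["ACDEFG", "ACDEFG", "ACDX", "ACDEXGHIKL", "ACDEXG"]
    print(filter_peptides(example_chains))
```

With a 10% threshold and peptides of 5 to 10 residues, a single X is tolerated only in a 10-residue chain (exactly 10%, which is not more than 10%); any shorter chain containing an X is removed.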
Download PEP2DNR Dataset
From the PEP2D dataset, we created an 80% non-redundant dataset using CD-HIT. This dataset, named PEP2DNR, contains a total of 1980 peptides. Download