Supplementary Table 1. List of non-redundant sequences in the PDNA-62 dataset (Ahmad et al., 2004, Bioinformatics, 20:477-486) for prediction of DNA-binding residues.

PDB ID_chain  Resolution (Å)  Sequence annotation
1A02_F        2.70            Human proto-oncogene protein c-fos
1A02_J        2.70            Human transcription factor AP-1 (c-jun)
1A02_N        2.70            Human T cell transcription factor NFAT1
1A74_A        2.20            Intron-encoded endonuclease I-PpoI from Physarum polycephalum
1AAY_A        1.60            Mouse transcription factor Zif268 (zinc finger protein)
1AZQ_A        1.94            DNA-binding protein 7d from Sulfolobus acidocaldarius
1B3T_A        2.20            Human herpesvirus 4 nuclear antigen EBNA1 (DNA-binding domain)
1BF5_A        2.90            Human Stat-1 (signal transducer and activator of transcription)
1BHM_A        2.20            Endonuclease BamHI from Bacillus amyloliquefaciens
1BL0_A        2.30            E. coli transcriptional activator, multiple antibiotic resistance protein
1C0W_B        3.20            Diphtheria toxin repressor from Corynebacterium diphtheriae
1CDW_A        1.90            Human TATA-box binding protein (TBP)
1CF7_A        2.60            Transcription factor E2F-4
1CJG_A        NMR             E. coli lactose operon repressor
1CMA_A        2.80            E. coli Met repressor (metJ)
1D02_A        1.70            Type II restriction enzyme MunI from Mycoplasma
1D66_A        2.70            Yeast GAL4 transcriptional activator
1DP7_P        1.50            Human MHC class II regulatory factor RFX1
1ECR_A        2.70            E. coli replication terminator protein
1FJL_A        2.00            Drosophila homeodomain protein paired
1GAT_A        NMR             Erythroid transcription factor GATA-1 from Gallus gallus
1GCC_A        NMR             Ethylene-responsive transcription factor 1A (ERF1A) from Arabidopsis thaliana
1GDT_A        3.00            E. coli recombinase, gamma delta resolvase
1HCQ_A        2.40            Human estrogen receptor DNA-binding domain
1HCR_A        1.80            DNA invertase hin from Salmonella typhimurium
1HDD_C        2.80            Drosophila segmentation polarity homeobox protein engrailed
1HLO_A        2.80            Human transcription factor Max (Myc-associated factor X)
1HRY_A        NMR             Human sex-determining region Y protein (SRY)
1HWT_D        2.50            Yeast activatory protein CYP1 (HAP1)
1IF1_A        3.00            Interferon regulatory factor 1 from Mus musculus
1IGN_A        2.25            Yeast DNA-binding protein RAP1
1IHF_A        2.50            E. coli integration host factor (DNA-binding, bacterial histone-like)
1IHF_B        2.50            E. coli integration host factor (DNA-binding, bacterial histone-like)
1J59_A        2.50            E. coli catabolite gene activator protein (CAP)
1LMB_4        1.80            Bacteriophage lambda repressor protein CI
1MDY_A        2.80            Mouse MyoD bHLH domain
1MEY_F        2.20            Designed consensus zinc finger
1MHD_A        2.80            Human Smad3 transcriptional activator
1MNM_A        2.25            Yeast Mcm1 transcriptional regulator
1MNM_C        2.25            Yeast Mat alpha-2 transcriptional repressor
1MSE_C        NMR             Mouse Myb proto-oncogene protein
1OCT_C        3.00            Human Oct-1 (POU domain)
1PAR_B        2.60            Bacteriophage P22 transcriptional repressor arc
1PDN_C        2.50            Prd paired domain from Drosophila melanogaster
1PER_L        2.50            Bacteriophage 434 repressor
1PNR_A        2.70            E. coli HTH-type transcription repressor purR (purine repressor)
1PUE_E        2.10            Transcription factor PU.1 (Ets domain) from Mus musculus
1PVI_B        2.80            Type II restriction enzyme PvuII from Proteus vulgaris
1PYI_A        3.20            Yeast pyrimidine pathway regulator 1 (PPR1)
1REP_C        2.60            E. coli replication initiation protein
1SRS_A        3.20            Human serum response factor (SRF)
1SVC_P        2.60            Human nuclear factor NF-kappa-B p105 subunit (NFKB1)
1TC3_C        2.45            Transposable element Tc3 transposase from Caenorhabditis elegans
1TF3_A        NMR             Transcription factor IIIA from Xenopus laevis
1TRO_A        1.90            E. coli Trp operon repressor
1TSR_A        2.20            Human p53 tumor suppressor
1UBD_C        2.50            Human YY1 protein zinc finger domain
1XBR_A        2.50            Brachyury transcription factor (T domain) from Xenopus laevis
1YRN_A        2.50            Yeast mating-type protein A1 (MATA1)
1YRN_B        2.50            Yeast mating-type protein ALPHA2 (MATALPHA2)
1YSA_C        2.90            Yeast transcription factor GCN4
1YUI_A        NMR             Transcription factor GAGA from Drosophila melanogaster
2BOP_A        1.70            Regulatory protein E2 from bovine papillomavirus type 1
2DRP_D        2.80            Drosophila Tramtrack protein beta isoform (Fushi tarazu repressor protein)
2GLI_A        2.60            Human zinc finger protein Gli1
2HDC_A        NMR             Rat forkhead box protein D3
3CRO_L        2.50            Bacteriophage 434 Cro protein