Training Dataset
The dataset contains experimentally validated proteins: 15525 transcription factor sequences (positive class) and 418848 non-transcription factor sequences (negative class).
Independent Dataset
The dataset contains experimentally validated proteins: 3882 transcription factor sequences (positive class) and 104712 non-transcription factor sequences (negative class).
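
Both datasets are heavily skewed toward the negative class, which matters when choosing evaluation metrics. The sketch below is only an illustration of the arithmetic: the counts are copied from the descriptions above, while the names `COUNTS` and `summarize` are hypothetical and not part of any released code.

```python
# Sequence counts copied from the dataset descriptions above.
# The dictionary layout and key names are illustrative assumptions.
COUNTS = {
    "training": {"tf": 15525, "non_tf": 418848},
    "independent": {"tf": 3882, "non_tf": 104712},
}


def summarize(counts):
    """Return total size, positive-class fraction, and negatives per positive."""
    total = counts["tf"] + counts["non_tf"]
    return {
        "total": total,
        "positive_fraction": counts["tf"] / total,
        "neg_per_pos": counts["non_tf"] / counts["tf"],
    }


summary = {name: summarize(c) for name, c in COUNTS.items()}

# Share of all sequences that fall in the training dataset.
train_share = summary["training"]["total"] / sum(s["total"] for s in summary.values())

for name, s in summary.items():
    print(
        f"{name}: total={s['total']}, "
        f"TF fraction={s['positive_fraction']:.4f}, "
        f"non-TF per TF={s['neg_per_pos']:.2f}"
    )
print(f"training share of all sequences: {train_share:.3f}")
```

By these counts, transcription factors make up under 4% of each dataset, so plain accuracy can look high even for a classifier that predicts the negative class everywhere. Metrics that account for imbalance, such as precision, recall, or MCC, are likely more informative here.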