Single-Instance TasksEpitopeIEDB, Jespersen et al.

IEDB, Jespersen et al.

Epitope

Dataset Description

Epitope prediction is to predict the active region in the antigen. This dataset is from Bepipred, which curates a dataset from IEDB. It collects B-cell epitopes and non-epitope amino acids determined from crystal structures.

Task Description

Token-level classification. Given an amino acid sequence, predict amino acid token that is active in binding, i.e. X is amino acid sequence, Y is a list of indices for the active positions in X.

Dataset Statistics

3,159 antigens.

Available Splits

Random Split

Usage Example

from tdc_ml.single_pred import Epitope

data = Epitope(name='IEDB_Jespersen')

# Access the data
df = data.get_data()
print(df.head())

# Get train/val/test splits
split = data.get_split()
print(split)

License

This dataset is licensed under CC BY 4.0.