Multi-Instance TasksPeptideMHCMHC Class I, IEDB-IMGT, Nielsen et al.

MHC Class I, IEDB-IMGT, Nielsen et al.

PeptideMHC

Dataset Description

Binding of peptides to MHC class I molecules (MHC-I) is essential for antigen presentation to cytotoxic T-cells. An organized datasets by NetMHCpan for MHC class I collected from IEDB and IMGT/HLA database.

Task Description

Regression. Given the amino acid sequence of peptide and the pseudo amino acid sequence of MHC, predict the binding affinity.

Dataset Statistics

185,985 pairs, 43,018 peptides and 150 MHC class 1s

Available Splits

Random Split

Usage Example

from tdc_ml.multi_pred import PeptideMHC

data = PeptideMHC(name='MHC1_IEDB-IMGT_Nielsen')

# Access the data
df = data.get_data()
print(df.head())

# Get train/val/test splits
split = data.get_split()
print(split)

License

This dataset is licensed under CC BY 4.0.