HIV

HTS

Dataset Description

The HIV dataset was introduced by the Drug Therapeutics Program (DTP) AIDS Antiviral Screen, which tested the ability to inhibit HIV replication for over 40,000 compounds. From MoleculeNet.

Task Description

Binary classification. Given a drug SMILES string, predict its activity against HIV virus.

Dataset Statistics

41,127 drugs.

Available Splits

Random SplitScaffold Split

Usage Example

from tdc_ml.single_pred import HTS

data = HTS(name='HIV')

# Access the data
df = data.get_data()
print(df.head())

# Get train/val/test splits
split = data.get_split()
print(split)

License

This dataset is licensed under CC BY 4.0.