AMES Mutagenicity

Tox

Dataset Description

Mutagenicity is the ability of a drug to induce genetic alterations. Drugs that damage DNA can cause cell death or other severe adverse effects. The most widely used assay for testing the mutagenicity of compounds is the Ames test, named after its inventor, Bruce Ames. It is a short-term bacterial reverse mutation assay that detects a large number of compounds that can induce genetic damage and frameshift mutations. The dataset is aggregated from four papers.

Task Description

Binary classification. Given a drug SMILES string, predict whether it is mutagenic (1) or not mutagenic (0).
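To make the 1/0 label convention concrete, the following is a minimal sketch of how predictions for this task could be scored. The helper name `binary_metrics` and the toy label lists are illustrative assumptions; they are not part of the dataset or its loader library.

```python
def binary_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity and specificity for 0/1 labels.

    Hypothetical helper for illustration: 1 = mutagenic, 0 = not mutagenic.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four confusion-matrix cells.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    total = tp + tn + fp + fn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        # Fraction of mutagenic compounds that were flagged as mutagenic.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # Fraction of non-mutagenic compounds that were correctly cleared.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


# Toy values, not taken from the dataset.
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Sensitivity is reported separately from accuracy because, in a toxicity screen, missing a mutagenic compound is usually the more costly error.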

Dataset Statistics

7,255 drugs.

Available Splits

Random Split
Scaffold Split
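A scaffold split keeps molecules that share a core scaffold in the same subset, so the test set contains chemotypes the model has not seen during training. The sketch below illustrates the general idea with a greedy, largest-group-first assignment. It is not necessarily the procedure the library uses, and it assumes scaffold keys have already been computed (in practice this needs a cheminformatics toolkit). The function `group_split`, the molecule IDs, and the scaffold labels are all hypothetical.

```python
from collections import defaultdict


def group_split(items, group_keys, n_train, n_valid):
    """Assign whole groups to train/valid/test without splitting any group.

    Hypothetical helper: groups are placed largest first; a group goes to
    train if it fits under n_train, else to valid if it fits under n_valid,
    else to test.
    """
    groups = defaultdict(list)
    for item, key in zip(items, group_keys):
        groups[key].append(item)

    # Largest groups first; ties broken by key so the result is deterministic.
    ordered = sorted(groups.items(), key=lambda kv: (-len(kv[1]), kv[0]))

    train, valid, test = [], [], []
    for _, members in ordered:
        if len(train) + len(members) <= n_train:
            train.extend(members)
        elif len(valid) + len(members) <= n_valid:
            valid.extend(members)
        else:
            test.extend(members)
    return train, valid, test


# Placeholder molecule IDs and precomputed scaffold labels (not real data).
molecules = [f"mol_{i}" for i in range(10)]
scaffolds = ["A", "A", "A", "A", "B", "B", "B", "C", "C", "D"]

train, valid, test = group_split(molecules, scaffolds, n_train=7, n_valid=2)
print(len(train), len(valid), len(test))
```

Because groups are never divided, the resulting subset sizes only approximate the requested ones. A random split, by contrast, hits the target sizes exactly but can place near-identical molecules on both sides of the split.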

Usage Example

from tdc_ml.single_pred import Tox

data = Tox(name='AMES')

# Access the data
df = data.get_data()
print(df.head())

# Get train/val/test splits
split = data.get_split()
print(split)

License

This dataset is licensed under CC BY 4.0.