Hello,
Thanks for this nice repository. I was going through the dataset and was wondering:
- where the curation pipeline for the raw ChEMBL data is implemented, and
- which activity_id values the measurements correspond to (or from which sets of measurements they might have been aggregated).
The raw data seems to contain only compound ChEMBL identifiers, not the activity or assay identifiers. I didn't find any information on this in the paper (Study Setup section), so I am probably looking in the wrong place. Could you help me with this?
Best,
Michael