Skip to content

Uncurated SACT for 100K participants 2026 release

Treatment timelines offer clinical signal regarding the response or resistance to drugs during the cancer journey. The systemic anti-cancer therapy (SACT) tables contain detailed logs of drug administration for a given patient.

Both SACT curated and uncurated tables have been released in the past in the research environment.

This release brings an extra year of SACT data, giving visibility of treatments into 2024 for participants. This means we have very rich treatment follow-up for the 100kGP cohort, including significantly larger cohorts of participants on newer drugs such as PARP inhibitors.

This data can be found at s3://907999473992-pathimages-consent/20260413_sact_uncurated_refresh_filtered.tsv.

You can access the data from the RE at /mnt/pathology-images/20260413_sact_uncurated_refresh_filtered.tsv

In CloudOS you will need to mount s3://907999473992-pathimages-consent/

Data context and applications

This data should be used to identify courses of treatment in combination with the other (curated) SACT table. The less processed nature of Uncurated SACT means that there is significantly newer data, particularly for newer drugs which may be of interest. It is best to collect evidence relevant chemotherapy prescriptions/drugs and time periods from both datasets and put them together for analysis or cohort definition of participants.

Note the data is submitted by individual hospital trusts, so there may be some variation in formatting - in particular regarding how complete submitted cycles are, or how comprehensive the data is on supporting drugs such as anti-emetics.

For users using the previous SACT uncurated release, all code and logic should remain functional on the new release with no changes. If there are issues please raise a Service Desk ticket.

Data dictionary

Please review the existing data dictionary for SACT uncurated. All the columns, data types and descriptions are identical.