Application data versions

Many of the applications in the RE do not access the data directly; instead, each has its own local data store. As a result, the data an application uses may be updated some time after a new release, and the data you see can differ from one source to another.

This table shows the data version that each RE application or data product currently uses and when that version was last updated:

| Application / data product | 100kGP data version | NHS-GMS data version | COVID-19 data version [1] |
|---|---|---|---|
| LabKey | v19 (Oct 2024) | v4 | |
| The genomes folder | v19 (Oct 2024) | v4 | |
| Participant Explorer | v19 (Jan 2025) | tbd | |
| Aggregated variant calls (100kGP)/COVID-19 | v10 [2] (Sep 2020) | - | |
| Somatic aggregated variant calls | v12 [2] (Sep 2021) | - | |
| IVA/OpenCGA | v16 (Feb 2023) | - | |
| CloudOS S3 | v18 (Apr 2024) | - | |
| CloudOS Cohort browser | v17 [4] | - | |
| CloudOS OMOP | v16 | - | |

  1. Only available in CloudOS 

  2. Participants from later releases are not part of these aggregates. However, we provide lists of consented individuals based on current releases so that researchers can work with this data.

  3. COVID-19 data in OpenCGA can only be accessed via the CloudOS Cohort browser. The ISARIC, PHOSP and VITT cohorts are not included in OpenCGA. 89.6% of samples in CloudOS are in OpenCGA, and at least 79.8% of samples in OpenCGA are in CloudOS with concordant VCF files.

  4. Structured/clinical data version. For variant data versions in the CloudOS Cohort browser, see IVA/OpenCGA.
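
To see at a glance which applications trail the newest 100kGP release, the table above can be compared programmatically. The following is a minimal sketch only: it assumes the 100kGP version labels have been copied by hand from the table into a Python dict, with footnote markers and dates dropped. The names `VERSIONS_100KGP`, `version_number` and `lagging_apps` are hypothetical and are not part of any RE tool or API.

```python
# Hypothetical snapshot of the 100kGP column, transcribed by hand from the
# table above (footnote markers and dates dropped). Not read from any RE API.
VERSIONS_100KGP = {
    "LabKey": "v19",
    "The genomes folder": "v19",
    "Participant Explorer": "v19",
    "Aggregated variant calls (100kGP)/COVID-19": "v10",
    "Somatic aggregated variant calls": "v12",
    "IVA/OpenCGA": "v16",
    "CloudOS S3": "v18",
    "CloudOS Cohort browser": "v17",
    "CloudOS OMOP": "v16",
}


def version_number(label: str) -> int:
    """Turn a label such as 'v19' into the integer 19."""
    return int(label.lstrip("v"))


def lagging_apps(versions: dict[str, str]) -> dict[str, int]:
    """Map each application to how many releases it trails the newest one listed."""
    newest = max(version_number(v) for v in versions.values())
    return {
        app: newest - version_number(v)
        for app, v in versions.items()
        if version_number(v) < newest
    }


if __name__ == "__main__":
    # Print the applications furthest behind first.
    ranked = sorted(lagging_apps(VERSIONS_100KGP).items(), key=lambda kv: -kv[1])
    for app, lag in ranked:
        print(f"{app}: {lag} release(s) behind")
```

Applications already on the newest listed version are omitted from the result, so an empty dict would mean every source is on the same release. The snapshot has to be updated by hand whenever the table changes.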