Skip to content

COVID-19 aggregations

Aggregates of the COVID-19 gVCFs are available in s3 buckets accessible via CloudOS. These can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19.

Multi-participant datasets can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19.

Similar to the 100,000 Genomes Project, we make available useful outputs from our internal bioinformatics pipeline and analyses for the Covid-19 Project. We have:

  • Aggregated variant calls
  • Functional annotation of all variants in the aggregate VCFs
  • Principal components and sample relatedness information

Description

The latest multi-participant aggregate of COVID-19 data is the aggCovidV4.2, which includes the following participants:

Cohort Number of participants
Realigned cancer controls 4,183
Mild COVID-19 cohort 1,809
Severe COVID-19 cohort 8,794
Total 14,786