COVID-19 aggregations¶
Aggregates of the COVID-19 gVCFs are available in s3 buckets accessible via CloudOS. These can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19
.
Multi-participant datasets can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19
.
Similar to the 100,000 Genomes Project, we make available useful outputs from our internal bioinformatics pipeline and analyses for the Covid-19 Project. We have:
- Aggregated variant calls
- Functional annotation of all variants in the aggregate VCFs
- Principal components and sample relatedness information
Description¶
The latest multi-participant aggregate of COVID-19 data is the aggCovidV4.2, which includes the following participants:
Cohort | Number of participants |
---|---|
Realigned cancer controls | 4,183 |
Mild COVID-19 cohort | 1,809 |
Severe COVID-19 cohort | 8,794 |
Total | 14,786 |