COVID-19 aggregations¶
Aggregates of the COVID-19 gVCFs are available in s3 buckets accessible via CloudOS. These can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19.
Multi-participant datasets can be accessed in CloudOS under Data & Results > GEL data resources > aggregations > covid-19.
Similar to the 100,000 Genomes Project, we make available useful outputs from our internal bioinformatics pipeline and analyses for the Covid-19 Project. We have:
- Aggregated variant calls
- Functional annotation of all variants in the aggregate VCFs
- Principal components and sample relatedness information
Description¶
The latest multi-participant aggregate of COVID-19 data is the aggCovidV4.2, which includes the following participants:
| Cohort | Number of participants |
|---|---|
| Realigned cancer controls | 4,183 |
| Mild COVID-19 cohort | 1,809 |
| Severe COVID-19 cohort | 8,794 |
| Total | 14,786 |