ClinVar Annotations
Annotations from ClinVar were loaded into Google Genomics for use in sample annotation pipelines. The data is a snapshot: it reflects the state of ClinVar at the download time recorded under Provenance, not the current ClinVar release.
Google Cloud Platform data locations
- Google Cloud Storage folder gs://genomics-public-data/clinvar/
- Google Genomics annotation sets
Provenance
Each of the annotation sets listed below was imported into the API from the source files. The source files are also mirrored in Google Cloud Storage.
ClinVar (downloaded 2/5/2015 10:18AM PST):
Caveats
Some ClinVar entries were omitted during ingestion because they were incompatible with the Google Genomics API:
- 14737 were aligned to NCBI36, which the Google Genomics API does not currently support.
- 5952 did not specify a reference assembly.
- 1324 were labeled as insertions but did not specify the inserted bases.
- 220 were labeled as SNPs, but did not specify an alternate base.
- 148 were larger than 100 Mbp.
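
The omission rules above can be read as a simple filter. The following is a minimal sketch of that logic in Python, not the actual ingestion code: the `ClinVarRecord` type, its field names, the `omission_reason` helper, the variant type labels, and the order in which the checks run are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# 100 Mbp size limit taken from the caveats list above.
MAX_LENGTH_BP = 100_000_000


@dataclass
class ClinVarRecord:
    """Hypothetical, simplified view of one ClinVar entry."""
    assembly: Optional[str]         # e.g. "GRCh37", "NCBI36", or None
    variant_type: str               # e.g. "SNP", "insertion", "deletion"
    start: int                      # start coordinate
    end: int                        # end coordinate
    alternate_bases: Optional[str]  # None or "" when not specified


def omission_reason(rec: ClinVarRecord) -> Optional[str]:
    """Return why a record would be skipped, or None if it would be kept.

    The check order is an assumption; the source only lists the categories.
    """
    if not rec.assembly:
        return "no reference assembly"
    if rec.assembly == "NCBI36":
        return "aligned to NCBI36"
    if rec.variant_type == "insertion" and not rec.alternate_bases:
        return "insertion without inserted bases"
    if rec.variant_type == "SNP" and not rec.alternate_bases:
        return "SNP without alternate base"
    if rec.end - rec.start > MAX_LENGTH_BP:
        return "larger than 100 Mbp"
    return None


if __name__ == "__main__":
    example = ClinVarRecord("NCBI36", "SNP", 100, 101, "A")
    print(omission_reason(example))
```

Returning a reason string instead of a boolean makes it straightforward to tally omitted entries per category, which is how a breakdown like the one above could be produced.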