ClinVar Annotations

The properly rendered version of this document can be found at Read The Docs.

If you are reading this on github, you should instead click here.

Annotations from ClinVar were loaded into Google Genomics for use in sample annotation pipelines. This data reflects the state of ClinVar at a particular point in time.

Google Cloud Platform data locations

Provenance

Each of the annotation sets listed below was imported into the API from the source files. The source files are also mirrored in Google Cloud Storage.

ClinVar (downloaded 2/5/2015 10:18AM PST):

Caveats

A number of ClinVar entries were omitted during ingestion due to data incompatibility with the Google Genomics API.

  • 14737 were aligned to NCBI36, which the Google Genomics API does not currently support.
  • 5952 did not specify a reference assembly.
  • 1324 were labeled as insertions but did not specify the inserted bases.
  • 220 were labeled as SNPs, but did not specify an alternate base.
  • 148 were larger than 100MBp.

Have feedback or corrections? All improvements to these docs are welcome! You can click on the “Edit on GitHub” link at the top right corner of this page or file an issue.

Need more help? Please see https://cloud.google.com/genomics/support.