UCSC Annotations¶
The properly rendered version of this document can be found at Read The Docs. If you are reading this on github, you should instead click here. |
UCSC Sequence and Annotation Data were loaded into Google Genomics for use in sample annotation pipelines. This data reflects the state of UCSC Sequence and Annotation Data at a particular point in time.
Google Cloud Platform data locations¶
- Google Cloud Storage folder gs://genomics-public-data/ucsc/
- Google Genomics annotation sets
Provenance¶
Each of the annotation sets listed below was imported into the API from the source files. The source files are also mirrored in Google Cloud Storage.
UCSC GRCh38 (downloaded 12/29/2014 14:00 PST):
- http://hgdownload.cse.ucsc.edu/goldenPath/hg38/database/refFlat.txt.gz
- http://hgdownload.cse.ucsc.edu/goldenPath/hg38/database/refGene.txt.gz
- http://hgdownload.cse.ucsc.edu/goldenPath/hg38/database/knownGene.txt.gz
UCSC hg19 (downloaded 3/5/2015 17:00 PST):
- http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/refFlat.txt.gz
- http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/refGene.txt.gz
- http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/knownGene.txt.gz