1,000 Genomes

This dataset comprises roughly 2,500 genomes from 26 populations around the world. See the 1000 Genomes Project website and the publications listed here for full details:

Pilot publication

The 1000 Genomes Project Consortium
Published: October 28, 2010
DOI: 10.1038/nature09534

Phase 1 publication

The 1000 Genomes Project Consortium
Published: November 1, 2012
DOI: 10.1038/nature11632

Phase 3 publications

The 1000 Genomes Project Consortium
Published: September 30, 2015
DOI: 10.1038/nature15393

The 1000 Genomes Project Consortium
Published: September 30, 2015
DOI: 10.1038/nature15394

Google Cloud Platform data locations

Beacon and GA4GH

You can find a Global Alliance for Genomics and Health Beacon at http://webdev.dnastack.com/p/beacon/thousandgenomes?chromosome=1&coordinate=10177&allele=AC
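The Beacon URL above encodes a single question: does this dataset contain a given allele at a given position? As a minimal sketch, the same query string can be assembled from its three parameters (`chromosome`, `coordinate`, `allele`) as they appear in that URL. The helper name `beacon_query_url` is hypothetical and not part of any Beacon client library, and the snippet only builds the URL; it does not assume the endpoint is still reachable or say anything about the response format.

```python
from urllib.parse import urlencode

# Base endpoint copied from the Beacon URL quoted above.
BEACON_BASE = "http://webdev.dnastack.com/p/beacon/thousandgenomes"


def beacon_query_url(chromosome, coordinate, allele, base=BEACON_BASE):
    """Build a Beacon query URL (hypothetical helper for illustration only).

    The parameter names mirror those in the example URL; nothing else about
    the Beacon service is assumed here.
    """
    params = {
        "chromosome": chromosome,
        "coordinate": coordinate,
        "allele": allele,
    }
    # urlencode keeps the dict's insertion order and percent-escapes values.
    return f"{base}?{urlencode(params)}"


# Rebuild the example query: allele "AC" at position 10177 on chromosome 1.
print(beacon_query_url("1", 10177, "AC"))
```

Changing the arguments yields a query for a different site, for example `beacon_query_url("2", 12345, "T")`; that position is an arbitrary placeholder, not a known variant.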

You can find an instance of the GA4GH reference server hosting this data at http://1kgenomes.ga4gh.org/.

Provenance

The source files for this dataset include:
These files were copied to Google Cloud Storage and uploaded to Google Genomics, and the variants were then exported to Google BigQuery.

Need more help? Please see https://cloud.google.com/genomics/support.