Simons Genome Diversity Project

The properly rendered version of this document can be found at Read The Docs.

If you are reading this on github, you should instead click here.

This dataset comprises 279 publicly available genomes from 127 diverse populations for the Simons Genome Diversity Project. See the journal articles for full details:

Pilot Publication

Kay Prufer, Fernando Racimo, Nick Patterson, Flora Jay, Sriram Sankararaman, Susanna Sawyer, Anja Heinze, Gabriel Renaud, Peter H. Sudmant, Cesare de Filippo, Heng Li, Swapan Mallick, Michael Dannemann, Qiaomei Fu, Martin Kircher, Martin Kuhlwilm, Michael Lachmann, Matthias Meyer, Matthias Ongyerth, Michael Siebauer, Christoph Theunert, Arti Tandon, Priya Moorjani, Joseph Pickrell, James C. Mullikin, et al.
Published December 18, 2013
DOI: 10.1038/nature12886

Full Dataset Publication

Swapan Mallick, Heng Li, Mark Lipson, Iain Mathieson, Melissa Gymrek, Fernando Racimo, Mengyao Zhao, Niru Chennagiri, Susanne Nordenfelt, Arti Tandon, Pontus Skoglund, Iosif Lazaridis, Sriram Sankararaman, Qiaomei Fu, Nadin Rohland, Gabriel Renaud, Yaniv Erlich, Thomas Willems, Carla Gallo, Jeffrey P. Spence, Yun S. Song, Giovanni Poletti, Francois Balloux, George van Driem, Peter de Knijff et al.
Published 21 September 2016
DOI:10.1038/nature18964

Google Cloud Platform data locations

Provenance

For the full dataset of 279 genomes:

For the pilot dataset of 25 genomes, the BAMs were imported into Google Genomics.


Have feedback or corrections? All improvements to these docs are welcome! You can click on the “Edit on GitHub” link at the top right corner of this page or file an issue.

Need more help? Please see https://cloud.google.com/genomics/support.