The Genome-Wide SNP 5.0 sample data set is a useful tool for software and workflow demonstrations, development of probe-level analysis methods for making genotype calls from probe intensity data, and a variety of other applications.

Additional information about the Genome-Wide Human SNP Array 5.0 can be found on the product page.

The data set consists of 10 trios comprised of 30 distinct HapMap CEPH samples, including some replicates: Four samples are replicated four times, and one sample is replicated three times.

The HapMap Project has made available a large number of reference genotypes which can be used in conjunction with this data set. The HapMap data access policy limits redistribution rights on these genotypes so they cannot be made available directly by Affymetrix, but the reference data can be downloaded from the HapMap Project. As of HapMap release 21a, a total of about 485,509 SNPs have reference genotypes available for the samples shared here. These numbers are steadily increasing with each HapMap update.

The data set has been split into five parts for convenient download. The data consists of GeneChip™ Operating Software (GCOS) CEL and XML files as well as AGCC CEL, CHP and ARR files.

The README file contains additional information about these samples.

The available options for download are:

File type Size Download
GCOS XML 132 KB GW_SNP5_GCOS_XML.zip
GCOS CEL 417 MB
412 MB

GW_SNP5_GCOS_CEL_1.zip

GW_SNP5_GCOS_CEL_2.zip

AGCC ARR 116 KB GW_SNP5_AGCC_ARR.zip
AGCC CEL 845 MB GW_SNP5_AGCC_CEL_1.zip
AGCC CHP 281 MB GW_SNP5_AGCC_CHP_1.zip

 

845 MB
116 KB
116 KB