Tools

SoySNP50K iSelect BeadChip

An Illumina Infinium BeadChip containing over 50,000 SNPs from soybean (Glycine max L. Merr.) has been developed (Song et al. 2013; Song et al. 2015). A total of 498,921,777 reads 35-45 bp in length were obtained from DNA sequence analysis of reduced representation libraries from several soybean accessions which included six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the Wm82.a1 soybean whole genome sequence (Gemome Browser) and 209,903 SNPs were identified. After applying several filters, a total of 146,161 SNPs were determined to be candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase assay success rate, and minimize the number of SNPs with low minor allele frequency, an iteration algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes. In addition, 99 SNPs were targeted to unanchored sequence scaffolds. Of the 60,800 SNPs, a total of 52,041 passed Illumina's manufacturing phase to produce the SoySNP50K iSelect BeadChip.

View SoySNP50K SNPs in SoyBase Genome Browser

Download SNP Data

The SoySNP50K iSelect BeadChip has been used to genotype the USDA Soybean Germplasm Collection (Song, Qijian, David L. Hyten, Gaofeng Jia, Charles V. Quigley, Edward W. Fickus, Randall L. Nelson, and Perry B. Cregan. 2015. Fingerprinting soybean germplasm and its utility in genomic research, G3: Genes| Genomes| Genetics 50(10):1999-2006.) and the data generously provided by the authors.

SoySNP50K SNP positional information in relationship to the Wm82.a2 and Wm82.a1 assemblies can be found in Construction of high resolution genetic linkage maps to improve the soybean genome sequence assembly Glyma1.01, Song et al. 2016 BMC Genomics 17:33 in Supplemental table S1.

A subset of the SoySNP50K dataset was produced by Song et al. called the BARCSoySNP6K SNP set described in Soybean BARCSoySNP6K: An assay for soybean genetics and breeding research, Plant Journal (2020) 104(3):800-811. The SNP set assembled can be found in Supplemental Table S1.

Bulk Downloads

The complete data set for 20,087 G. max and G. soja accessions genotyped with 42,509 SNPs is available for Wm82.a1 and Wm82.a2 in either vcf or HapMap format. You can extract a large list of cultivars using BCFtools and a VCF file chosen below.

SNP50K Downloads
Wm82.a1 VCF
113.08 Megabytes
HapMap
114.43 Megabytes
Wm82.a2 VCF
159.54 Megabytes
HapMap
131.09 Megabytes
Wm82.a4 VCF
155.05 Megabytes
HapMap
128.22 Megabytes

Download file of Minor Allele

Frequencies (MAF) of SNPs

MAFs

20.24 Megabytes


Download allele counts for each accession in the dataset