SoySNP50K iSelect BeadChip
An Illumina Infinium BeadChip containing over 50,000 SNPs from soybean (Glycine max L. Merr.) has been developed (Song et al. 2013; Song et al. 2015). A total of 498,921,777 reads, 35-45 bp in length, were obtained by sequencing reduced representation libraries from several soybean accessions, including six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the Wm82.a1 soybean whole genome sequence (Genome Browser), and 209,903 SNPs were identified. After several filters were applied, 146,161 SNPs remained as candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase the assay success rate, and minimize the number of SNPs with low minor allele frequency, an iterative algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes; the remaining 99 SNPs were targeted to unanchored sequence scaffolds. A total of 52,041 of the 60,800 SNPs passed Illumina's manufacturing phase to produce the SoySNP50K iSelect BeadChip.
View SoySNP50K SNPs in SoyBase Genome Browser
Download SNP Data
The SoySNP50K iSelect BeadChip has been used to genotype the USDA Soybean Germplasm Collection (Song, Qijian, David L. Hyten, Gaofeng Jia, Charles V. Quigley, Edward W. Fickus, Randall L. Nelson, and Perry B. Cregan. 2015. Fingerprinting soybean germplasm and its utility in genomic research. G3: Genes|Genomes|Genetics 5(10):1999-2006), and the data were generously provided by the authors.
SoySNP50K SNP positions relative to the Wm82.a2 and Wm82.a1 assemblies can be found in Supplemental table S1 of "Construction of high resolution genetic linkage maps to improve the soybean genome sequence assembly Glyma1.01" (Song et al. 2016, BMC Genomics 17:33).
Song et al. produced a subset of the SoySNP50K dataset, the BARCSoySNP6K SNP set, which is described in "Soybean BARCSoySNP6K: An assay for soybean genetics and breeding research", Plant Journal (2020) 104(3):800-811. The assembled SNP set can be found in Supplemental Table S1 of that paper.
Bulk Downloads
The complete data set for 20,087 G. max and G. soja accessions genotyped with 42,509 SNPs is available for Wm82.a1 and Wm82.a2 in either VCF or HapMap format. To extract genotypes for a large list of cultivars, use BCFtools with one of the VCF files listed below.
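For readers who want to see what extracting a list of cultivars from a VCF file involves, the following is a minimal pure-Python sketch, not the BCFtools implementation. It keeps the nine fixed VCF columns and only the sample columns whose names appear in a requested set, which is roughly the effect of sample subsetting with `bcftools view -S samples.txt`. The function name `subset_vcf_samples`, the sample names `PI_A`, `PI_B`, `PI_C`, and the toy records are hypothetical and chosen only for illustration.

```python
from typing import Iterable, Iterator, Set

# A VCF data line starts with nine fixed columns:
# CHROM POS ID REF ALT QUAL FILTER INFO FORMAT
N_FIXED = 9


def subset_vcf_samples(lines: Iterable[str], keep: Set[str]) -> Iterator[str]:
    """Yield VCF lines restricted to the sample columns named in `keep`.

    Samples are emitted in the order they appear in the input header.
    """
    cols = None  # column indices to keep, set once the #CHROM header is seen
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("##"):
            yield line  # meta-information lines pass through unchanged
            continue
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            missing = keep - set(fields[N_FIXED:])
            if missing:
                raise ValueError(f"samples not in VCF: {sorted(missing)}")
            cols = list(range(N_FIXED)) + [
                i for i, name in enumerate(fields)
                if i >= N_FIXED and name in keep
            ]
        elif cols is None:
            raise ValueError("data line found before the #CHROM header")
        yield "\t".join(fields[i] for i in cols)


# Toy input with hypothetical sample names and a single made-up record.
toy_vcf = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tPI_A\tPI_B\tPI_C",
    "Gm01\t1000\tsnp_1\tA\tG\t.\t.\t.\tGT\t0/0\t1/1\t0/1",
]
subset = list(subset_vcf_samples(toy_vcf, {"PI_A", "PI_C"}))
for row in subset:
    print(row)
```

For a real download, the same function could be fed an open file handle instead of a list (for example one opened with `gzip.open(path, "rt")` if the file is gzip-compressed, which is an assumption about the download rather than something stated here). For the full data set, BCFtools remains the more practical choice.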
SNP50K Downloads
Wm82.a1 | VCF (113.08 Megabytes) | HapMap (114.43 Megabytes)
Wm82.a2 | VCF (159.54 Megabytes) | HapMap (131.09 Megabytes)
Wm82.a4 | VCF (155.05 Megabytes) | HapMap (128.22 Megabytes)