Tools

Glycine (soybean)

The best-known species in Glycine is the cultivated soybean, G. max, which was domesticated in Central and East Asia. The majority of the species in the genus are found only in Australia, while a few species extend from Australia to East Asia.

NCBI taxonomy ID: 3847

Overview - soybean genome and annotation statistics and nomenclature

Since the release of the first full soybean genome assembly in 2010, assemblies have been generated for more than 50 accessions, including multiple assemblies for the first reference, Williams 82 (Wm82).

There are several nomenclature patterns for the assemblies and annotations. The pattern used by the DOE-JGI and SoyBase has generally taken the form Wm82.a4.v1, with the middle field ("a4") indicating assembly version and the last field (v1) indicating the annotation version. Within the SoyBase and LegumeInfo Data Store, the pattern takes the form Wm82.gnm4.ann1 -- again, with the middle field ("gnm4") indicating assembly version and the last field (ann1) indicating the annotation version.

Access the genome and annotation data for download via the DATA COLLECTIONS tab.

Access the genome and annotation via JBrowse the GENOMICS tab.

See additional details about the main reference assemblies at the Genome Assembly page.

To examine statistics about all genome assemblies and annotations held at SoyBase, use these two links:

Tools and resources for the genus as a whole

GlycineMine
InterMine interface for accessing genetic and genomic data for several species in Glycine.
ZZBrowse
Association viewers (QTL, GWAS)
GCViT
Genotype comparison visualization tool
Genome Context Viewer
Browser for dynamically discovering and viewing genomic synteny across selected species.
Grin Data Explorer
Tool to facilitate searches of GRIN Descriptor Data
SoyMapII project
SoyMap II project to sequence perennial relatives of soybean.

Tools and resources for particular species


  • Glycine max: soybean

    Soybean (Glycine max), the predominant oil-seed legume worldwide, was likely domesticated in East Asia, ~6000-9000 years ago (Sedivy et al., 2017; https://doi.org/10.1111/nph.14418). It has many culinary and industrial uses. Some of the culinary uses include: for direct consumption of the green seed (i.e. edamame) and leaves (cooked, much like spinach); for tofu, soymilk, textured vegetable protein, soy sauce, tempeh, natto, and vegetable oil. Industrial uses include: oils, soap, cosmetics, and biodiesel. Soybean is also used as a high-protein forage, and can be prepared for fish- and animal-feed.

    NCBI taxonomy ID: 3847

    Glycine max resources

    GlycineMine
    InterMine interface for accessing genetic and genomic data for several species in Glycine.
    ZZBrowse
    Association viewers (QTL, GWAS)
    GCViT
    Genotype comparison visualization tool
    Genome Context Viewer
    Browser for dynamically discovering and viewing genomic synteny across selected species.
    Grin Data Explorer
    Tool to facilitate searches of GRIN Descriptor Data

    Glycine max accessions

    Reference - Williams 82

    Wm82.gnm6
    Glycine max accession Williams 82 (ISU01) genome assembly v6; renamed from Wm82 ISU-01 v2.1; JGI name Wm82.a6.v1
    Wm82.gnm5
    Glycine max accession Williams 82 (Wm82), genome assembly 5 doi.org/10.1002/tpg2.20382
    Wm82.gnm4
    Glycine max accession Williams 82 genome assembly v4.0; JGI name Wm82.a4.v1 doi.org/10.1111/tpj.14500
    Wm82.gnm2
    Glycine max accession Williams 82 genome assembly v2.0; JGI name Wm82.a2.v1 doi.org/10.1038/nature08670
    Wm82.gnm1
    Glycine max accession Williams genome assembly v1.0; JGI name Glycine max v1.1 doi.org/10.1038/nature08670

    Reference - Lee

    Lee.gnm3
    Glycine max accession Lee, genome assembly 3 doi.org/10.1002/tpg2.20382
    Lee.gnm2
    Glycine max genotype Lee genome assembly v2.0 doi.org/10.1016/j.jare.2021.10.009
    Lee.gnm1
    Glycine max accession Lee Genome assembly 1; JGI name Lee v1.1 doi.org/10.1111/tpj.14500

    Reference - Fiskeby III

    FiskebyIII.gnm1
    Glycine max genotype Fiskeby III genome assembly 1; JGI name Fiskeby v1.1

    Reference - Zhonghuang 13

    Zh13.gnm2
    Genome assembly version 2 files for cultivar Zhonghuang 13, Shen et al. (2019) doi.org/10.1007/s11427-019-9822-2
    Zh13.gnm1
    Genome assembly files for cultivar Zhonghuang 13, Shen et al. (2018) doi.org/10.1007/s11427-018-9360-0

    Reference - Hwangkeum

    Hwangkeum.gnm1
    Glycine max genotype Hwangkeum genome assembly v1.0 doi.org/10.1093/g3journal/jkab272

    Reference - Jidou 17

    JD17.gnm1
    Glycine max accession Jidou 17 (JD17), genome assembly 1 doi.org/10.1093/g3journal/jkac017

    Chu, Peng et al., 2021

    Citation (DOI) for this accession group: doi.org/10.1038/s41597-021-00947-2
    Hefeng25_IGA1002.gnm1
    Genome assembly files for cultivar Hefeng 25 (Hefeng25_IGA1002 in publication; WHFS_GmHF25_1.0 in the GenBank assembly record)
    Huaxia3_IGA1007.gnm1
    Genome assembly files for cultivar Huaxia3 (Huaxia3_IGA1007 in publication; WHFS_GmHX3_1.0 in the GenBank assembly record)
    Jinyuan_IGA1006.gnm1
    Genome assembly files for cultivar Jinyuan (Jinyuan_IGA100 in the publication; 6HFS_GmJY_1.0 in the GenBank assembly record)
    Wenfeng7_IGA1001.gnm1
    Genome assembly files for cultivar Wenfeng 7 (Wenfeng7_IGA1001 in publication; WHFS_GmWF7_1.0 in the GenBank assembly record); Chu et al. (2021)
    Wm82_IGA1008.gnm1
    Genome assembly files for cultivar Williams 82 (Wm82_IGA1008 in publication; WHFS_GmW82_1.0 in the GenBank assembly record)
    Zh13_IGA1005.gnm1
    Genome assembly files for cultivar Zhonghuang 13 (Zh13_IGA1005 in publication; WHFS_GmZH13_1.0 in the GenBank assembly record)
    Zh35_IGA1004.gnm1
    Genome assembly files for cultivar Zhonghuang 35 (Zh35_IGA1004 in publication; WHFS_GmZH35_1.0 in the GenBank assembly record)

    Liu, Du et al., 2020

    Citation (DOI) for this accession group: doi.org/10.1016/j.cell.2020.05.023
    58-161.gnm1
    Genome assembly for Glycine max accession 58-161 (SoyL04)
    Amsoy.gnm1
    Genome assembly for Glycine max accession Amsoy (SoyC05)
    DongNongNo_50.gnm1
    Genome assembly for Glycine max accession DongNongNo_50 (SoyC12)
    FengDiHuang.gnm1
    Genome assembly for Glycine max accession FengDiHuang (SoyL07)
    HanDouNo_5.gnm1
    Genome assembly for Glycine max accession HanDouNo_5 (SoyC09)
    HeiHeNo_43.gnm1
    Genome assembly for Glycine max accession HeiHeNo_43 (SoyC13)
    JiDouNo_17.gnm1
    Genome assembly for Glycine max accession JiDouNo_17 (SoyC11)
    JinDouNo_23.gnm1
    Genome assembly for Glycine max accession JinDouNo_23 (SoyC07)
    JuXuanNo_23.gnm1
    Genome assembly for Glycine max accession JuXuanNo_23 (SoyC03)
    KeShanNo_1.gnm1
    Genome assembly for Glycine max accession KeShanNo_1 (SoyC14)
    PI_398296.gnm1
    Genome assembly for Glycine max accession PI_398296 (SoyL05)
    PI_548362.gnm1
    Genome assembly for Glycine max accession PI_548362 (SoyC10)
    QiHuangNo_34.gnm1
    Genome assembly for Glycine max accession QiHuangNo_34 (SoyC08)
    ShiShengChangYe.gnm1
    Genome assembly for Glycine max accession ShiShengChangYe (SoyL09)
    TieFengNo_18.gnm1
    Genome assembly for Glycine max accession TieFengNo_18 (SoyC02)
    TieJiaSiLiHuang.gnm1
    Genome assembly for Glycine max accession TieJiaSiLiHuang (SoyL08)
    TongShanTianEDan.gnm1
    Genome assembly for Glycine max accession TongShanTianEDan (SoyL03)
    WanDouNo_28.gnm1
    Genome assembly for Glycine max accession WanDouNo_28 (SoyC04)
    XuDouNo_1.gnm1
    Genome assembly for Glycine max accession XuDouNo_1 (SoyC01)
    YuDouNo_22.gnm1
    Genome assembly for Glycine max accession YuDouNo_22 (SoyC06)
    ZhangChunManCangJin.gnm1
    Genome assembly for Glycine max accession ZhangChunManCangJin (SoyL06)
    Zhutwinning2.gnm1
    Genome assembly for Glycine max accession Zhutwinning2 (SoyL01)
    ZiHuaNo_4.gnm1
    Genome assembly for Glycine max accession ZiHuaNo_4 (SoyL02)

    Wm82_NJAU.gnm1
    Glycine max accession Williams 82 from Nanjing Agricultural University (Wm82-NJAU), genome assebly v1 doi.org/10.1016/j.molp.2023.08.012

  • Glycine soja: soybean

    Glycine soja is the closest wild relative of soybean, Glycine max. Populations of G. soja exist in the wild in China, Japan, Korea, and Russia. Analysis of genetic differences between the two species suggests that the two separated approximately 200 thousand years ago. The species remain interfertile, and G. soja accessions are used in breeding projects in order to introgress traits such as tolerance to particular diseases or environmental stresses.

    NCBI taxonomy ID: 3848

    Glycine soja resources

    GlycineMine
    InterMine interface for accessing genetic and genomic data for several species in Glycine.
    ZZBrowse
    Association viewers (QTL, GWAS)
    GCViT
    Genotype comparison visualization tool
    Genome Context Viewer
    Browser for dynamically discovering and viewing genomic synteny across selected species.
    Grin Data Explorer
    Tool to facilitate searches of GRIN Descriptor Data

    Glycine soja accessions

    Valliyodan, Cannon et al., 2019

    Citation (DOI) for this accession group: doi.org/10.1111/tpj.14500
    PI483463.gnm1
    Glycine soja accession PI 483463 genome assembly, v1.0; JGI name Glycine soja v1.1

    Xie, Chung et al., 2019

    Citation (DOI) for this accession group: doi.org/10.1038/s41467-019-09142-9
    W05.gnm1
    Genome assembly files for cultivar W05 from Xie, Lam et al. (2019): A reference-grade wild soybean genome

    Chu, Peng et al., 2021

    Citation (DOI) for this accession group: doi.org/10.1038/s41597-021-00947-2
    F_IGA1003.gnm1
    Genome assembly files for Glycine soja F (F_IGA1003 in publication; WHFS_GsojaF_1.0 in the GenBank assembly record)

    Liu, Du et al., 2020

    Citation (DOI) for this accession group: doi.org/10.1016/j.cell.2020.05.023
    PI_549046.gnm1
    Genome assembly for Glycine soja accession PI_549046 (SoyW02)
    PI_562565.gnm1
    Genome assembly for Glycine soja accession PI_562565 (SoyW01)
    PI_578357.gnm1
    Genome assembly for Glycine soja accession PI_578357 (SoyW03)

  • Glycine cyrtoloba: soybean

    G. cyrotoloba (Tind) is a perennial plant with twining and stiff stems. G. cyrotoloba pods are curved and somewhat mottled in appearance containing 3-9 seeds that and are dark brown to black in color (Tindale, MD et al., 1984). G. cyrotoloba is a diploid (2n=40) member of the C-genome of Glycine. It is found along the coast of Queensland and Northern New South Wales (Ratnaparkhe et al 2011 ; Gonzalez-Orozco et al., 2012).

    NCBI taxonomy ID: 45689

    Glycine cyrtoloba resources

    SoyMap II project
    SoyMap II project to sequence perennial relatives of soybean.
    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. cyrtoloba Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. cyrtoloba Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)

    Glycine cyrtoloba accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1267.gnm1
    Genome assemblies for Glycine cyrtoloba, accession G1267

  • Glycine dolichocarpa: soybean

    Glycine dolichocarpa (Tateishi & Ohashi) is a twining plant with long straight dark brown pods with 5-7 seeds. Seeds are square and dark brown in color. G. dolichocarpa is an allotetraploid (2n = 4x = 80) formed by hybridizatoin between G. syndetika and G. tomentella D3 (both 2n = 40). (This species was formerly part of the Glycine tomentella species complex and was referred to as G. tomentella T2.) It has a limited Australian range in Queensland, but like several other Glycine allopolyploids, has colonized islands of the Pacific Ocean (in this case Taiwan) where no perennial diploid Glycine species have been found ( Ratnaparkhe et al 2011; Harbert et al 2014).

    NCBI taxonomy ID: 82538

    Glycine dolichocarpa resources

    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. dolichocarpa Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. dolichocarpa Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)
    SoyMap II project
    SoyMap II project to sequence perennial relatives of soybean.

    Glycine dolichocarpa accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1134.gnm1
    Genome assemblies for Glycine dolichocarpa, accession G1134

  • Glycine falcata: soybean

    Glycine falcata (Benth.) is unique among perennial Glycine species in that it does not form a vine but rather short, erect stems from a fibrous woody root system instead of the more common taproot. Seeds are round and smooth similar to the annual species. G. falcata is a diploid (2n = 40) and is the sole member of the F-genome. It is sister to the remainder of subgenus Glycine, and is distinctive ecologically, characteristically growing in the black soil region of Queensland and possessing both chasmogamous and below- ground cleistogamous flowers, the latter producing geocarpic fruits (Ratnaparkhe et al 2011; Gonzalez-Orozco et al., 2012).

    NCBI taxonomy ID: 45690

    Glycine falcata resources

    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. falcata Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. falcata Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)
    SoyMap II project
    SoyMap II project to sequence perennial relatives of soybean.

    Glycine falcata accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1718.gnm1
    Genome assemblies for Glycine falcata, accession G1718

  • Glycine stenophita: soybean

    Glycine stenophita (B.E. Pfeil & Tind.) is a scrambling or climbing perennial that is glabrous or with sparse white hairs covering the stems. Pods are 4 to 6 seeded and seeds are generally barrel shaped with some variation in shape from elliptical to square. G. stenophita is a diploid (2n = 40) member of the B-genome group. It occurs in the Australian states of Queensland and New South Wales (Ratnaparkhe et al 2011; Gonzalez-Orozco et al., 2012).

    NCBI taxonomy ID: 96944

    Glycine stenophita resources

    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. stenophita Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. stenophita Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)
    SoyMap II project
    SoyMap II project to sequence perennial relatives of soybean.

    Glycine stenophita accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1974.gnm1
    Genome assemblies for Glycine stenophita, accession G1974

  • Glycine syndetika: soybean

    Glycine syndetika (B.E. Pfeil & Craven) is a twining perennial plant with three leathery, often persistent leaflets. Flowers are somewhat clustered towards to the top of the inflorescences and pods contain 4-9 relatively large square seeds (Pfeil. BE et al., 2006). G. syndetika is diploid (2n = 40) member of the A-genome clade. (This species was formerly part of the Glycine tomentella species complex and was referred to as G. tomentella D4. ) It is has a restricted range in the Eastern Queensland region of Australia (Ratnaparkhe et al 2011; Gonzalez-Orozco et al., 2012).

    NCBI taxonomy ID: 713886

    Glycine syndetika resources

    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. syndetika Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. syndetika Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)
    SoyMap II project
    SoyMap II project to sequence perennial relatives of soybean.

    Glycine syndetika accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1300.gnm1
    Genome assemblies for Glycine syndetika, accession G1300

  • Glycine D3-tomentella: soybean

    A complex of diploid and tetraploid taxa are lumped under the name "G. tomentella" but are each reproductively isolated species, e.g. G. tomentella D3 belongs to the D-genome, whereas D1 G. tomentella belongs to the E-genome.

    NCBI taxonomy ID: 2908013

    Glycine D3-tomentella resources

    SoyMap2 Diversity Browser on Glyma.Wm82.a1 (Gmax1.01)
    GBrowse for G. tomentella Bac End Sequence alignments on Glyma.Wm82.a1 (Gmax1.01)
    SoyMap2 Diversity Browser on Glyma.Wm82.a2 (Gmax2.0)
    GBrowse for G. tomentella Bac End Sequence alignments on Glyma.Wm82.a2 (Gmax2.0)
    SoyMap II
    SoyMap II project to sequence perennial relatives of soybean.

    Glycine D3-tomentella accessions

    Zhuang, Wang et al., 2022

    Citation (DOI) for this accession group: doi.org/10.1038/s41477-022-01102-4
    G1403.gnm1
    Genome assemblies for Glycine D3 tomentella, accession G1403