Table 1 Summary of datasets for eight sequenced plant genomes included in this study

From: A genome triplication associated with early diversification of the core eudicots

Species                                 Annotation version      Number of annotated genes
Arabidopsis thaliana (thale cress)      TAIR version 9          27,379
Carica papaya (papaya)                  ASGPB release           25,536
Cucumis sativus (cucumber)              BGI release             21,635
Populus trichocarpa (black cottonwood)  JGI version 2.0         41,377
Glycine max (soybean)                   Phytozome version 1.0   55,787
Vitis vinifera (grape vine)             Genoscope release       30,434
Oryza sativa (rice)                     RGAP release 6.1        56,979
Sorghum bicolor                         JGI version 1.4         34,496

These eight genome sequences were used to construct orthogroups, which were then populated with additional unigenes of asterids, basal eudicots, non-grass monocots, and basal angiosperms. The number of annotated genes in each genome is indicated. ASGPB, Advanced Studies of Genomics, Proteomics and Bioinformatics; JGI, Joint Genome Institute; RGAP, Rice Genome Annotation Project; TAIR, The Arabidopsis Information Resource.