Skip to main content

Table 1 Total number of genes and protein-coding isoforms in current versions of CHESS, RefSeq, and GENCODE. Genes are counted on the primary chromosomes and unplaced scaffolds from the human reference genome GRCh38, excluding the alternative scaffolds. Pseudogenes, VDJ segments, and C regions are not included in the totals shown in the final column

From: CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure

Database

Number of protein-coding gene loci

Number of protein-coding transcripts

Number of distinct protein sequences

Number of gene loci (all types)

CHESS v3

19,839

99,202

73,767

41,356

RefSeq v110

19,884

129,740

88,662

43,380

GENCODE v41

19,419

110,309

92,968

46,181