
Table 1 Genomes and motifs

From: Characterizing and measuring bias in sequence data

  

| Sample | Genome size | GC extremes: GC ≤ 10% | GC extremes: GC ≥ 75% | GC extremes: GC ≥ 85% | Special motifs: (AT)15 | Special motifs: G\|C ≥ 80% | Special motifs: Bad promoters |
| --- | --- | --- | --- | --- | --- | --- | --- |
| P. falciparum | 23,263,391 | 10,030,724 (43%) | 0 | 0 | 1,258,098 (5.4%) | 0 | - |
| E. coli | 4,638,920 | 0 | 2,705 (0.058%) | 0 | 0 | 0 | - |
| R. sphaeroides | 4,131,450 | 0 | 2,479,536 (60%) | 90,207 (2.2%) | 0 | 0 | - |
| Human | 2,684,573,005 | 6,228,029 (0.23%) | 20,669,681 (0.77%) | 2,980,450 (0.11%) | 1,253,245 (0.047%) | 802,554 (0.030%) | 190,041 (0.0071%) |

  1. For each genome sequenced as part of this work, we show its size in bases, along with the number of bases of each bias motif (see text). Only unambiguous (A, C, T, or G) bases from each reference are included. Plasmids, mitochondria, and sex chromosomes were excluded from the counts.
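
The percentages in Table 1 appear to be each motif's base count divided by the genome size, where genome size counts only unambiguous bases. The following is a minimal sketch of that arithmetic, not the authors' code: the function names `count_unambiguous` and `motif_percent` are hypothetical, and the motif base counts are taken from the table as given, since the motif definitions themselves are in the main text.

```python
def count_unambiguous(seq: str) -> int:
    """Count bases that are A, C, G, or T (case-insensitive)."""
    return sum(1 for base in seq.upper() if base in "ACGT")


def motif_percent(motif_bases: int, genome_size: int) -> float:
    """Percentage of the genome's unambiguous bases that fall in a motif."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return 100.0 * motif_bases / genome_size


# Toy sequence: the two N bases are ambiguous and are not counted.
print(count_unambiguous("ACGTNNacgt"))  # 8

# P. falciparum row of Table 1, GC <= 10% column.
print(round(motif_percent(10_030_724, 23_263_391)))  # 43
```

Applying `motif_percent` to the other rows gives values that match the table's percentages once they are rounded to two significant figures.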