Evidence for symmetric chromosomal inversions around the replication origin in bacteria
© GenomeBiology.com 2000
- Received: 7 August 2000
- Accepted: 19 October 2000
- Published: 4 December 2000
Whole-genome comparisons can provide great insight into many aspects of biology. Until recently, however, comparisons were mainly possible only between distantly related species. Complete genome sequences are now becoming available from multiple sets of closely related strains or species.
By comparing the recently completed genome sequences of Vibrio cholerae, Streptococcus pneumoniae and Mycobacterium tuberculosis to those of closely related species - Escherichia coli, Streptococcus pyogenes and Mycobacterium leprae, respectively - we have identified an unusual and previously unobserved feature of bacterial genome structure. Scatterplots of the conserved sequences (both DNA and protein) between each pair of species produce a distinct X-shaped pattern, which we call an X-alignment. The key feature of these alignments is that they have symmetry around the replication origin and terminus; that is, the distance of a particular conserved feature (DNA or protein) from the replication origin (or terminus) is conserved between closely related pairs of species. Statistically significant X-alignments are also found within some genomes, indicating that there is symmetry about the replication origin for paralogous features as well.
The most likely mechanism of generation of X-alignments involves large chromosomal inversions that reverse the genomic sequence symmetrically around the origin of replication. The finding of these X-alignments between many pairs of species suggests that chromosomal inversions around the origin are a common feature of bacterial genome evolution.
- Complete Genome Sequence
- Bacterial Genome
- Chlamydia Trachomatis
- Tandem Duplication
- Replication Origin
Large-scale genomic rearrangements and duplications are important in the evolution of species. Previously, these large-scale genome-changing events were studied through genetic or cytological studies. With the availability of many complete genome sequences it is now possible to study such events through comparative genomics. The publication of the yeast genome has led to much better insight into the duplication events that have occurred in fungal and eukaryotic evolution (for example, see ). Large chromosomal duplications have also been found from analysis of completed chromosomes of Arabidopsis thaliana [2,3]. The ability to detect large-scale genomic changes is dependent in large part on which genomes are available. Such studies in bacteria, for example, have been limited by the availability of genomes only from distantly related sets of species. Recently, however, the genomes of sets of closely related bacterial species have become available. We have compared these closely related bacterial genomes and have discovered an unusual phenomenon - alignments of whole genomes that show an X-shaped pattern (which we refer to as X-alignments). Here we present the evidence for these X-alignments and discuss mechanisms that might have produced them.
Whole-genome X-alignments between species at the DNA level
Table 1. Whole-genome DNA alignments using MUMmer
4.7 × 10^-59
V. cholerae (rev)
1.8 × 10^-90
V. cholerae (rev)
8.2 × 10^-16
E. coli (rev)
1.5 × 10^-23
4.5 × 10^-80
S. pyogenes (rev)
2.3 × 10^-90
S. pyogenes (rev)
1.1 × 10^-18
S. pneumoniae (rev)
1.5 × 10^-6
M. leprae (rev)
3.5 × 10^-10
M. tuberculosis (rev)
E. coli (rev)
We have found that X-alignments of whole genomes are not limited to the V. cholerae versus E. coli comparison. For example, a whole-genome comparison of two bacteria in the genus Streptococcus - S. pyogenes and S. pneumoniae (H. Tettelin, personal communication) - reveals a global X-alignment similar to that of V. cholerae versus E. coli (Figure 1d) which is also statistically significant (Table 1). In addition, an X-alignment is found between two species in the genus Mycobacterium - M. tuberculosis and M. leprae (Figure 1e) - as well as between two strains of Helicobacter pylori (data not shown). The X-alignments observed between any two pairs of genomes are not identical in every aspect. For example, in the alignment between the two Mycobacterium species, each conserved region is much longer than in the other genome pairs. We believe this is due to different numbers of evolutionary events between the species (see below). Whole-genome X-alignments were not found between any other pairs of species, although a related pattern was seen between some of the chlamydial species (see below).
Whole-genome X-alignments between species are also found at the proteome level
Whole-genome protein-level comparisons
3.2 × 10^-40
V. cholerae (rev)
2.3 × 10^-70
3.6 × 10^-17
V. cholerae (rev)
3.6 × 10^-10
E. coli (rev)
1.0 × 10^-15
V. cholerae (rev)
8.4 × 10^-4
Whole-genome X-alignments within species
Model I: whole-genome inverted duplications
One possible explanation for an X-alignment within and between species is an ancestral inverted duplication of the whole genome, as has been suggested for E. coli. The weak or missing X-alignment within species could be explained by the loss of one of the two duplicates from many of the gene pairs in the different lineages. Gene loss has been found to follow large chromosomal or genome duplications [11,12,13]. This gene loss is thought to stabilize large duplications by preventing recombination events between duplicate genes. If gene loss is responsible for the weak X-alignment within species, then to maintain the X-alignments between species, the choice of which member of each gene pair is lost in a particular lineage should be essentially random. If an ancient inverted duplication followed by differential gene loss is the correct explanation for the observed X-alignments, one would expect the genes along one diagonal to be orthologous between species (related to each other by the speciation event), while the genes along the other diagonal should be paralogous (related to each other by the genome duplication event before the speciation of the two lineages). However, the evidence appears to contradict this model: likely orthologous gene pairs are equally distributed on each diagonal (data not shown).
Model II: chromosomal inversions about the origin and/or terminus
Inversions about the origin and terminus could also produce an X-alignment within species, through the splitting of tandemly duplicated sequence. Many sets of tandemly duplicated genes are found in most bacterial genomes [19,20] (also see Figure 3a,c). As tandem duplications are inherently unstable (one of the duplicates can be rapidly eliminated by slippage and/or recombination events), the fact that many tandem pairs are present within each genome suggests that tandem duplications occur frequently. Thus, it is reasonable to assume that occasionally a large inversion will split a pair of tandemly duplicated genes. An inversion that pivots about the origin and also splits a tandem duplication will result in a pair of paralogous genes spaced symmetrically on opposite sides of the origin.
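The geometry of this model can be illustrated with a small sketch. It is not part of the original analysis: the signed-offset coordinate system, the function name `invert_about_origin` and the example numbers are assumptions chosen for illustration. Positions are expressed as signed distances from the origin (negative on one side, positive on the other), and an inversion reverses the interval running from `-left` to `+right`.

```python
def invert_about_origin(offset, left, right):
    """Map a signed offset from the origin through an inversion of [-left, +right].

    Reversing an interval [lo, hi] sends x to lo + hi - x; here lo = -left
    and hi = +right. Positions outside the interval are unchanged.
    """
    if -left <= offset <= right:
        return (right - left) - offset
    return offset


# Hypothetical tandem pair at offsets 300 and 301 (arbitrary units).
# A symmetric inversion (left == right == 300) has its endpoint between them.
copy_inside = invert_about_origin(300, left=300, right=300)
copy_outside = invert_about_origin(301, left=300, right=300)
print(copy_inside, copy_outside)

# An asymmetric inversion does not preserve distance from the origin.
print(invert_about_origin(100, left=200, right=300))
```

With equal arms, every gene inside the inverted interval keeps its distance from the origin but moves to the opposite side, and a tandem pair straddling the endpoint ends up as two paralogs at nearly equal distances on opposite sides of the origin. With unequal arms the distance changes, which is why only inversions that pivot about the origin (or terminus) would preserve the X pattern.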
If our inversion model is correct, then the genes along both diagonals in the between-species alignments should be orthologous, which is the case (see above). In contrast, genes along the anti-diagonal in the within-species X-alignments should be recent tandem duplicates that have been separated by inversions. This also appears to be the case - in the within-species analysis of V. cholerae chrI ORFs, the X-alignment shows up best when only recent duplicates are analyzed (Figure 2d). The splitting of tandem duplicates by inversions may be a general mechanism to stabilize the coexistence of duplicated genes, as it will prevent their elimination by unequal crossing-over or replication slippage events.
What could cause inversions that pivot around the origin and terminus of the genome to occur more frequently than other inversions? One possibility is that many inversions occur, but there is selection against those that change the distance of a gene from the origin or terminus. Such a possibility has been suggested by experimental work in E. coli [14,15]. Additional studies have, however, suggested that there is little selective difference between inversions and that instead there may be certain regions that are more prone to inversion than others [16,17,18,22,23]. Alternatively, the inversion events could be linked to replication, as has been suggested for small local inversion events. Whatever the mechanisms, the fact that we find evidence for such inversions between many pairs of species suggests that they are a common feature of bacterial evolution.
Many aspects of the X-alignments require further exploration. For example, to split a tandem duplication, an inversion must fall precisely on the boundary between two duplicated genes. This would appear to be unlikely, requiring a large number of inversions in order to generate a sufficient number of split gene pairs. If the mechanisms of gene duplication are somehow related to the mechanisms of inversion, however, then this model is more plausible. The process of duplicating a gene, if it occurs during replication, might promote a recombination event within the bacterial chromosome that inverts the sequence from the origin up to that point. As with inversion events, recombination and replication have been found to be tightly coupled.
We present here a novel observation regarding the conservation between bacterial species of the distance of particular genes from the replication origin or terminus. The initial observation was only possible due to the availability of complete genome sequences from pairs of moderately closely related species (for example, V. cholerae and E. coli). This shows the importance of having genome pairs from many levels of evolutionary relatedness. Comparisons of distantly related species enable the determination of universal features of life as well as of events that occur very rarely. Comparison of very closely related species allows the identification of frequent events such as transitional changes at third codon positions or tandem duplications. To elucidate all other events in the history of life, genome pairs covering all the intermediate levels of evolutionary relatedness will be needed.
Complete published genome sequences were obtained from the National Center for Biotechnology Information website or from the TIGR Comprehensive Microbial Resource. These included Aeropyrum pernix, Aquifex aeolicus, Archaeoglobus fulgidus, Bacillus subtilis, Borrelia burgdorferi, Campylobacter jejuni, Chlamydia pneumoniae AR39, Chlamydia pneumoniae CWL029, Chlamydia trachomatis (D/UW-3/Cx), Chlamydia trachomatis MoPn, Deinococcus radiodurans, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Helicobacter pylori J99, Methanobacterium thermoautotrophicum, Methanococcus jannaschii, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria meningitidis MC58, Neisseria meningitidis serogroup A strain Z2491, Pyrococcus horikoshii, Rickettsia prowazekii, Synechocystis sp., Thermotoga maritima, Treponema pallidum, and Vibrio cholerae. In addition, a few unpublished genomes were analyzed: Streptococcus pyogenes (obtained from the Oklahoma University Genome Center website), Streptococcus pneumoniae (H. Tettelin, personal communication), and Mycobacterium leprae (obtained from the Sanger Centre Pathogen Sequencing Group website).
Whole-genome DNA alignments
DNA alignments of the complete genomic sequences of all bacteria used in this study were accomplished with the MUMmer program. This program uses an efficient suffix tree construction algorithm to rapidly compute alignments of entire genomes. The algorithm identifies all exact matches of nucleotide subsequences that are contained in both input sequences; these exact matches must be longer than a specified minimum length, which was set to 20 base pairs for this comparison. To search for genome-scale alignments within species, complete bacterial and archaeal genomes (25 in total including all published genomes) were aligned with their own reverse complements. To search for between-species alignments, all genomes were aligned against all others in both orientations.
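The matching step can be pictured with the following minimal sketch. It is not MUMmer and does not use a suffix tree: as a simplifying assumption it seeds matches from k-mers that occur exactly once in each sequence and extends them in both directions, so it finds only those unique maximal matches that contain such a seed. The function names and the toy sequences are hypothetical.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def unique_kmer_positions(seq, k):
    """Map every k-mer that occurs exactly once in seq to its start index."""
    seen = defaultdict(list)
    for i in range(len(seq) - k + 1):
        seen[seq[i:i + k]].append(i)
    return {kmer: pos[0] for kmer, pos in seen.items() if len(pos) == 1}


def exact_matches(a, b, min_len=20):
    """Return sorted (start_a, start_b, length) exact matches of at least min_len.

    Each match is seeded by a k-mer unique in both sequences and extended
    left and right until the sequences disagree or one of them ends.
    """
    in_a = unique_kmer_positions(a, min_len)
    in_b = unique_kmer_positions(b, min_len)
    matches = set()
    for kmer, i in in_a.items():
        j = in_b.get(kmer)
        if j is None:
            continue
        start_a, start_b = i, j
        while start_a > 0 and start_b > 0 and a[start_a - 1] == b[start_b - 1]:
            start_a -= 1
            start_b -= 1
        end_a, end_b = i + min_len, j + min_len
        while end_a < len(a) and end_b < len(b) and a[end_a] == b[end_b]:
            end_a += 1
            end_b += 1
        matches.add((start_a, start_b, end_a - start_a))
    return sorted(matches)


# Toy example with a short minimum length so the sequences stay readable.
core = "ACGGATCCGTTAGC"
seq_a = "TTTTT" + core + "AAAAA"
seq_b = "GGGGG" + core + "CCCCC"
forward = exact_matches(seq_a, seq_b, min_len=6)
reverse = exact_matches(seq_a, reverse_complement(seq_b), min_len=6)
print(forward, reverse)
```

Matches against the reverse complement are reported in the coordinates of the reversed sequence; plotting forward and reverse matches together is what produces the two arms of an X-alignment.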
Whole-genome protein comparisons
The predicted proteome of each complete genome sequence (all predicted proteins in the genome) was compared to the proteomes of all complete genome sequences (including itself) using the fasta3 program. Matches with an expected score (e-value) of 10^-5 or less were considered significant.
Statistical significance of X-alignments
To calculate the statistical significance of the X-alignments, the maximal unique matching subsequences (MUMs) for unrelated genomes were examined and found to be uniformly distributed. With a uniform background, the expected density of MUMs in any region of an alignment plot is a simple proportion of the area of that region to the entire plot. In particular, in an alignment with N total MUMs, the probability (Pr) of observing at least m matches in a region covering a fraction p of the plot area can be computed using the binomial distribution in Equation 1, where X is the number of MUMs falling in the region:
$$\Pr(X \ge m) = \sum_{k=m}^{N} \binom{N}{k}\, p^{k} (1-p)^{N-k} \qquad (1)$$
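A binomial tail probability of this kind can be evaluated numerically as in the sketch below. This is an illustration rather than the code used for the paper: the function name `binomial_tail` is hypothetical, and the sum is carried out in log space (an implementation choice made here) so that very small probabilities do not underflow prematurely.

```python
import math


def binomial_tail(n_total, m_observed, p_region):
    """Return P(X >= m_observed) for X ~ Binomial(n_total, p_region)."""
    if not 0.0 < p_region < 1.0:
        raise ValueError("p_region must lie strictly between 0 and 1")
    if m_observed <= 0:
        return 1.0
    if m_observed > n_total:
        return 0.0
    log_p = math.log(p_region)
    log_q = math.log1p(-p_region)
    # log of C(n, k) * p^k * (1 - p)^(n - k) for each k in the tail
    log_terms = [
        math.lgamma(n_total + 1)
        - math.lgamma(k + 1)
        - math.lgamma(n_total - k + 1)
        + k * log_p
        + (n_total - k) * log_q
        for k in range(m_observed, n_total + 1)
    ]
    peak = max(log_terms)
    return math.exp(peak) * sum(math.exp(t - peak) for t in log_terms)


# 926 MUMs in total and a region covering 10% of the plot; the count of
# 200 MUMs inside the region is a made-up value for illustration only.
print(binomial_tail(926, 200, 0.10))
```

Under a uniform background roughly 10% of the MUMs would be expected in such a region, so a count far above that yields a very small tail probability.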
The alignment of V. cholerae chrI (both forward and reverse strands) versus E. coli contains 926 MUMs. The MUMs forming X-alignments appear along the diagonal (y = x) and the anti-diagonal (y = L - x, where L is the genome length). To estimate the significance of the alignments in both directions, diagonal strips were sampled along each of the diagonals. The width of each strip was set at 10% of the plot area and significance values were calculated (Table 1).
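One way to count MUMs in such strips is sketched below. Several details are assumptions, not statements about the original procedure: both axes are rescaled to the unit square so that genomes of different lengths share the same diagonals, each strip is taken to be a band centred on its diagonal whose area is 10% of the plot, and the circular wrap-around of the genomes is ignored. The function names and the example points are hypothetical.

```python
import math


def strip_half_width(area_fraction):
    """Half-width w of the band |v - u| <= w in the unit square with the given area.

    The band leaves two corner triangles with legs (1 - w), so its area is
    1 - (1 - w)**2; solving for w gives the expression below.
    """
    return 1.0 - math.sqrt(1.0 - area_fraction)


def count_in_strips(points, len_x, len_y, area_fraction=0.10):
    """Count points near the diagonal and near the anti-diagonal.

    points holds (x, y) match positions; len_x and len_y are genome lengths.
    """
    w = strip_half_width(area_fraction)
    on_diagonal = on_anti_diagonal = 0
    for x, y in points:
        u, v = x / len_x, y / len_y
        if abs(v - u) <= w:
            on_diagonal += 1
        if abs(u + v - 1.0) <= w:
            on_anti_diagonal += 1
    return on_diagonal, on_anti_diagonal


# Made-up match positions on two genomes of length 100 (arbitrary units).
example = [(10, 10), (10, 90), (50, 50), (10, 50)]
counts = count_in_strips(example, 100, 100)
print(counts)
```

Each count, together with the total number of MUMs and the strip's area fraction, is the input needed for a binomial significance calculation of the kind described in this section.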
Identification of origins of replication
The origins of replication for the bacterial genomes have been characterized by a variety of methods. For E. coli, M. tuberculosis and M. leprae, the origins have been well-characterized by laboratory studies [51,52]. The origins and termini of C. trachomatis, C. pneumoniae and V. cholerae were identified by GC-skew and by characteristic genes in the region of the origin [4,19]. GC-skew uses the function (G-C)/(G+C) computed on 2,000 bp windows across the genome, which exhibits a clear tendency in many bacterial genomes to be positive for the leading strand and negative for the lagging strand. The origin of H. pylori was determined by oligomer skew and confirmed by GC-skew. The origins and termini of S. pneumoniae and S. pyogenes were determined by the authors of the present study using GC-skew analysis and the locations of characteristic genes, particularly the chromosome replication initiator gene dnaA.
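The GC-skew function described above can be computed as in this minimal sketch. The use of non-overlapping windows, the handling of windows with no G or C, and the helper `sign_changes` are assumptions made for illustration; real genomes give noisy skew profiles, so candidate origin and terminus positions would still need to be checked against other evidence such as the location of dnaA.

```python
def gc_skew(seq, window=2000):
    """Return (window_start, skew) pairs with skew = (G - C) / (G + C).

    Windows are non-overlapping; a trailing partial window is ignored and
    a window containing no G or C is given a skew of 0.0.
    """
    seq = seq.upper()
    result = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        skew = (g - c) / (g + c) if (g + c) else 0.0
        result.append((start, skew))
    return result


def sign_changes(skews):
    """Return the window starts at which the skew changes sign."""
    changes = []
    for (_, previous), (start, current) in zip(skews, skews[1:]):
        if previous * current < 0:
            changes.append(start)
    return changes


# Tiny synthetic sequence with a small window so the result is easy to follow.
toy = "GGGG" + "CCCC"
profile = gc_skew(toy, window=4)
print(profile, sign_changes(profile))
```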
We thank S. Eddy, M.A. Riley, T. Read, A. Stoltzfus, M-I Benito and I. Paulsen for helpful comments, suggestions and discussions. S.L.S. was supported in part by NSF grant IIS-9902923 and NIH grant R01 LM06845. S.L.S. and J.A.E. were supported in part by NSF grant KDI-9980088. Data for all published complete genome sequences were obtained from the NCBI genomes database or from The Institute for Genomic Research (TIGR) Microbial Genome Database. The sequences of V. cholerae, S. pneumoniae, and M. tuberculosis (CDC 1551) were determined at TIGR with support from NIH and the NIAID. The M. leprae sequence data were produced by the Pathogen Sequencing Group at the Sanger Centre. Sequencing of M. leprae is funded by the Heiser Program for Research in Leprosy and Tuberculosis of The New York Community Trust and by L'Association Raoul Follereau. The M. tuberculosis CDC 1551 genome sequence was obtained from TIGR. The source of the S. pyogenes genome sequence was the Streptococcal Genome Sequencing Project funded by USPHS/NIH grant AI38406, and was kindly made available by B. A. Roe, S.P. Linn, L. Song, X. Yuan, S. Clifton, R.E. McLaughlin, M. McShan and J. Ferretti, and can be obtained from the website of the Oklahoma University Genome Center.
- Seoighe C, Wolfe KH: Updated map of duplicated regions in the yeast genome. Gene. 1999, 238: 253-261. 10.1016/S0378-1119(99)00319-4.
- Lin X, Kaul S, Rounsley S, Shea TP, Benito MI, Town CD, Fujii CY, Mason T, Bowman CL, Barnstead M, et al: Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature. 1999, 402: 761-768. 10.1038/45471.
- Mayer K, Schuller C, Wambutt R, Murphy G, Volckaert G, Pohl T, Dusterhoft A, Stiekema W, Entian KD, Terryn N, et al: Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana. Nature. 1999, 402: 769-777. 10.1038/47134.
- Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, Dodson RJ, Haft DH, Hickey EK, Peterson JD, Umayam L, et al: The genome sequence of Vibrio cholerae, the aetiologic agent of cholera. Nature. 2000, 406: 477-483. 10.1086/172459.
- Blattner FR, Plunkett GI, Bloch CA, Perna NT, Burland V, Riley M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, et al: The complete genome sequence of Escherichia coli K-12. Science. 1997, 277: 1453-1462. 10.1126/science.277.5331.1453.
- Delcher AL, Kasif S, Fleischmann RD, Peterson J, White O, Salzberg SL: Alignment of whole genomes. Nucleic Acids Res. 1999, 27: 2369-2376. 10.1093/nar/27.11.2369.
- Oklahoma University Genome Center. [http://www.genome.ou.edu/strep.html]
- Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, et al: Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature. 1998, 393: 537-544. 10.1038/31159.
- Sanger Centre Pathogen Sequencing Group. [ftp://ftp.sanger.ac.uk/pub/pathogens/leprae]
- Zipkas D, Riley M: Proposal concerning mechanism of evolution of the genome of Escherichia coli. Proc Natl Acad Sci USA. 1975, 72: 1354-1358.
- Wagner A: The fate of duplicated genes: loss or new function? BioEssays. 1998, 20: 785-788. 10.1002/(SICI)1521-1878(199810)20:10<785::AID-BIES2>3.0.CO;2-M.
- Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000, 154: 459-473.
- Nadeau JH, Sankoff D: Comparable rates of gene loss and functional divergence after genome duplications early in vertebrate evolution. Genetics. 1997, 147: 1259-1266.
- Francois V, Louarn J, Patte J, Rebollo JE, Louarn JM: Constraints in chromosomal inversions in Escherichia coli are not explained by replication pausing at inverted terminator-like sequences. Mol Microbiol. 1990, 4: 537-542.
- Rebollo JE, Francois V, Louarn JM: Detection and possible role of two large nondivisible zones on the Escherichia coli chromosome. Proc Natl Acad Sci USA. 1988, 85: 9391-9395.
- Segall A, Mahan MJ, Roth JR: Rearrangement of the bacterial chromosome: forbidden inversions. Science. 1988, 241: 1314-1318.
- Mahan MJ, Roth JR: Ability of a bacterial chromosome segment to invert is dictated by included material rather than flanking sequence. Genetics. 1991, 129: 1021-1032.
- Segall AM, Roth JR: Recombination between homologies in direct and inverse orientation in the chromosome of Salmonella: intervals which are nonpermissive for inversion formation. Genetics. 1989, 122: 737-747.
- Read TD, Brunham RC, Shen C, Gill SR, Heidelberg JF, White O, Hickey EK, Peterson J, Utterback T, Berry K, et al: Genome sequences of Chlamydia trachomatis MoPn and Chlamydia pneumoniae AR39. Nucleic Acids Res. 2000, 28: 1397-1406. 10.1093/nar/28.6.1397.
- Tettelin H, Saunders NJ, Heidelberg J, Jeffries AC, Nelson KE, Eisen JA, Ketchum KA, Hood DW, Peden JF, Dodson RJ, et al: Complete genome sequence of Neisseria meningitidis serogroup B strain MC58. Science. 2000, 287: 1809-1815. 10.1126/science.287.5459.1809.
- Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999, 151: 1531-1545.
- Schmid MB, Roth JR: Selection and endpoint distribution of bacterial inversion mutations. Genetics. 1983, 105: 539-557.
- Mahan MJ, Roth JR: Reciprocality of recombination events that rearrange the chromosome. Genetics. 1988, 120: 23-35.
- Gordon AJ, Halliday JA: Inversions with deletions and duplications. Genetics. 1995, 140: 411-414.
- Valencia-Morales E, Romero D: Recombination enhancement by replication (RER) in Rhizobium etli. Genetics. 2000, 154: 971-983.
- National Center for Biotechnology Information, Entrez Genomes. [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome]
- The Institute for Genomic Research Microbial Genome Database. [http://www.tigr.org/tdb/mdb/mdb.html]
- Kawarabayasi Y, Hino Y, Horikawa H, Yamazaki S, Haikawa Y, Jin-no K, Takahashi M, Sekine M, Baba S, Ankai A, et al: Complete genome sequence of an aerobic hyper-thermophilic crenarchaeon, Aeropyrum pernix K1. DNA Res. 1999, 6: 83-101.
- Deckert G, Warren PV, Gaasterland T, Young WG, Lenox AL, Grahams DE, Overbeek R, Snead MA, Keller M, Aujay M, et al: The complete genome of the hyperthermophilic bacterium Aquifex aeolicus. Nature. 1998, 392: 353-358. 10.1038/32831.
- Klenk H-P, Clayton RA, Tomb J-F, White O, Nelsen KE, Ketchum KA, Dodson RJ, Gwinn M, Hickey EK, Peterson JD, et al: The complete genomic sequence of the hyperthermophilic, sulfate-reducing archaeon Archaeoglobus fulgidus. Nature. 1997, 390: 364-370. 10.1038/37052.
- Kunst A, Ogasawara N, Moszer I, Albertini A, Alloni G, Azevedo V, Bertero M, Bessieres P, Bolotin A, Borchert S, et al: The complete genome sequence of the Gram-positive bacterium Bacillus subtilis. Nature. 1997, 390: 249-256. 10.1038/36786.
- Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al: Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi. Nature. 1997, 390: 580-586. 10.1038/37551.
- Parkhill J, Wren BW, Mungall K, Ketley JM, Churcher C, Basham D, Chillingworth T, Davies RM, Feltwell T, Holroyd S, et al: The genome sequence of the food-borne pathogen Campylobacter jejuni reveals hypervariable sequences. Nature. 2000, 403: 665-668. 10.1038/35001088.
- Kalman S, Mitchell W, Marathe R, Lammel C, Fan J, Hyman RW, Olinger L, Grimwood J, Davis RW, Stephens RS: Comparative genomes of Chlamydia pneumoniae and C. trachomatis. Nat Genet. 1999, 21: 385-389. 10.1038/7716.
- Stephens RS, Kalman S, Lammel C, Fan J, Marathe R, Aravind L, Mitchell W, Olinger L, Tatusov RL, Zhao Q, et al: Genome sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis. Science. 1998, 282: 754-759. 10.1126/science.282.5389.754.
- White O, Eisen JA, Heidelberg JF, Hickey EK, Peterson JD, Dodson RJ, Haft DH, Gwinn ML, Nelson WC, Richardson DL, et al: Genome sequence of the radioresistant bacterium Deinococcus radiodurans R1. Science. 1999, 286: 1571-1577. 10.1126/science.286.5444.1571.
- Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al: Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995, 269: 496-512.
- Tomb JF, White O, Kerlavage AR, Clayton RA, Sutton GG, Fleischmann RD, Ketchum KA, Klenk HP, Gill S, Dougherty BA, et al: The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature. 1997, 388: 539-547. 10.1038/41483.
- Alm RA, Ling LS, Moir DT, King BL, Brown ED, Doig PC, Smith DR, Noonan B, Guild BC, deJonge BL, et al: Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori. Nature. 1999, 397: 176-180. 10.1038/16495.
- Smith DR, Doucette-Stamm LA, Deloughery C, Lee H, Dubois J, Aldredge T, Bashirzadeh R, Blakely D, Cook R, Gilbert K, et al: Complete genome sequence of Methanobacterium thermoautotrophicum DH: functional analysis and comparative genomics. J Bacteriol. 1996, 179: 7135-7155.
- Bult CJ, White O, Olsen GJ, Zhou L, Fleischmann RD, Sutton GG, Blake JA, Fitzgerald LM, Clayton RA, Gocayne JD, et al: Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science. 1996, 273: 1058-1073.
- Fraser CM, Gocayne JD, White O, Adams MD, Clayton RA, Fleischmann RD, Bult CJ, Kerlavage AR, Sutton G, Kelley JM, et al: The minimal gene complement of Mycoplasma genitalium. Science. 1995, 270: 397-403.
- Himmelreich R, Hilbert H, Plagens H, Pirkl E, Li BC, Herrmann R: Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae. Nucleic Acids Res. 1996, 24: 4420-4449. 10.1093/nar/24.22.4420.
- Parkhill J, Achtman M, James KD, Bentley SD, Churcher C, Klee SR, Morelli G, Basham D, Brown D, Chillingworth T, et al: Complete DNA sequence of a serogroup A strain of Neisseria meningitidis Z2491. Nature. 2000, 404: 502-506. 10.1038/35006655.
- Kawarabayasi Y, Sawada M, Horikawa H, Haikawa Y, Hino Y, Yamamoto S, Sekine M, Baba S, Kosugi H, Hosoyama A, et al: Complete sequence and gene organization of the genome of a hyperthermophilic archaebacterium, Pyrococcus horikoshii OT3. DNA Res. 1998, 5: 55-76.
- Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Alsmark UC, Podowski RM, Naslund AK, Eriksson AS, Winkler HH, Kurland CG: The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature. 1998, 396: 133-140. 10.1038/24094.
- Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y, Miyajima N, Hirosawa M, Sugiura M, Sasamoto S, et al: Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res. 1996, 3: 109-136.
- Nelson KE, Clayton RA, Gill SR, Gwinn ML, Dodson RJ, Haft DH, Hickey EK, Peterson JD, Nelson WC, Ketchum KA, et al: Evidence for lateral gene transfer between Archaea and bacteria from genome sequence of Thermotoga maritima. Nature. 1999, 399: 323-329. 10.1038/20601.
- Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al: Complete genome sequence of Treponema pallidum, the syphilis spirochete. Science. 1998, 281: 375-388. 10.1007/s004410050434.
- Pearson WR: Flexible sequence similarity searching with the FASTA3 program package. Methods Mol Biol. 2000, 132: 185-219.
- Marsh RC, Worcel A: A DNA fragment containing the origin of replication of the Escherichia coli chromosome. Proc Natl Acad Sci USA. 1977, 74: 2720-2724.
- Salazar L, Fsihi H, de Rossi E, Riccardi G, Rios C, Cole ST, Takiff HE: Organization of the origins of replication of the chromosomes of Mycobacterium smegmatis, Mycobacterium leprae and Mycobacterium tuberculosis and isolation of a functional origin from M. smegmatis. Mol Microbiol. 1996, 20: 283-293.
- Lobry JR: Asymmetric substitution patterns in the two DNA strands of bacteria. Mol Biol Evol. 1996, 13: 660-665.
- Salzberg SL, Salzberg AJ, Kerlavage AR, Tomb JF: Skewed oligomers and origins of replication. Gene. 1998, 217: 57-67. 10.1016/S0378-1119(98)00374-6.