- Open Access
Population phylogenomic analysis of mitochondrial DNA in wild boars and domestic pigs revealed multiple domestication events in East Asia
Genome Biology volume 8, Article number: R245 (2007)
Previously reported evidence indicates that pigs were independently domesticated in multiple places throughout the world. However, a detailed picture of the origin and dispersal of domestic pigs in East Asia has not yet been reported.
Population phylogenomic analysis was conducted in domestic pigs and wild boars by screening the haplogroup-specific mutation motifs inferred from a phylogenetic tree of pig complete mitochondrial DNA (mtDNA) sequences. All domestic pigs are clustered into single clade D (which contains subclades D1, D2, D3, and D4), with wild boars from East Asia being interspersed. Three haplogroups within D1 are dominant in the Mekong region (D1a2 and D1b) and the middle and downstream regions of the Yangtze River (D1a1a), and may represent independent founders of domestic pigs. None of the domestic pig samples from North East Asia, the Yellow River region, and the upstream region of the Yangtze River share the same haplogroup status with the local wild boars. The limited regional distributions of haplogroups D1 (including its subhaplogroups), D2, D3, and D4 in domestic pigs suggest at least two different in situ domestication events.
The use of fine-grained mtDNA phylogenomic analysis of wild boars and domestic pigs is a powerful tool with which to discern the origin of domestic pigs. Our findings show that pig domestication in East Asia mainly occurred in the Mekong region and the middle and downstream regions of the Yangtze River.
The origin and dispersal of major domestic animals have been widely studied in recent years and great progress has been made [1–18]. Multiple origin has been revealed to be a common phenomenon in domestic animals such as cattle, goats, chicken, and horses [7–9, 12, 17, 19]. Several studies have shown that pigs were independently domesticated in various parts of the world [5, 16, 20–25]. The time of divergence between European and Asian pig mitochondrial DNAs (mtDNAs) was long before the time of possible pig domestication, which supported the independent origin of domestic pigs in Europe and Asia [5, 26]. By analyzing the mtDNA control region (D-loop) sequences of worldwide wild boars, domestic pigs, and ancient specimens, recent studies conducted by Larson and coworkers [16, 27, 28] have revealed a schematic profile concerning the origin of wild boars and their dispersal and domestication across Eurasia, as well as the Neolithic expansion in Island South East Asia and Oceania. However, because of the small sample size from East Asia and the limited resolution of phylogeny based on partial mtDNA D-loop sequences of pigs, a detailed picture of the origin and dispersal of domestic pigs in East Asia is still to be developed.
To investigate where the East Asian pigs were domesticated and to reconstruct their early dispersal history, we conducted a population phylogenomic analysis of wild boars and domestic pigs by applying a strategy consisting of the following steps. First, we sequenced 670 base pair (bp) fragments of the mtDNA D-loop region in 567 domestic pigs and 155 wild boars across China, South East Asia, and India. Then, we selected 24 wild boars and domestic pigs and analyzed their entire mtDNA sequences. Each of these 24 samples represented a unique haplotype in the major clades observed in a neighbor-joining (NJ) tree of the 722 D-loop sequences (Additional data file 1). Employing a strategy that has been well described by us and others in anthropologic studies [29–38], haplogroup-specific mutation motifs (a string of characteristic mutations shared exclusively by its members) for the respective haplogroups (monophyletic groups or clades in the tree) were inferred from the phylogenetic tree of 42 (near) complete Asian pig mtDNA sequences determined in this study and from published sources. The haplogroup-specific motifs were further screened in all domestic pigs and wild boars to justify the inferred haplogroup status of each sample based on the available information. Finally, all our samples and previously published mtDNA data were assigned to haplogroups based on the haplogroup-specific mutation motifs in the sequence. This fine-grained phylogeographic analysis of matrilineal components of wild boars and domestic pigs provided new insights into the origin and domestication of pigs in East Asia.
Results and discussion
mtDNA control region sequences (670 bp) of 567 domestic pigs and 155 wild boars across China, South East Asia, and India were determined. A preliminary phylogenetic analysis of the 119 haplotypes of these D-loop sequences was performed and revealed several clades in the tree (Additional data file 1). Then, the complete mtDNAs of 24 samples of wild boars and domestic pigs, each representing a unique haplotype in the major clades in this tree, were selected and sequenced. It should be noted that although the choice of the specific representative samples within certain clade was selected at random, each sample from the same clade has equal potential to allow us to determine the schematic backbone and mutation motif of this clade/haplogroup. The African warthog is well known to be distinct from Eurasian wild boars and has frequently been used as the outgroup in previous phylogenetic studies of pigs [16, 25, 26, 39–42]. In the present study we completely sequenced an mtDNA of African warthog (Phacochoerus africanus) and used it as the outgroup to root the mtDNA genome tree.
The NJ tree of 50 mtDNA genomes (including 24 published near complete sequences; Additional data file 2) revealed two major clades, E and A, which represent the wild boars (Sus scrofa) and domestic pigs from Europe and East Asia, respectively (Figure 1). Within clade A, all Asian domestic pig mtDNAs were further clustered into a single clade D, with wild boars from this region intermingled (Figure 1 and Additional data file 3). Phylogenetic trees constructed using other methods, such as maximum parsimony and Bayesian estimation, exhibited similar topology for all major clades in the NJ tree (Additional data file 4) and further confirmed the monophyletic position of East Asian domestic pigs and wild boars. The phylogenetic position of the newly sequenced Malaysia wild boar (Sus barbatus) fell outside the macro-clade containing Eurasian samples.
The information read from the mtDNA genome tree enabled us to conduct a phylogenomic analysis for wild boars and domestic pigs. By detecting the haplogroup unique mutation motif (Additional data file 3), each mtDNA could be allocated to the smallest named haplogroup to which it belongs. For instance, haplogroup D1 was characterized by five mutations at sites 500, 2374, 11432, 12064, 16301, whereas its subhaplogroup D1a was defined further by the additional variant at site 14198 (Additional data file 3). By screening these haplogroup-specific mutations in each mtDNA, it could reliably be classified into haplogroup D1 based on the presence of the five D1 specific mutations and further into D1a if it also harbored the mutation at site 14198. Based on our established haplogroup classification system, published pig mtDNA cytochrome (Cyt) b and D-loop sequences (Additional data file 5) were also tentatively classified by haplogroup-specific motif recognition and/or a matching or near-matching strategy [30, 34, 35, 38] with the mtDNAs determined in this study.
In agreement with the phylogenetic pattern discerned in the tree of complete mtDNA sequences, all 1,096 mtDNA sequences of East Asian wild boars and domestic pigs could be classified into clade A (Additional data file 5). The wild boars from Island South East Asia reported in a previous study by Larson and coworkers  could not be assigned into clades E and A defined in the present study (Additional data file 5 [Table S5]), confirming their basal phylogenetic positions [16, 28].
The resulting mtDNA haplogroup classification of wild boars and domestic pigs and their sampling locations revealed sympatric distribution of both wild boars and domestic pigs. Samples from different places/breeds could be grouped into one haplogroup, whereas samples from the same location were assigned to different nested haplogroups (Additional data files 5 and 6, and Tables 1 and 2). Some wild boars from South Asia fell outside of macro-clade containing clades E and A, whereas other wild boars from this region could be classified into clades A and E (Additional data file 4 and 6, and Table 1). Within East and South East Asia, the matrilineal pool of wild boars from the Mekong region contained nearly all the main lineages presented in other regions (Additional data file 6 and Table 1). Furthermore, genetic diversity of wild boars from this region was much greater than that in other regions (except for the upstream region of the Yangtze River [URYZ; Additional data file 7], in which diversity was comparable but this region had a different proportion of matrilineal components). Although high diversity in a region may be caused by influx of haplotypes from different regions, this possibility might be very low here because it also applies to other regions, whereas we failed to observe many centers of diversity.
Most wild boars from the middle and downstream region of the Yangtze River (MDYZ) were clustered into the nested haplogroups within haplogroup A1a, particularly in haplogroup D (Additional data file 6 and Table 1). All wild boars from North East Asia (NEA) belonged to haplogroup D (Additional data file 6 and Table 1), suggesting potential derivation from the matrilineal pool of region MDYZ caused by the natural movement of wild boar. Wild boar lineages from the upstream and middle region of the Yellow River (UMYR) were a subset of region URYZ (Additional data file 6 and Table 1). Overall, the population structure of URYZ and UMYR were distinct from MDYZ and NEA populations; both URYZ and UMYR populations contained more basal lineages, whereas the MDYZ and NEA populations harbored a large proportion of recently derived lineages (Additional data file 6 and Table 1). Under the hypothesis of selective neutrality and population equilibrium, Tajima's D and Fu's Fs test values tend to be negative under an excess of recent mutations, which is regarded as evidence of population growth [43, 44]. The P values of the Fs test established by Fu  for all wild boar samples belonging to haplogroups D1 and D all indicated statistical significance (Table 3), suggesting population expansion in the past. Taken together, the above haplogroup distribution pattern suggests that East Asian wild boar lineages were most likely derived from the Mekong region population and dispersed via two main routes: one route is through the Yangtze River region to NEA, and the other is through URYZ to UMYR (Additional data file 6). The difference between the current population structures of wild boars in these regions might be shaped by the early dispersal of the wild boars out of the Mekong region.
Our classification analysis of the published Ryukyu island wild boar samples suggested that none of these samples belonged to haplogroup D, which was dominant in the adjacent regions, such as MDYZ, Taiwan, and Japanese islands (Additional data files 5 and 6). This unique distribution pattern might be attributed to insufficient sampling of Ryukyu island wild boars, or wild boars might have dispersed to Ryukyu Islands before the arrival of haplogroup D and were subsequently isolated from the adjacent regions. This latter scenario is consistent with the suggestion of a different origin of wild boars in Japanese islands and Ryukyu islands .
In the phylogenetic tree of complete mtDNA sequences, all East Asian domestic pig mtDNAs were clustered into single subclade D, with wild boars from this region interspersed (Figure 1 and Additional data files 3 and 4). The haplogroup classification of all available East Asian domestic pigs also uniformly referred to haplogroup D, which contains four subhaplogroups: D1, D2, D3, and D4. However, only part of wild boar samples in this region could be allocated to haplogroup D (Additional data files 5 and 6, and Table 1). This pattern suggests that East Asian domestic pigs originated from a subset of the wild boar genetic pool that was characterized by haplogroup D. Direct comparison of the geographic distribution between wild boars and domestic pigs can provide clues regarding the domestication of East Asian pigs. All domestic pig samples from regions NEA, URYZ, UMYR, and the downstream region of the Yellow River (DRYR), clustered within haplogroup D1 (Additional data file 6 and Figure 2). The wild boars from region NEA belonged to haplogroups D3 and D1f (excluding one unique yet unassigned D1 haplotype because of absence of coding region information), and did not share any haplotype with all of the domestic pigs from this region (Figure 2, Tables 1 and 2, and Additional data files 5 and 6). None of the 32 wild boars from region UMYR belonged to haplogroup D1 (Additional data file 6 and Table 1), although archeologic assemblages from this region exhibit signs of pig domestication during the Neolithic period . Among the 32 wild boars from region URYZ, only one individual could be assigned to D1 (Additional data file 6 and Table 1). In contrast, the wild boars in the Mekong region and region MDYZ extensively shared haplotypes with domestic pigs that were clustered into different haplogroups (Figure 2 and Additional data files 5 and 6): haplogroups D2, D3, D4, D1a2, D1b, and D1c contained both wild boars and domestic pigs from the Mekong region; and haplogroups D1a1a, D1d, and D1e contained both wild boars and domestic pigs from region MDYZ. These distinct phylogeographic patterns of wild boars and domestic pigs indicate that domestication events might have occurred mainly in the Mekong region and the MDYZ region.
The limited regional distributions of some haplogroups in domestic pigs, such as D2, D3, and D4, would suggest in situ domestication (Additional data file 6 and Table 2). Domestic pigs belonging to haplogroups D2, D3, and D4 were mainly found in the Mekong region, whereas only a small portion of domestic pigs (three samples) belonging to these three haplogroups was found in region MDYZ. Furthermore, wild boars from the Mekong region harbored lineages belonging to all three haplogroups, whereas wild boars from other places only contained one or two of the three haplogroups (Additional data file 6). None of 12 South Asian wild boars from a previous study  could be assigned to haplogroup A1 (including its subhaplogroups), suggesting that they made no contribution to domestic pigs in haplogroups D2, D3, and D4. Four out of six Indian domestic samples belonged to haplogroups D2 and D3 (the other two domestic samples had same A* status as wild boars from this region and the one reported by Larson and coworkers  shared an A* haplotype with the wild boars from this region; see Additional data file 5 [Tables S1 and S6]); and nine Australian feral pigs, seven New Zealand domestic pigs, and three European domestic pigs could be classified into D2, D3, and D4. It is possible that these lineages were derived from the same matrilineal pool with the domestic pigs from the Mekong region (Additional data file 6). The distinguished regional distribution pattern of haplogroups D2, D3, and D4 in domestic pigs and wild boars suggested that they probably originated from the Mekong region and/or adjacent regions that we did not sample in the present study.
Haplogroup D1 harbors more than 90% of the domestic animals, which were widely distributed in various regions in East Asia (Table 2, and Additional data files 5 and 6). The regional distribution of the main founders belonging to this haplogroup was depicted in a reduced median network (Figure 2). The main subhaplogroups in haplogroup D1 were almost equidistant to their coalescent ancestral root type. Some of them, such as haplogroups D1a1a, D1a2, and D1b, exhibited a star-like profile that was typical of exponential population growth. Each of these haplogroups harbored one or two widely distributed major haplotypes, which were also found in wild boars and had many one-mutation or two-mutation distance derivatives that were detected exclusively in domestic pigs. Our estimations for domestic pigs in the major haplogroups within D all revealed negative Tajima's D and Fu's Fs test values (not including D4 and D1c). The P values of the Fs test indicated statistical significance for haplogroups D1a1a (Tajima's D test was also statistically significant for this haplogroup), D1a2, and D1b, suggesting potential population expansion in the past (Table 3). Taken together, haplotypes within each of these haplogroups might have originated from their major (central) haplotypes as results of domestication events followed by subsequent expansion. Tracing the geographic distribution pattern of these haplogroups might reveal more information about the domestication events, as discussed below.
Most of the domestic pigs in haplogroups D1a2 and D1b were from the Mekong region and the URYZ region, whereas wild boars in these two haplogroups were exclusively from the Mekong region (Additional data file 6, Figure 2, and Tables 1 and 2). A small portion of domestic pigs (18.5%) in region UMYR belonged to D1a2 and shared haplotypes (excluding two D1a2 individuals) with samples from the Mekong region. However, there are only a few domestic pigs from region MDYZ in haplogroups D1a2 and D1b (three D1a2 types and three D1b types; all sharing haplotypes with the samples from the Mekong region). None of the domestic pigs in the other regions, such as South China (SC), region DRYR, and region NEA, belonged to D1a2 and D1b. This unique genetic pattern of haplogroups D1a2 and D1b suggested that they might have originated in the Mekong region and then dispersed northward to regions URYZ and UMYR (Additional data file 6 and Figure 2). The shared D1a2 and D1b mtDNA types between the MDYZ region and the Mekong region might have been introduced from the Mekong region after the initial domestication.
More than half of the domestic pigs in haplogroup D1a1a were from regions DRYR and MDYZ (Additional data file 6, Figure 2, and Table 2). Fourteen wild boars in haplogroup D1a1a were only found in region MDYZ. Domestic pigs in region MDYZ also possessed a greater number of D1a1a haplotypes and unique haplotypes than did samples from other regions (Table 2). Thus, the D1a1a domestic pigs might have originated from the wild boar population in region MDYZ, which was regarded as one of the origin and dispersal centers of cultivated rice and the agriculture civilization of East Asia [47–49]. Most of the domestic individuals from regions NEA and DRYR shared haplotypes with pigs from region MDYZ, which suggested that domestic pigs from these two regions were most likely derived from the MDYZ pool (Additional data file 6 and Figure 2).
By reanalyzing them with previously reported data, the new data generated in the present study could yield some valuable insights into pig origin in Japan and Vietnam. Pig husbandry was interrupted from the 8th century to the late 19th century on mainland Japan . Evidence of the origin of Japanese domestic pigs was mainly estimated from cultural records and ancient DNA studies. Recent studies of ancient DNA conducted in pig and wild boar remains from the Japanese mainland and islands suggested that Japanese domestic pigs were introduced from China [15, 50, 51].
Based on the haplogroup classification system established in this study, the Sakhalin pig ancient DNAs from the Kabukai A site (centuries 5 to 8 AD) of the Okhotsk cultural area  and the ancient DNAs from the Jomon period (6,100 to 1,700 years old)  could be classified: 16 haplotypes fell outside haplogroup D; ten haplotypes belonged to haplogroupd D but could not be assigned into its defined subhaplogroups; and five, one, and four haplotypes belonged to haplogroups D3, D4, and D1, respectively (Additional data file 5 [Table S3]). None of these ancient DNAs shared haplotypes with East Asian domestic pigs (excluding one sequence that shared a haplotype with domestic pigs; Additional data file 6 and Additional data file 5 [Table S3]). Similar matrilineal components were also found among local wild boars (Additional data file 6). Therefore, most of these ancient DNAs were more closely related to local and North East Asia wild boars than to East Asia domestic pigs. Ancient DNA of Sus scrofa specimens from Ryukyu Shimizu shell midden (Yayoi-Heian period; 1,700 to 2,000 before present)  and Ryukyu wild boars belonged to haplogroup A1b, which diverged earlier than haplogroup D (Table 1 and Additional data file 6). It is thus clear that these ancient DNA might not be the domestic pigs introduced from the Asian continent in the early Yayoi-Heian period. More archaeologic evidence and genetic data from Japan and its adjacent continental regions are necessary to refine further the origin of Japanese domestic pigs.
A previous study of pigs in Vietnam showed that large Vietnamese pigs were wild boars and had close genetic affinity to Ryukyu wild boars, whereas small Vietnamese pigs were domestic pigs and closely related to East Asian domestic pigs, suggesting a local domestication or direct introduction from Southwest China . Reanalysis of these data showed that large Vietnamese pigs shared the same lineage (A1b) with the Ryukyu wild boars and wild boars from the Mekong region, URYZ, and UMYR (Additional data file 5 [Table S1]). Among the haplotypes identified in small Vietnamese pigs, one haplotype belonged to D1a1a and the other haplotypes belonged to haplogroups presented in the Mekong region, such as D3 and D1b (Additional data file 5 [Table S1]). Six haplotypes were also found in small pigs from Yunnan, China, and pigs from Laos (Additional data file 5 [Table S1]). Our reanalysis of these Vietnamese pig mtDNAs further demonstrated that large pigs and small pigs from this region had different matrilineal components; thus, in general it supports the previous claim that small Vietnamese pigs were introduced from China and the Mekong region, whereas large Vietnamese pigs were local wild boars .
In the present study, use of a phylogeny of complete mtDNA sequences allowed us to conduct a fine-grained phylogeographic analysis of the Asian domestic pigs and wild boars and to reappraise the published data. This approach could also be utilized to elucidatde the origin of other domestic animals such as chicken, cattle, sheep, and goats. Our findings indicate that the current domestic pig regional pools in East Asia originated from a subset of wild boar matrilineal components belonging to haplogroup D. These major matrilineal components in domestic pigs, such as D2, D3, D4, D1b, and D1a2, were probably domesticated in the Mekong region. Region MDYZ might also be a domestication center for lineages in D1a1a, D1d, and D1e. The initial domesticated pool was composed of a number of founders (at least ten) and underwent subsequent northward dispersal but with limited admixture.
Materials and methods
In total, 567 domestic pigs and 155 wild boars from China, South East Asia, and India were collected and analyzed for mtDNA control region sequence variation. Among them, 24 samples (not including one African warthog [Phacochoerus africanus] and a wild boar sample from Maylasia [Sus barbatus]; Additional data file 2) were selected for complete mtDNA sequencing. The published pig and wild boar mtDNA sequences in Asia [5, 15, 16, 26, 28, 40, 41, 45, 51–61] were retrieved from GenBank and were reanalyzed (Additional data file 5). The ancient DNA sequences from Japanese islands [15, 51] were only scored according to the haplotype information because the precise number of individuals sharing a haplotype was not listed in the original reports or GenBank deposits.
DNA extraction, PCR amplification, and sequencing
Genomic DNA was extracted from whole blood, tissue, and/or hair using the standard phenol/chloroform method. The mtDNA control region sequence (670 bp) was amplified using primer pair H695 (5'-CTCTTGCTCCACCATCAGC-3') and L99 (5'-AAACTATATGTCCTGAAACC-3'). Complete mtDNA sequences were amplified and sequenced using different combinations of 36 to 40 pairs of primers (Additional data file 8). Polymerase chain reaction (PCR) products were purified on spin columns (Watson BioTechnologies, Shanghai, China) and sequenced by using BigDye Terminator sequencing kit (Applied Biosystems, Foster City, California, USA). Sequencing was performed on a 377 and 3700 DNA sequencer (Applied Biosystems). Sequences were edited by using the DNASTAR software (DNAstar Inc. Madison, Wisconsin, USA) and mutations were scored relative to a reference sequence (individual Saba722; accession number EF545567) determined in the present study. All mtDNA D-loop and complete genome sequences have been submitted to GenBank (accession numbers DQ409327, DQ496251 to DQ497000, and EF545567 to EF545593).
An unrooted NJ tree was initially constructed based on the haplotypes (670 bp fragments) in all 722 samples. Twenty-four samples of wild boars and domestic pigs, each representing a unique haplotype from the major clades in the NJ tree, were selected for complete mtDNA sequencing. An African warthog was sequenced and used as the outgroup for phylogenetic analyses of the complete mtDNA sequences. The phylogenetic consensus tree of 50 mtDNA complete and near complete sequences (one warthog, one Malaysia wild boar, six European pigs and wild boars [note that sequences AF034253 and NC_000845  should refer to the same sample], and 42 Asian pigs and wild boars; Additional data file 2) was constructed by using the NJ method in Phylogenetic Analysis Using Parsimony (PAUP) 4.0 β  with the model of HKY + I + G (shape α = 1.0714; Pinvar = 0.7512), as recommended by Modeltest 3.6 . The maximum parsimony tree of these mtDNA sequences was constructed by using the branch-and-bound search with a tree bisection-reconnection (TBR) branch-swapping option in PAUP. Robustness of the nodes was assessed by the bootstrap method after 1,000 replications (bootstrap option with heuristic search in PAUP) by adding sequences randomly. Bayesian inference tree was constructed by MrBayes 3.1  with the general time reversible (GTR) model. In an initial run, the likelihood of the cold chain stopped increasing and began to fluctuate randomly within a more or less stable range after 10,000 generations; this suggests that the run may have reached stationarity. Three independent runs (each with 1 million generations) were performed. Each run was started from a randomly chosen but different tree. All of these runs yielded similar estimates of substitution model parameters, topology, and branch lengths (Additional data file 4).
We denoted the principal clades (or haplogroups) that emerged in the phylogenetic tree by capital letters (for example, clade E [European] and clade A [Asian]). For the prominent clade containing all East Asian domestic pigs and some wild boars, we designated it by the capital letter D. For other subclades of clade A and the subclades within D, a hierarchical haplogroup nomination system was used, as for human mtDNA [30, 32–34]. Thus, the code signifies the nested haplogroup relationships (for example, D1a1a ⊂ D1a1 ⊂ D1a ⊂ D1⊂ D ⊂ A1a ⊂ A1 ⊂ A.
Each haplogroup was composed of a cohort mtDNAs that shared a string of characteristic mutations, which could be read from the complete mtDNA tree [29, 30, 32, 36, 37] (Additional data file 3). We screened the haplogroup-specific mutations in all of our samples to justify the haplogroup assignment of each sample. If a mtDNA could be assigned to a haplogroup but could not be further assigned to its specific subhaplogroups, then an asterisk (*) is attached to the haplogroup name that refers to the mtDNA under consideration, in order to emphasize that the haplogroup status of the mtDNA cannot be specified further (relative to the classification tree) [33, 34]. For example, haplogroup A has two named subhaplogroups A1 and A2, and the Indian wild boars could be assigned to haplogroup A based on the available sequence variation information, but they could not be further assigned to A1 or A2 and lacked the necessary mtDNA coding region information to identify a new haplogroup nested in A. Therefore, they were left unassigned and denoted A*. Thus, A* contains all mtDNAs that were grouped into A but fell outside A1 and A2 (A* = A - A1 - A2).
After each mtDNA was classified into its respective haplogroup (Additional data file 5), the haplogroup distribution frequency in each geographic region (see below) was estimated. The published pig mtDNA Cyt b sequences [5, 45] were tentatively assigned to haplogroups according to the established classification system. The reported partial mtDNA D-loop sequences [5, 15, 16, 26, 40, 41, 45, 51–55, 58, 59, 65] were also classified by a matching or near matching strategy with the mtDNAs determined in this study as well as by mutation motif recognition, as described in human mtDNA studies [30, 34, 35, 38].
Geographic group classification
We grouped the samples into the following groups according to geographic fauna and possible pig domestication sites (Additional data file 6 and Figure 2). The Mekong region includes northwest, south and southeast Yunnan, China, Burma, Laos, north Vietnam, and north Thailand. Region URYZ includes Sichuan, Chongqing, Guizhou, north and northeast Yunnan, northwest Guangxi, west Hebei, and northwest Hunan. Region MDYZ includes east Hubei, northeast Hunan, Anhui, Jiangxi, Fujian, Zhejiang, Jiangsu, and Shanghai. Fourth, region SC includes Guangdong, south and southeast Guangxi, south Hunan, southwest Fujian, and Hainan. Region UMYR includes Gansu, east Qinghai, northwest Sichuan, south Inner Mongolia, Ningxia, Shaanxi, Shanxi, and west Henan. Region DRYR includes east Henan, Hebei, and Shandong. Region NEA includes Jilin, Liaoning, Heilongjiang, northeast Inner Mongolia, southeast Siberia, and Korea. Region SPI (South Pacific Islands) includes South Pacific Islands and the Malay Peninsula. Region AN includes feral pigs in Australia and New Zealand. 'Other' includes domestic pigs with Asian mtDNA type found in Europe, Australia, New Zealand, and America.
To provide more detailed information on the phylogeographic relationship among the wild boars and domestic pigs belonging to haplogroup D1, which contained most of the samples analyzed in this study, a reduced median network  was constructed by using Network 4.1 .
Estimation of population expansion
Tajima's D test  and Fu's Fs test  was employed to test whether neutrality holds (the population under study evolves with a constant effective population size, all mutations being selectively neutral) by using Arlequin 3.1 . A population that has experienced population expansion may result in a rejection of the null hypothesis. We also estimated the haplotype diversity (h) and nucleotide diversity (π)  for main haplogroups nested in D1 using DnaSP 4.0 .
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 shows an unrooted NJ tree of 119 mtDNA D-loop sequence haplotypes identified in 722 wild boar and domestic pig samples. Additional data file 2 provides the sample information for the complete mtDNAs analyzed in this study. Additional data file 3 shows the classification tree of 42 (near) complete mtDNA sequences in clade A in Figure 1. Additional data file 4 shows the phylogenetic trees constructed using the maximum parsimony and the Bayesian methods. Additional data file 5 contains six tables (listing mtDNA sequence information and haplogroup classification of wild boars and domestic pigs analyzed in this study. Additional data file 6 shows the phylogeographic distribution of haplogroups and hypothetical dispersal routes of East Asian wild boars and domestic pigs. Additional data file 7 is a table showing genetic diversity of samples in each geographic region. Additional data file 8 is a table listing all of the primers used for pig complete mtDNA sequencing and haplogroup motif detection.
downstream region of the Yellow River
middle and downstream region of the Yangtze River
North East Asia
Phylogenetic Analysis Using Parsimony
polymerase chain reaction
upstream and middle region of the Yellow River
upstream region of the Yangtze River.
Loftus RT, MacHugh DE, Bradley DG, Sharp PM, Cunningham P: Evidence for two independent domestications of cattle. Proc Natl Acad Sci USA. 1994, 91: 2757-2761. 10.1073/pnas.91.7.2757.
Bradley DG, MacHugh DE, Cunningham P, Loftus RT: Mitochondrial diversity and the origins of African and European cattle. Proc Natl Acad Sci USA. 1996, 93: 5131-5135. 10.1073/pnas.93.10.5131.
Fumihito A, Miyake T, Takada M, Shingu R, Endo T, Gojobori T, Kondo N, Ohno S: Monophyletic origin and unique dispersal patterns of domestic fowls. Proc Natl Acad Sci USA. 1996, 93: 6792-6795. 10.1073/pnas.93.13.6792.
Yu Y, Nie L, He Z-Q, Wen J-K, Jian C-S, Zhang Y-P: Mitochondrial DNA variation in cattle of south China: origin and introgression. Anim Genet. 1999, 30: 245-250. 10.1046/j.1365-2052.1999.00483.x.
Giuffra E, Kijas JM, Amarger V, Carlborg O, Jeon JT, Andersson L: The origin of the domestic pig: independent domestication and subsequent introgression. Genetics. 2000, 154: 1785-1791.
Zeder MA, Hesse B: The initial domestication of goats (Capra hircus) in the Zagros mountains 10,000 years ago. Science. 2000, 287: 2254-2257. 10.1126/science.287.5461.2254.
Luikart G, Gielly L, Excoffier L, Vigne JD, Bouvet J, Taberlet P: Multiple maternal origins and weak phylogeographic structure in domestic goats. Proc Natl Acad Sci USA. 2001, 98: 5927-5932. 10.1073/pnas.091591198.
Troy CS, MacHugh DE, Bailey JF, Magee DA, Loftus RT, Cunningham P, Chamberlain AT, Sykes BC, Bradley DG: Genetic evidence for Near-Eastern origins of European cattle. Nature. 2001, 410: 1088-1091. 10.1038/35074088.
Vilà C, Leonard JA, Gotherstrom A, Marklund S, Sandberg K, Liden K, Wayne RK, Ellegren H: Widespread origins of domestic horse lineages. Science. 2001, 291: 474-477. 10.1126/science.291.5503.474.
Hanotte O, Bradley DG, Ochieng JW, Verjee Y, Hill EW, Rege JE: African pastoralism: genetic imprints of origins and migrations. Science. 2002, 296: 336-339. 10.1126/science.1069878.
Jansen T, Forster P, Levine MA, Oelke H, Hurles M, Renfrew C, Weber J, Olek K: Mitochondrial DNA and the origins of the domestic horse. Proc Natl Acad Sci USA. 2002, 99: 10905-10910. 10.1073/pnas.152330099.
Bruford MW, Bradley DG, Luikart G: DNA markers reveal the complexity of livestock domestication. Nat Rev Genet. 2003, 4: 900-910. 10.1038/nrg1203.
Beja-Pereira A, England PR, Ferrand N, Jordan S, Bakhiet AO, Abdalla MA, Mashkour M, Jordana J, Taberlet P, Luikart G: African origins of the domestic donkey. Science. 2004, 304: 1781-10.1126/science.1096008.
Lindgren G, Backström N, Swinburne J, Hellborg L, Einarsson A, Sandberg K, Cothran G, Vilà C, Binns M, Ellegren H: Limited number of patrilines in horse domestication. Nat Genet. 2004, 36: 335-336. 10.1038/ng1326.
Watanobe T, Ishiguro N, Nakano M, Matsui A, Hongo H, Yamazaki K, Takahashi O: Prehistoric Sado Island populations of Sus scrofa distinguished from contemporary Japanese wild boar by ancient mitochondrial DNA. Zoolog Sci. 2004, 21: 219-228. 10.2108/zsj.21.219.
Larson G, Dobney K, Albarella U, Fang M, Matisoo-Smith E, Robins J, Lowden S, Finlayson H, Brand T, Willerslev E, et al: Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science. 2005, 307: 1618-1621. 10.1126/science.1106927.
Liu Y-P, Wu G-S, Yao Y-G, Miao Y-W, Luikart G, Baig M, Beja-Pereira A, Ding Z-L, Palanichamy M, Zhang Y-P: Multiple maternal origins of chickens: out of the Asian jungles. Mol Phylogenet Evol. 2006, 38: 12-19. 10.1016/j.ympev.2005.09.014.
Chen S-Y, Duan Z-Y, Sha T, Xiangyu J, Wu S-F, Zhang Y-P: Origin, genetic diversity, and population structure of Chinese domestic sheep. Gene. 2006, 376: 216-223. 10.1016/j.gene.2006.03.009.
Lai S-J, Liu Y-P, Liu Y-X, Li X-W, Yao Y-G: Genetic diversity and origin of Chinese cattle revealed by mtDNA D-loop sequence variation. Mol Phylogenet Evol. 2006, 38: 146-154. 10.1016/j.ympev.2005.06.013.
Zeuner FE: A History of Domesticated Animals. 1963, London, UK: Thames and Hudson
Herre W: The Science and History of Domestic Animals. 1969, London, UK: Thames and Hudson
Eusebio AN: Animal Genetic Resources in the Philippines. 1980, Japan: Tropical Agriculture Research Centre, Ministry of Agriculture, Forestry and Fisheries
Ma RC: Animal Genetic Resources in Taiwan. 1980, Japan: Tropical Agriculture Research Centre, Ministry of Agriculture, Forestry and Fisheries
Groves CP: Ancestors for the Pigs: Taxonomy and Phylogeny of the Genus Sus. 1981, Canberra, Australia: Australian National University Press
Fang M, Andersson L: Mitochondrial diversity in European and Chinese pigs is consistent with population expansions that occurred prior to domestication. Proc Biol Sci. 2006, 273: 1803-1810. 10.1098/rspb.2006.3514.
Kijas JM, Andersson L: A phylogenetic study of the origin of the domestic pig estimated from the near-complete mtDNA genome. J Mol Evol. 2001, 52: 302-308.
Bellwood P, White P: Response: domesticated pigs in Eastern Indonesia. Science. 2005, 309: 381-10.1126/science.309.5733.381a.
Larson G, Cucchi T, Fujita M, Matisoo-Smith E, Robins J, Anderson A, Rolett B, Spriggs M, Dolman G, Kim TH, et al: Phylogeny and ancient DNA of Sus provides insights into neolithic expansion in Island Southeast Asia and Oceania. Proc Natl Acad Sci USA. 2007, 104: 4834-4839. 10.1073/pnas.0607753104.
Torroni A, Achilli A, Macaulay V, Richards M, Bandelt H-J: Harvesting the fruit of the human mtDNA tree. Trends Genet. 2006, 22: 339-345. 10.1016/j.tig.2006.04.001.
Kong Q-P, Bandelt H-J, Sun C, Yao Y-G, Salas A, Achilli A, Wang C-Y, Zhong L, Zhu C-L, Wu S-F, et al: Updating the East Asian mtDNA phylogeny: a prerequisite for the identification of pathogenic mutations. Hum Mol Genet. 2006, 15: 2076-2086. 10.1093/hmg/ddl130.
Sun C, Kong Q-P, Palanichamy Mg, Agrawal S, Bandelt H-J, Yao Y-G, Khan F, Zhu C-L, Chaudhuri T-K, Zhang Y-P: The dazzling array of basal branches in the mtDNA macrohaplogroup M from India as inferred from complete genomes. Mol Biol Evol. 2006, 23: 683-690. 10.1093/molbev/msj078.
Kong Q-P, Yao Y-G, Sun C, Bandelt H-J, Zhu C-L, Zhang Y-P: Phylogeny of East Asian mitochondrial DNA lineages inferred from complete sequences. Am J Hum Genet. 2003, 73: 671-676. 10.1086/377718.
Richards MB, Macaulay VA, Bandelt H-J, Sykes BC: Phylogeography of mitochondrial DNA in western Europe. Ann Hum Genet. 1998, 62: 241-260. 10.1046/j.1469-1809.1998.6230241.x.
Yao Y-G, Kong Q-P, Bandelt H-J, Kivisild T, Zhang Y-P: Phylogeographic differentiation of mitochondrial DNA in Han Chinese. Am J Hum Genet. 2002, 70: 635-651. 10.1086/338999.
Yao Y-G, Kong Q-P, Wang C-Y, Zhu C-L, Zhang Y-P: Different matrilineal contributions to genetic structure of ethnic groups in the silk road region in China. Mol Biol Evol. 2004, 21: 2265-2280. 10.1093/molbev/msh238.
Palanichamy Mg, Sun C, Agrawal S, Bandelt H-J, Kong Q-P, Khan F, Wang C-Y, Chaudhuri TK, Palla V, Zhang Y-P: Phylogeny of mitochondrial DNA macrohaplogroup N in India, based on complete sequencing: implications for the peopling of South Asia. Am J Hum Genet. 2004, 75: 966-978. 10.1086/425871.
Macaulay V, Hill C, Achilli A, Rengo C, Clarke D, Meehan W, Blackburn J, Semino O, Scozzari R, Cruciani F, et al: Single, rapid coastal settlement of Asia revealed by analysis of complete mitochondrial genomes. Science. 2005, 308: 1034-1036. 10.1126/science.1109792.
Yao Y-G, Kong Q-P, Man X-Y, Bandelt H-J, Zhang Y-P: Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA. Mol Biol Evol. 2003, 20: 214-219. 10.1093/molbev/msg026.
Lucchini V, Meijaard E, Diong CH, Groves CP, Randi E: New phylogenetic perspectives among species of South-east Asian wild pig (Sus sp.) based on mtDNA sequences and morphometric data. J Zool. 2005, 266: 25-35. 10.1017/S0952836905006588.
Gongora J, Fleming P, Spencer PB, Mason R, Garkavenko O, Meyer JN, Droegemueller C, Lee JH, Moran C: Phylogenetic relationships of Australian and New Zealand feral pigs assessed by mitochondrial control region sequence and nuclear GPIP genotype. Mol Phylogenet Evol. 2004, 33: 339-348. 10.1016/j.ympev.2004.06.004.
Okumura N, Kurosawa Y, Kobayashi E, Watanobe T, Ishiguro N, Yasue H, Mitsuhashi T: Genetic relationship amongst the major non-coding regions of mitochondrial DNAs in wild boars and several breeds of domesticated pigs. Anim Genet. 2001, 32: 139-147. 10.1046/j.1365-2052.2001.00757.x.
Wu G-S, Pang J, Zhang Y-P: Molecular phylogeny and phylogeography of Suidae. Zool Res. 2006, 27: 197-201.
Fu Y-X: Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics. 1997, 147: 915-925.
Tajima F: Statistical methods to test for nucleotide mutation hypothesis by DNA polymorphism. Genetics. 1989, 123: 585-595.
Watanobe T, Okumura N, Ishiguro N, Nakano M, Matsui A, Sahara M, Komatsu M: Genetic relationship and distribution of the Japanese wild boar (Sus scrofa leucomystax) and Ryukyu wild boar (Sus scrofa riukiuanus) analysed by mitochondrial DNA. Mol Ecol. 1999, 8: 1509-1512. 10.1046/j.1365-294x.1999.00729.x.
Yuan J, Flad R: Pig domestication in ancient China. Antiquity. 2002, 76: 724-732.
Yan W: The origin of rice agriculture in China [in Chinese]. Agricultural Archaeol. 1982, 19-31.
Zhang Z: Origin of rice agriculture on the Middle and lower Yangtze River [in Chinese]. Agricultural Archaeol. 1998, 206-211.
Wang X, Sun C, Cai H, Zhang J: The origin and evolution of rice cultivation in China. Chinese Sci Bull. 1998, 43: 2354-2363.
Watanobe T, Ishiguro N, Nakano M, Takamiya H, Matsui A, Hongo H: Prehistoric introduction of domestic pigs onto the Okinawa Islands: ancient mitochondrial DNA evidence. J Mol Evol. 2002, 55: 222-231. 10.1007/s00239-002-2320-6.
Watanobe T, Ishiguro N, Okumura N, Nakano M, Matsui A, Hongo H, Ushiro H: Ancient mitochondrial DNA reveals the origin of Sus scrofa from Rebun Island, Japan. J Mol Evol. 2001, 52: 281-289.
Hongo H, Ishiguro N, Watanobe T, Shigehara N, Anezaki T, Long VT, Binh DV, Tien NT, Nam NH: Variation in mitochondrial DNA of Vietnamese pigs: relationships with Asian domestic pigs and Ryukyu wild boars. Zoolog Sci. 2002, 19: 1329-1335. 10.2108/zsj.19.1329.
Kim KI, Lee JH, Li K, Zhang Y-P, Lee SS, Gongora J, Moran C: Phylogenetic relationships of Asian and European pig breeds determined by mitochondrial DNA D-loop sequence polymorphism. Anim Genet. 2002, 33: 19-25. 10.1046/j.1365-2052.2002.00784.x.
Okumura N, Ishiguro N, Nakano M, Hirai K, Matsui A, Sahara M: Geographic population structure and sequence divergence in the mitochondrial DNA control region of the Japanese wild boar (Sus scrofa leucomystax), with reference to those of domestic pigs. Biochem Genet. 1996, 34: 179-189.
Ursing BM, Arnason U: The complete mitochondrial DNA sequence of the pig (Sus scrofa). J Mol Evol. 1998, 47: 302-306. 10.1007/PL00006388.
Lin CS, Sun YL, Liu CY, Yang PC, Chang LC, Cheng IC, Mao SJ, Huang MC: Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla. Gene. 1999, 236: 107-114. 10.1016/S0378-1119(99)00247-4.
Jiang SW, Giuffra E, Andersson L, Xiong YZ: Molecular phylogenetics relationship between six Chinese native pig breeds and three Swedish pig breeds from mitochondrial DNA [in Chinese]. Yi Chuan Xue Bao. 2001, 28: 1120-1128.
Yang J, Wang J, Kijas J, Liu B, Han H, Yu M, Yang H, Zhao S, Li K: Genetic diversity present within the near-complete mtDNA genome of 17 breeds of indigenous Chinese pigs. J Hered. 2003, 94: 381-385. 10.1093/jhered/esg077.
Watanobe T, Ishiguro N, Nakano M: Phylogeography and population structure of the Japanese wild boar Sus scrofa leucomystax : mitochondrial DNA variation. Zoolog Sci. 2003, 20: 1477-1489. 10.2108/zsj.20.1477.
Li C-Q, Chang Q, Chen J-Q, Zhang B-W, Zhu L-F, Zhou K-Y: Population structure and phylogeography of wild boar Sus Scrofa in Northeast Asia based on mitochondrial DNA control region variation analysis [in Chinese]. Acta Zoolog Sinica. 2005, 51: 640-649.
Lum JK, McIntyre JK, Greger DL, Huffman KW, Vilar MG: Recent Southeast Asian domestication and Lapita dispersal of sacred male pseudohermaphroditic 'tuskers' and hairless pigs of Vanuatu. Proc Natl Acad Sci USA. 2006, 103: 17190-17195. 10.1073/pnas.0608220103.
Swofford D: PAUP*: Phylogenetic Analysis Using Parsimony (and other Methods). Version 4.0b2a. 1998, Sunderland, MA: Sinauer Associates
Posada D, Crandall K: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.
Ronquist F, Huelsenbeck JP: MRBAYES 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
Alves E, Ovilo C, Rodriguez MC, Silio L: Mitochondrial DNA sequence variation and phylogenetic relationships among Iberian pigs and other domestic and wild pig populations. Anim Genet. 2003, 34: 319-324. 10.1046/j.1365-2052.2003.01010.x.
Bandelt H-J, Forster P, Sykes BC, Richards MB: Mitochondrial portraits of human populations using median networks. Genetics. 1995, 141: 743-753.
Excoffier L, Laval G, Schneider S: Arlequin ver. 3.0: An integrated software package for population genetics data analysis. Evol Bioinformatics Online. 2005, 1: 47-50.
Nei M: Molecular Evolutionary Genetics. 1987, New York, NY: Columbia University Press
Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003, 19: 2496-2497. 10.1093/bioinformatics/btg359.
We thank Drs Albano Beja-Pereira and Olivier Hanotte, and the anonymous reviewers for helpful suggestions and comments. We thank Yin-Qiu Ji for her help with experiments. We thank Felicia Yap Chai Lee for her help in sample collection. This work was supported by grants of the National Basic Research Program of China (973 Program, 2007CB815700, and 2006CB102100), Chinese Academy of Sciences (KSCX2-YW-N-018), Bureau of Science and Technology of Yunnan Province, and National Natural Science Foundation of China (30621092).
YPZ, GSW, and YGY conceived and designed the experiments. GSW, KXQ, ZLD, and HL performed the experiments. GSW, YGY, and YPZ analyzed the data. GSW, MGP, ZYD, NL, and YSC collected samples. GSW, YGY, and YPZ wrote the paper. All authors read and approved the final manuscript.
Electronic supplementary material
About this article
Cite this article
Wu, G., Yao, Y., Qu, K. et al. Population phylogenomic analysis of mitochondrial DNA in wild boars and domestic pigs revealed multiple domestication events in East Asia. Genome Biol 8, R245 (2007) doi:10.1186/gb-2007-8-11-r245
- Wild Boar
- Additional Data File
- Mekong Region
- Region North East Asia
- Wild Boar Sample