Skip to main content

Genetic features of red and green junglefowls and relationship with Indonesian native chickens Sumatera and Kedu Hitam



More than 2,500 breeds of chicken are reared throughout the world as a source of eggs or meat and as pets. The primary ancestor of the present domestic chicken is widely believed to be the red junglefowl, although genetic contributions from other junglefowls cannot be excluded entirely. The reference genome for chicken was obtained from a red junglefowl, the genetic purity of which has been debated. There is, at present, insufficient data to resolve these interesting issues.


In this study, we performed whole-genome sequencing to compare various species and breeds of chicken, including wild red and green junglefowl, as well as the Indonesian native chickens Sumatera and Kedu Hitam and their respective descendants, the American Black Sumatra and Black Java. The data indicate that wild junglefowls have retained their genetic identity, but the Indonesian and American breeds have not. The Black Sumatra and Black Java are now closely related to each other, suggesting loss of genetic identity after export to the United States. In addition, the results indicate that the red junglefowl used as reference genome is more closely related to domestic chickens and apparently different from other wild red junglefowls.


This study illuminates the genetic and phylogenetic relationships among these species. It provides a framework for genetic studies in wild junglefowls and native and domestic chicken breeds.


After a long history of domestication and breeding dating back to 6,000 BC, more than 2,500 breeds of chicken are now raised worldwide as sources of eggs or meat and as pets [1]. The present domestic chicken has descended from the junglefowl, which belongs to the genus Gallus, order Galliformes. There are four living species, namely the red junglefowl (RJF) Gallus gallus, the green junglefowl (GJF) G. varius, the grey junglefowl G. sonneratii, and the Ceylon junglefowl G. lafayettii, which are found in India, Sri Lanka, Southeast Asia, and Indonesia. Of these, RJF is widely believed to be the primary ancestor of the present domestic chicken Gallus gallus domesticus [25]. Indeed, comparison with mitochondrial DNA from ancient bone samples revealed that domestication occurred at least 7,400 years ago from a common ancestral red junglefowl [6]. However, genetic contributions from the other three junglefowls cannot be totally excluded [3, 4, 711], and several reports suggest that the green [11] and grey [7] junglefowl may have bred with domestic chicken to produce hybrid birds. Unfortunately, there is insufficient data at present to resolve this interesting question, whereas studies in other species indicate that recruitment of wild animals into domesticated herds persisted over a long period after initial domestication [12].

Data from whole-genome sequencing significantly advanced the understanding of genetic diversity in chicken. Such data have been generated for RJF [13], in addition to single nucleotide polymorphisms (SNPs) in Chinese Silkie [14], broiler, and layer lines [15]. However, the RJF reference sequence may not represent pure, wild-type RJF [13], and the authors have acknowledged the possibility that the sequence is of a chicken interbred with domestic breeds. Therefore, it is necessary to obtain whole-genome sequencing data from a wild RJF, as well as from other junglefowl species.

Wild RJF is found on Sumatra, Java, and Madura islands, whereas wild GJF is found in Java, Madura, Bali, Lombok, Sumbawa, Flores, and Alor Islands in Indonesia [16]. A free-ranging variety of indigenous chicken, which is distinct from commercial breeds, is also found. At least 28 breeds of native chicken, totaling 290 million birds, are reared in Indonesia [17]. Of these, the Sumatera (English version is Sumatra) and Kedu Hitam, thought to have been bred for thousand years, are two of the oldest varieties and are used primarily for cockfighting and recently for ornamental purposes and as sources of eggs [1820] (Fig. 1). These varieties are adapted to roughage diet and to the hot and humid climate. The Sumatera and Kedu Hitam chicken were brought to India, Europe, and America in the eighteenth century [2125]. The Sumatera was admitted to the American Poultry Association Standard of Perfection in 1883 and named Black Sumatra (BS) in 1906 [26]. Kedu Hitam was crossed with other unknown breeds to develop Black Java (BJ), which was admitted to the American Standard in 1910 [26, 27].

Fig. 1
figure 1

Chicken populations compared by whole-genome sequencing. a-b: American Black Sumatra (a) and Black Java (c). c: Sumatera chickens from three different geographical regions including Riau, north, and west Sumatra, Indonesia. d: Red junglefowl from west and south Sumatra, Indonesia. e: Kedu Hitam chickens from central Java, Indonesia. f: Red junglefowl from Java, Indonesia. g: Green junglefowl from Madura and east Java, Indonesia. Photos by Jeannette Beranger (a, b), Marka Hidayat (c), M. Fatchur Rohim (d), Maria Ulfah (e), Rahman Hidayat (f), and Reza Aulia Ahmadi (g). Map was retrieved from Wikimedia Commons []

Genomic comparison will help clarify the genetic contribution of RJF and GJF to Sumatera and Kedu Hitam, as well as to BS and BJ. Therefore, we compared, by whole-genome sequencing, RJF from Sumatra (RJFs) and Java (RJFj), GJF from Madura (GJFm) and Java (GJFj), the Indonesian Sumatera and Kedu Hitam, and the American BS and BJ (Fig. 1). Wild RJF and GJF from natural forests were used in the present study in order to compare their genomes with the reference chicken genome from RJF. These breeds were also compared with Japanese White Leghorn, White Plymouth Rock, and Rhode Island Red. The data illuminate the genetic and phylogenetic relationships among these species.



The study was conducted in compliance with the Bogor Agricultural University/IPB Animal Care and Use Committee (approved protocol number: 18-2014IPB). For sampling, the authors complied with the Convention on the Trade in Endangered Species of Wild Fauna and Flora and the IUCN Policy Statement on Research Involving Species a Risk of Extinction. Wild junglefowls were collected, and after blood sampling, immediately released back to their natural habitats. GJF was collected from a natural forest in the Tulungagung District, East Java (GJFj: 1 male, 1 female), and from karst areas in the Bangkalan District, Madura Island (GJFm: 2 males, 5 females). RJF was collected from a natural forest in the Madiun District, East Java, and in the Cipanas and Banten Districts, West Java (RJFj: 3 males). RJF was also collected from the Solok District, West Sumatra, and Ogan Komering Ilir District, South Sumatra (RJFs: 1 male, 1 female) (Fig. 1). The characteristics of pure junglefowl were determined based on a field guidebook for bird identification [28]. Samples of Indonesian native chickens, Sumatera (2 males, 3 females) and Kedu Hitam (5 males, 5 females), were obtained from farmers with unknown breeding practices. American BS and BJ were obtained from small-scale breeders. Inbred White Leghorn was provided by the Nagoya University Graduate School of Bioagricultural Science, Avian Bioscience Research Center, through the National Bio-Resource Project of the Ministry of Education, Culture, Sports, Science and Technology, Japan. Whole blood was collected from these birds, and genomic DNA was extracted by a combination of standard phenol/chloroform methods [29] and DNeasy Blood Kit (Qiagen, Valencia, CA, USA).

Library preparation and DNA sequencing

Samples of genomic DNA (1μg) were fragmented to a median fragment size of 200 bp using a Covaris S2 Sample Preparation System (Covaris Inc., Woburn, MA, USA) and used to construct DNA libraries using NEBNext DNA Library Prep Reagent Set for Illumina (New England Biolabs, Ipswich, MA, USA). Library quality and quantity were evaluated using a 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA) and a KAPA Library Quantification Kit (Kapabiosystems, Inc., Woburn, MA, USA). The libraries were used to generate clusters on an Illumina cBOT using a TrueSeq PE Cluster Kit v3-cBOT-HS (Illumina Inc., San Diego, CA, USA) and were sequenced using Hiseq 2500 (Illumina) with a TruSeq SBS Kit-HS (200 cycles; Illumina) as 100 bp paired-end reads. Sequence data are deposited in the DNA Data Bank Japan Sequence Read Archive (Accession No. DRA003951).

Sequence data from each bird were then mapped separately and compared with published data from domestic chickens White Plymouth Rock (Accession No. SRS524493) and Rhode Island Red (SRS524486).

Mapping and SNP calling

Raw sequence reads from each bird were mapped separately to the galGal4 reference genome established in November 2011 using BWA program ver. 0.7.5a-r405 with default settings [30]. Read depth and coverage were then estimated based on the results. SNPs in uniquely mapped reads were identified in each bird by Samtools ver. 0.1.19-44428cd [31] according to criteria defined previously [15], with slight modifications in which SNPs were called at sites with read depth equal to or higher than the average and with 100 % polymorphic call rate. The SNPs were annotated using Refseq datasets. Insertions (sites that existed in the sequenced sample but not in the reference genome) and deletions (sites that existed in the reference genome but not in the sequenced sample) were identified in the same manner as SNPs.

Finally, gene ontology (GO) terms associated with genes containing non-synonymous SNPs, which were common in each breed, were extracted and summarized using Agbase [32, 33]. Enrichment analysis was performed using singular enrichment analysis as implemented by Agrigo [34]. Fisher’s exact test was used for statistical test. GO term of FDR < 0.05 was assumed significant [35].

Phylogenetic analysis

Phylogenetic analysis was performed using SNP matrices for each individual. Prior to analysis, SNPs with sequence read depth below average were filtered out against the reference genome by an in-house perl script. SNPs with a minimum allele frequency of less than 5 % were also removed by Tassel ver. 5 [36]. The final SNP matrix of 14,554,492 sites, which included only variable sites, was used for further analysis. A pairwise genetic distance matrix between individuals was calculated based on the modified Euclidean distance [37], which was defined as D = 1-identity by state (IBS) similarity, where IBS is the probability that alleles derived at random from two individuals at identical loci are the same. For any two individuals, the probability of IBS was averaged using Tassel. An unrooted neighbor-joining (NJ) tree was constructed and robustness of the tree topology was assessed using 100 bootstrap replicates in PHYLIP ver. 3.695 [38].

Phylogenetic analysis was also performed using mitochondrial genome sequences. Sequences were first reconstructed by replacing corresponding sites in the reference sequence with SNPs found in other species. Mitochondrial genome sequence from the reference genome was not used in this analysis because the sample used for the reference genome sequence was different from that used for mitochondrial genome sequence. To construct phylogenetic trees, genes from mitochondrial genomes, excluding ND6 gene, were aligned separately using Clustal W implemented in the software package MEGA6 [39]. Gene alignments were then concatenated into protein-coding, tRNA, and rRNA genes. The protein-coding gene dataset was further partitioned into the 1st, 2nd, and 3rd position of the codons, and three datasets were constructed. The first dataset consisted of tRNA, rRNA, and the 1st, 2nd, and 3rd codons of protein-coding genes (123tr), while the second dataset consisted of tRNA, rRNA, and the 1st and 2nd positions of protein-coding genes (12tr). The final dataset contained tRNA, rRNA, the 1st and 2nd position, and RY coding for the 3rd position (123RYtr). Datasets were analyzed by partitioned Bayesian and partitioned maximum likelihood using MrBayes ver. 3.2.5 [40] and RAxML version 7.7.1 [41], respectively, with default settings.

Results and discussion

Sequencing and mapping

Various breeds of domestic chicken and wild junglefowl were analyzed by whole-genome sequencing using 2–10 birds per breed; > 15 Gb were obtained for each breed (Table 1). In all individuals except one GJFj bird, > 70 % of sequence reads were uniquely mapped to the reference genome (Additional file 1: Figure S1). About 5 % were mapped to multiple chromosomal locations, and up to 20 % were not mapped (Additional file 1: Figure S1). Coverage depth was averaged among individuals in each breed except in Rhode Island Red and White Plymouth Rock (in these breeds, the coverage depth was obtained from one sample). The average read depth was 15–80 in each breed (4–12 in each individual). In each breed except Rhode Island Red, > 90 % of the genome was covered by at least five sequence reads (Additional file 2: Figure S2). Sequence data from Rhode Island Red did not cover the whole genome, probably because of insufficient data in the published sequence. Chromosome 16 had relatively low coverage depth in all breeds, probably because the reference sequence was incomplete or mapping was difficult due to repetitive content in this chromosome, as previously noted [42]. Notably, GJFj and White Plymouth Rock had relatively sparse coverage at depth 10. Finally, coverage of chromosome W varied depending on the sex of the samples.

Table 1 Summary of sequenced data and average genetic distance in each breed

SNP analysis

The average number of SNPs in each breed was calculated and compared among the breeds (Fig. 2). GJFm (n = 699 × 104) and GJFj (n = 604 × 104) had more SNPs than did the others (n = 68–258 × 104), suggesting that these junglefowls were more distant to the reference RJF (Fig. 2). Unexpectedly, wild RJFj (n = 258 × 104) had many more SNPs than domestic chickens, even though the reference sequence was a RJF, whereas wild RJFs had the similar number of SNPs to domestic chicken (Rhode Island Red, White Leghorn, and White Plymouth Rock). The domestic chickens Rhode Island Red (n = 68 × 104) and White Leghorn (n = 86 × 104) had the lowest number of SNPs against the reference (Fig. 2). White Plymouth Rock, RJFs, Kedu Hitam, and BJ possessed a similar number of SNPs (n = 136–156 × 104), whereas Sumatera (n = 182 × 104) and BS (n = 222 × 104) had relatively more (Fig. 2). Similar results were obtained based on insertions and deletions (indels), although these variations were very few in White Leghorn (Fig. 2).

Fig. 2
figure 2

Histogram of SNPs (blue), insertions (yellow), deletions (green), and total (red). The number of SNPs and indels were averaged from individuals in each breed, except for Rhode Island Red and White Plymouth Rock. RedJunglefowlS, red junglefowl from Sumatra; RedJunglefowlJ, red junglefowl from Java; GreenJunglefowlM, green junglefowl from Madura; GreenJunglefowlJ, green junglefowl from Java

Pairwise genetic distance between the breeds was estimated using the SNP data (Table 1, Additional file 3: Figure S3). The highest genetic distance was observed between GJFm and GJFj (Table 1), indicating high genetic variation among individuals from the GJF breeds compared to other varieties.

The average numbers of SNPs and indels in each breed were similarly distributed in the genome (Additional file 4: Figure S4). Approximately 79–80 % were intergenic, while 16–18 % were genic; most of the genic variations were in introns. However, the number of non-synonymous and synonymous variations in exons was different between SNPs and indels. In particular, thousands of SNPs were present in exons, of which 579–7,564 were found to cause non-synonymous amino acid changes. In contrast, several tens of indels at the most caused amino acid gain or loss (Additional file 4: Figure S4).

To detect additional breed specific variations, we performed GO analysis focusing on genes containing non-synonymous SNP, which is common within each breed. Kedu Hitam was excluded from further analysis because of a too small number of genes containing non-synonymous SNPs (6 genes; Additional file 5: Table S1). GO terms associated with the non-synonymous SNP containing genes were obtained and summarized via GO slim terms (Additional file 6: Figure S5). Several differences in the percentage of GO slim terms to total GO slim annotation were found in Sumatera and BS breeds (Additional file 6: Figure S5), albeit these differences were not statistically significant.

Furthermore, GO enrichment analysis against the chicken reference genome demonstrated that RJFj and GJFj showed two GO terms, which were significantly enriched in each breed (Additional file 5: Table S1). Further functional analysis is necessary to examine the molecular mechanisms that are represented by these significantly enriched GO terms. Nevertheless, the significant enrichment of GO terms reflects the distant relationship between these breeds and the reference genome.

Phylogenetic analysis

Relationships among the individuals were further investigated by phylogenetic analysis of 14,554,492 SNPs, and NJ tree was constructed with 100 bootstrap iterations (Fig. 3). Most of internal branches received 100 % bootstrap support, except for several internal clades that were part of a cluster containing Kedu Hitam and Sumatera.

Fig. 3
figure 3

Neighbor-joining tree based on 14,554,492 SNPs. Bootstrap values from 100 replicates are shown on internal branches. Abbreviations are defined in Fig. 2. Numbers after the name of a breed are the sample numbers. Colors on labels are colored by breeds. Red junglefowl are in red, green junglefowl are in green, Black Sumatra in blue, Black Java in purple, Sumatera in pink, Kedu Hitam in yellow, and Rhode Island Red, White Plymouth Rock, and White Leghorn are in gray

GJFm and GJFj formed a cluster, showing that GJF is phylogenetically distinct, with negligible genetic contribution to other varieties. In addition, GJFm birds had long branches more than other species did, indicating high genetic variation within the species. In contrast, wild RJF from Sumatra (RJFs) and Java (RJFj), as well as the reference RJF, did not form a monophyletic group, indicating diverging genetic backgrounds. RJFj formed a monophyletic group that was sister to the GJF group, which in turn formed a monophyletic group with RJFs. Surprisingly, the reference RJF formed a monophyletic group with domestic chickens (Rhode Island Red, White Plymouth Rock, and White Leghorn), which relationship supports the hypothesis that the chicken used for reference genome sequence had interbred with domestic breeds [13]. The separation of RJFj, RJFs, and the reference RJF suggests that these breeds have differentiated by founder effects, genetic drift, or incomplete lineage sorting as their ancestral species separated on different islands. Further analysis using RJF from other islands and countries will clarify the detailed genetic structure of RJF populations in Indonesia and neighbor countries and help elucidate the detailed scenario for domestication of chicken. The monophyly of three birds of White Leghorn, which were from the inbred closed colony, was confirmed. One Kedu Hitam bird formed a monophyletic group with Sumatera chicken, whereas the remaining Kedu Hitam birds formed a separate but neighboring clade. Monophyly of Kedu Hitam and Sumatera indicated close relationship among the Indonesian native chicken breeds. BS and BJ also formed separate groups but shared a sister-group relationship, suggesting a close relationship between the two American breeds. However, these two breeds did not show a close relationship with other domestic chicken breeds. Finally, the analysis did not support a distinct relationship between Sumatera and BS and between Kedu Hitam and BJ.

As the analysis of all SNPs in this study did not resolve the problems of incomplete lineage sorting and the anomaly zone, the resultant phylogeny, particularly the short internodes, might lead to different conclusions. Future multi-species coalescent studies would refine the phylogeny of these species based on the SNP matrix.

Phylogenetic relationships were also reconstructed using mitochondrial genomes, resulting in six phylogenies based on three datasets and two methods. The topology reconstructed by Bayes method from a dataset containing rRNA, tRNA, and 1st, 2nd, and 3rd codon of the protein-coding genes (123tr) is shown in Fig. 4. This topology seems reasonable, as it reflects the monophyly of three birds of White Leghorn, which were from a closed colony. However, the support values in the mitochondrial tree are generally low for both the posterior probabilities of the Bayesian analysis and the bootstrap values of the maximum-likelihood analysis (Fig. 4). GJFm and GJFj formed a monophyletic group that was a sister clade to RJFj. RJF birds did not form a monophyletic group, suggesting diverse genetic origins, as we have noted in the analysis of SNPs. Similarly, Sumatera and Kedu Hitam groups were not closely related to BS and BJ, respectively. In particular, the American BS and BJ formed a monophyletic, poorly supported group with White Plymouth Rock and other domestic chicken breeds (Rhode Island Red and White Leghorn), implying that BS and BJ have been crossed with other chickens in the United States.

Fig. 4
figure 4

Bayesian consensus tree based on mitochondrial genome. Numbers beside internal branches indicate bootstrap values >50 % from 100 replicates (left) and Bayesian posterior probabilities (right). (-) indicates a node not recovered in the maximum-likelihood analysis or bootstrap values <50 %. Abbreviations are defined in Fig. 2. Numbers after the name breed are the sample numbers. Colors on labels are defined in Fig. 3

The difference between the topology inferred from the SNPs and that inferred from the mitochondrial genome sequences reflects the different amount of sequence data or different inheritance pattern. Nevertheless, the poor support in the mitochondrial genome phylogeny questions whether the inferred topology reflects the underlying biology.


In this study, we evaluated the genetic contribution of RJF and GJF to Indonesian (Sumatera and Kedu Hitam), American (BS and BJ), and domestic chicken (Rhode Island Red, White Plymouth Rock, and White Leghorn) breeds. The results show that genetic identity is conserved in GJF, and that this species has made little genetic contribution to the domestic chicken (Figs. 3 and 4). In contrast, RJF breeds are genetically heterogeneous, perhaps reflecting the original genetic diversity of this species or a history of cross breeding between wild and domestic species. In particular, the Indonesian Sumatera and American BS and likewise Kedu Hitam and BJ do not cluster together as breed history would have suggested, presumably because of crossbreeding in the United States.

Thus this study illuminates the genetic and phylogenetic relationships among wild junglefowls and native and domestic chicken breeds and provides a framework for genetic studies in these species. The results suggest that conserved morphological similarity does not necessarily reflect the conserved genetic background, demonstrating the importance of and difficulty in conserving genetic diversity of wild and indigenous chicken species. Therefore, the whole genome sequencing provides a great tool for addressing these challenges.


Ethical approval was obtained from Animal Care and Used Committee of the Bogor Agricultural University/IPB (protocol number: 18-2014IPB) and research procedures to animal were complied with the Convention on the Trade in Endangered Species of Wild Fauna and Flora and the IUCN Policy Statement on Research Involving Species at Risk of Extinction.

Consent to publish

Not applicable.

Availability of data

Sequence data were deposited in the DNA Data Bank Japan Sequence Read Archive (Accession No. DRA003951).



Black Java


Black Sumatra


green junglefowl


green junglefowl from Java


green junglefowl from Madura


gene ontology


insertions and deletions


ministry of education, culture, sports, science and technology


red junglefowl


red junglefowl from Java


red junglefowl from Sumatra


single nucleotide polymorphisms


  1. The Domestic Animal Diversity Information System []

  2. Fumihito A, Miyake T, Sumi SI, Takada M, Ohno S, Kondo N. One subspecies of the red junglefowl (Gallus-Gallus Gallus) suffices as the matriarchic ancestor of all domestic breeds. Proc Natl Acad Sci U S A. 1994;91(26):12505–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Fumihito A, Miyake T, Takada M, Shingu R, Endo T, Gojobori T, Kondo N, Ohno S. Monophyletic origin and unique dispersal patterns of domestic fowls. Proc Natl Acad Sci U S A. 1996;93(13):6792–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Liu YP, Wu GS, Yao YG, Miao YW, Luikart G, Baig M, Beja-Pereira A, Ding ZL, Palanichamy MG, Zhang YP. Multiple maternal origins of chickens: out of the Asian jungles. Mol Phylogenet Evol. 2006;38(1):12–9.

    Article  CAS  PubMed  Google Scholar 

  5. Oka T, Ino Y, Nomura K, Kawashima S, Kuwayama T, Hanada H, Amano T, Takada M, Takahata N, Hayashi Y, et al. Analysis of mtDNA sequences shows Japanese native chickens have multiple origins. Anim Genet. 2007;38(3):287–93.

    Article  CAS  PubMed  Google Scholar 

  6. Storey AA, Athens JS, Bryant D, Carson M, Emery K, DeFrance S, Higham C, Huynen L, Intoh M, Jones S, et al. Investigating the global dispersal of chickens in prehistory using ancient mitochondrial DNA signatures. PLoS One. 2012;7:e39171.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Eriksson J, Larson G, Gunnarsson U, Bed'hom B, Tixier-Boichard M, Stromstedt L, Wright D, Jungerius A, Vereijken A, Randi E, et al. Identification of the yellow skin gene reveals a hybrid origin of the domestic chicken. PLoS Genet. 2008;4(2), e1000010.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Sulandari S, Zein MSA, Sartika T. Molecular characterization of Indonesian indigenous chickens based on mitochondrial DNA displacement (D)-loop sequences. HAYATI Journal of Biosciences. 2008;15:4.

    Google Scholar 

  9. Sawai H, Kim HL, Kuno K, Suzuki S, Gotoh H, Takada M, Takahata N, Satta Y, Akishinonomiya F. The origin and genetic variation of domestic chickens with special reference to junglefowls Gallus g. gallus and G. varius. PLoS One. 2010;5:e10639.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Pramual P, Meeyen K, Wongpakam K, Klinhom U. Genetic diversity of Thai native chicken inferred from mitochondrial DNA sequences. Tropical Natural History. 2013;13(2):97–106.

    Google Scholar 

  11. Miao YW, Peng MS, Wu GS, Ouyang YN, Yang ZY, Yu N, Liang JP, Pianchou G, Beja-Pereira A, Mitra B, et al. Chicken domestication: an updated perspective based on mitochondrial genomes. Heredity (Edinb). 2013;110(3):277–82.

    Article  CAS  Google Scholar 

  12. Marshall FB, Dobney K, Denham T, Capriles JM. Evaluating the roles of directed breeding and gene flow in animal domestication. Proc Natl Acad Sci. 2014;111(17):6153–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Hillier LW, Miller W, Birney E, Warren W, Hardison RC, Ponting CP, Bork P, Burt DW, Groenen MAM, Delany ME, et al. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004;432(7018):695–716.

    Article  CAS  Google Scholar 

  14. Wong GK, Liu B, Wang J, Zhang Y, Yang X, Zhang Z, Meng Q, Zhou J, Li D, Zhang J, et al. A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms. Nature. 2004;432(7018):717–22.

    Article  CAS  PubMed  Google Scholar 

  15. Rubin CJ, Zody MC, Eriksson J, Meadows JR, Sherwood E, Webster MT, Jiang L, Ingman M, Sharpe T, Ka S, et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010;464(7288):587–91.

    Article  CAS  PubMed  Google Scholar 

  16. BirdLife International []

  17. Livestocks Population, 2000 - 2014 []

  18. Sastroamijoyo S. Ilmu Beternak Ayam, vol. 2. Bandung, Indonesia: N.V. Masa Baru; 1958.

    Google Scholar 

  19. Nataamijaya AG. The native chicken of Indonesia. Buletin Plasma Nutfah. 2000;6(1):1–6.

    Google Scholar 

  20. Sartika T, Iskandar S. Mengenal Plasma Nutfah Ayam Indonesia Dan Pemanfaatannya. In: Kelompok Peternak Ayam Kampung Sukabumi (KEPRAKS). 2nd ed. 2007.

    Google Scholar 

  21. Bennet JC. The Poultry Book: A Treatise on Breeding and General Management of Domestic Fowls with Enourmous Original Descriptions, and Portraits from Life. Boston: Phillips, Sampson; 1856.

    Book  Google Scholar 

  22. Cooper JW. Their Origin and History, with A Description of The Breeds, Strains, and Crosses. Pensylvania: Kessinger Publishing; 1869.

    Google Scholar 

  23. Finsterbusch CA. Cock Fighting All Over the World. New York: Grit and Steel; 1929.

    Google Scholar 

  24. Ekarius C. Storey’s Illustrated Guide to Poultry Breeds. Pownal: Storey Publishing; 2007.

    Google Scholar 

  25. Roberts V. British Poultry Standard. 6th ed. Oxford: Wiley-Blackwell; 2008.

    Google Scholar 

  26. American Poultry Association. The American standard of perfection, illustrated. A complete description of all recognized varieties of fowls. Boston, Mass: American Poultry Association; 1910.

    Google Scholar 

  27. American Poultry Association. The American standard of perfection, illustrated. A complete description of all recognized varieties of fowls. Guthrie, Okla: American Poultry Association; 1956.

    Google Scholar 

  28. MacKinnon J, Phillipps K, Balen BV. Burung-burung di Sumatera, Jawa, Bali dan Kalimantan. Burung Indonesia: Termasuk Sabah, Serawak dan Brunei Darussalam; 2000.

    Google Scholar 

  29. Maniatis T, Fritsch EF, Sambrook J. Molecular cloning: A Laboratory Manual. New York: Cold Spring Harbor Laboratory; 1982.

    Google Scholar 

  30. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Subgroup GPDP. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.

    Article  PubMed  PubMed Central  Google Scholar 

  32. McCarthy FM, Wang N, Magee GB, Nanduri B, Lawrence ML, Camon EB, Barrell DG, Hill DP, Dolan ME, Williams WP, et al. AgBase: a functional genomics resource for agriculture. BMC Genomics. 2006;7:229.

    Article  PubMed  PubMed Central  Google Scholar 

  33. McCarthy FM, Gresham CR, Buza TJ, Chouvarine P, Pillai LR, Kumar R, Ozkan S, Wang H, Manda P, Arick T, et al. AgBase: supporting functional modeling in agricultural organisms. Nucleic Acids Res. 2011;39(Database issue):D497–506.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Du Z, Zhou X, Ling Y, Zhang Z, Su Z. agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010;38:W64–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Yekutieli D, Benjamini Y. Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics. Journal of Statistical Planning and Inference. 1999;82(1–2):171–96.

    Article  Google Scholar 

  36. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):2633–5.

    Article  CAS  PubMed  Google Scholar 

  37. Legesse BW, Myburg AA, Pixley KV, Botha AM. Genetic diversity of African maize inbred lines revealed by SSR markers. Hereditas. 2007;144(1):10–7.

    Article  CAS  PubMed  Google Scholar 

  38. Felsenstein J. PHYLIP—Phylogeny inference package (Version 3.2). Cladistics. 1989;5(2):164–6.

    Google Scholar 

  39. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–4.

    Article  CAS  PubMed  Google Scholar 

  41. Stamatakis A, Hoover P, Rougemont J. A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol. 2008;57(5):758–71.

    Article  PubMed  Google Scholar 

  42. Solinhac R, Leroux S, Galkina S, Chazara O, Feve K, Vignoles F, Morisson M, Derjusheva S, Bed’hom B, Vignal A, et al. Integrative mapping analysis of chicken microchromosome 16 organization. BMC Genomics. 2010;11(1):616.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We thank Taichiro Ishige, Rumi Ohtake, and Eri Kubota for help in preparing the samples, and Satoshi Sano and Naoko Kosuge for technical support. We also thank the Nagoya University Graduate School of Bioagricultural Science Avian Bioscience Research Center for providing White Leghorn chicken through the National Bio-Resource Project of the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan.


This study was supported by Grants-in-Aid for Scientific Research from MEXT (S1311017).

Author information

Authors and Affiliations


Corresponding author

Correspondence to Tomohiro Kono.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MU helped conceive and design the study, prepared the samples, performed the whole-genome sequencing, and drafted the manuscript. RKM performed the whole-genome sequencing, sequence analysis, and phylogenetic analysis, and drafted the manuscript. AF, MM, BD, and AM helped conceive and design the study, prepared the samples, and supervised the research. TK helped conceive and design the study, drafted the manuscript, and supervised the research. All authors contributed to and approved the final version of the manuscript.

Additional files

Additional file 1: Figure S1.

Mapping. Data are percentage of sequence reads mapped to unique regions in the reference genome (blue), to multiple regions (red), and not mapped (green). Abbreviations are defined in Figure 2. Numbers after the breed name are sample numbers. (PDF 89 kb)

Additional file 2: Figure S2.

Genome coverage averaged in each breed. Data are percentage of regions covered by ≥ 1 (red), ≥ 3 (light blue), ≥ 5 (blue), and ≥ 10 (green) sequence reads in each chromosome. Abbreviations are defined in Fig. 2. (PDF 106 kb)

Additional file 3: Figure S3.

Genetic distance (K2P, upper triangle; 1-IBS, lower triangle) in each breed. Abbreviations are defined in Fig. 2. (PDF 538 kb)

Additional file 4: Figure S4.

Distribution of SNPs (blue), deletions (green), and insertions (yellow) in the genome. The number in each region is averaged in each breed. Upstream and downstream variations are within 5 kb of genes. Abbreviations are defined in Fig. 2. (PDF 210 kb)

Additional file 5: Table S1.

Summary of non-synonymous SNP containing genes fixed in each breeds and associated GO terms. (PDF 48 kb)

Additional file 6: Figure S5.

Distribution of gene ontology (GO) slim terms associated with non-synonymous single nucleotide polymorphism (SNP) containing genes, which are common in each breed. GO terms were categorized by biological process, cellular component, and molecular function, as indicated. Abbreviations are defined in Fig. 2. (PDF 89 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ulfah, M., Kawahara-Miki, R., Farajalllah, A. et al. Genetic features of red and green junglefowls and relationship with Indonesian native chickens Sumatera and Kedu Hitam . BMC Genomics 17, 320 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: