Genomic resources for the endangered Hawaiian honeycreepers
© Callicrate et al.; licensee BioMed Central. 2014
Received: 11 August 2014
Accepted: 8 December 2014
Published: 12 December 2014
The Hawaiian honeycreepers are an avian adaptive radiation containing many endangered and extinct species. They display a dramatic range of phenotypic variation and are a model system for studies of evolution, conservation, disease dynamics and population genetics. Development of a genome-scale resources for this group would augment the quality of research focusing on Hawaiian honeycreepers and facilitate comparative avian genomic research.
We assembled the genome sequence of a Hawaii amakihi (Hemignathus virens),and identified ~3.9 million single nucleotide polymorphisms (SNPs) in the genome. Using the amakihi genome as a reference, we also identified ~156,000 SNPs in RAD tag (restriction site associated DNA) sequencing of five honeycreeper species (palila [Loxioides bailleui], Nihoa finch [Telespiza ultima], iiwi [Vestiaria coccinea], apapane [Himatione sanguinea], and amakihi). SNPs are distributed throughout the amakihi genome, and the individual sequenced shows several large regions of low heterozygosity on chromosomes 1, 5, 6, 8 and 11. SNPs from RAD tag sequencing were also found throughout the genome but were found to be more densely located on microchromosomes, apparently a result of differential distribution of the particular site recognized by restriction enzyme BseXI.
The amakihi genome sequence will be useful for comparative avian genomics research and provides a significant resource for studies in such areas as disease ecology, evolution, and conservation genetics. The genome sequences will enable mapping of transcriptome data for honeycreepers and comparison of gene sequences between avian taxa. Researchers will be able to use the large number of SNP markers to genotype honeycreepers in regions of interest or across the whole genome. There are enough markers to enable use of methods such as genome-wide association studies (GWAS) that will allow researchers to make connections between phenotypic diversity of honeycreepers and specific genetic variants. Genome-wide markers will also help resolve phylogenetic and population genetic questions in honeycreepers.
Avian genome sequences were first obtained for well-studied model systems for which there was a long history of multidisciplinary research, namely the chicken Gallus gallus and zebra finch Taeniopygia guttata. But now genomes are starting to appear along lines of interest such as other agricultural species (turkey, Meleagris gallopavo), members of adaptive radiations (Darwin’s medium ground finch, Geospiza magnirostris), species with traits of interest such as vocal learning (budgerigar, Melopsittacus undulatus) and systems with possible incipient speciation (Ficedula flycatchers ). Genome-scale resources for non-traditional model organisms have become a reality over a short period of time, due in a large part to the commercialization of sequencing-by-synthesis (also called next-generation sequencing) technology . Initial examinations of these genomes have revealed that there is a high degree of synteny among avian species, confirming hypotheses from cytogenetic studies . Although 40 million years of evolution separate chickens and turkeys, only 30 minor chromosome rearrangements were detected between the two and their karyotypes are strikingly similar . Chicken and zebra finch (perhaps 100 million years diverged ) also exhibit a high degree of synteny and conservation of karyotype . However, recent work shows that small inversions may be common when comparing distantly-related avian taxa .
There are over 5,000 passerine species with many unique traits and adaptations . Each additional passerine genome [2, 4, 6] that is sequenced offers an opportunity to identify different genes under selection and to elucidate the mechanisms underlying avian adaptations . The Hawaiian honeycreepers are an endemic Hawaiian passerine adaptive radiation in the Cardueline finch subfamily Drepanidinae , and display a tremendous diversity of plumages, beak shapes (some unique to this radiation) and niches . Molecular analyses indicate that the radiation is sister to the Eurasian Carpodacus rosefinches, and dates to about 5.7 million years ago [12, 14]. Adaptive radiations have long been recognized for their value as evolutionary case studies and their usefulness in understanding adaptive evolutionary processes. The Hawaiian honeycreepers have the special characteristic that the history of their radiation is integrated with the geological history of the Hawaiian Islands. Patterns in honeycreeper divergence appear to be linked to the pattern of island emergence , which has been well-documented as part of a volcanic time series . Because this unusual geology provides a well-defined timeline, honeycreepers are a good system for estimation of rates of molecular evolution .
Unfortunately, of the 33 described historical honeycreeper species (plus over 17 species known only from subfossil material) , roughly two-thirds are now extinct, largely from human-related impacts such as habitat loss, introduced mammalian predators and vectored pathogens . Study of the evolution of disease resistance is an area that will especially benefit from genome-wide markers. In particular, honeycreepers appear extremely susceptible to introduced diseases such as avian malaria (Plasmodium relictum) and avian poxvirus, both vectored by an introduced Culex mosquito [17–19]. Most extant honeycreepers are limited to higher elevations free from mosquitoes and disease . However, a few species, most notably the Hawaii amakihi (Hemignathus virens), can survive with chronic malaria infection, exhibiting tolerance or resistance to the disease [21–23]. A few studies suggest that strong selective pressure from malaria resulted in rapid evolution of disease tolerance in certain low-elevation Hawaii amakihi populations and that resistance may be spreading amongst low-elevation amakihi, although it is unknown whether resistance arose once or simultaneously in multiple source populations . Understanding the source and mechanism of disease resistance in amakihi is a priority research area using the SNP markers. Such work is needed to improve our strategies for identifying and preserving the most viable populations of many species threatened by invasive pathogens.
Our objective in this study is to characterize the genome of a Hawaiian honeycreeper, the Hawaii amakihi (Hemignathus virens), and to develop and assess a set of genome-wide SNP markers to enable both phylogenetics-scale and fine-scale investigations about adaptive evolution and population genetics. We used two sequencing-by-synthesis approaches and then performed a hybrid assembly to create a draft Hawaii amakihi genome sequence. The Hawaii amakihi, in addition to being a member of the honeycreeper adaptive radiation, serves as an ecological model for disease transmission due to its variable responses to infection by avian malaria [21, 22]. The individual selected for the genome sequence had a high level of infection, but had been recaptured several times, indicating persistence despite a chronic, intense malaria infection. To increase the utility of markers for broader topics of study, we combined de-novo genome sequencing with a reduced representation sequencing method (restriction site-associated DNA, or RAD) to identify and map SNP polymorphisms isolated from four additional honeycreeper species. In addition to facilitating research into honeycreeper evolution and disease resistance, the draft amakihi genome will contribute to knowledge of avian genome biology and improve the pool of resources for comparative genomic study.
Results and discussion
Summary of input for genome assembly
2 × 151
3.93 × 106
2 × 101
86.97 × 106
3.64 × 106
The hybrid assembly used the full 2x 454 coverage and ~19x Illumina coverage (see Table 1), similar to the process for turkey which used ~5x 454 and ~25x Illumina GAII . We used only a portion of the total Illumina data to avoid overwhelming the information from the 454 reads; limiting the data volume was also necessary to stay within the memory limits of the computer used (512 GB RAM). Contigs were ordered and oriented and extended into scaffolds by aligning to the zebra finch genome sequence. In this way, amakihi genotypes at each zebra finch genomic position were determined. Genotype calls were generated using only high-quality (Phred-like Q20 or above) bases in the mapped reads. An MPG  score cutoff of ≥ 10 is expected to yield high-quality genotypes with >99.84% concordance with those from an Illumina Infinium genotyping assay .
The structure of avian genomes in general appears to be relatively undisturbed with regard to rearrangements, resulting in high degree of synteny among a variety of bird species . This property has been observed when comparing turkey  and Ficedula flycatcher to chicken . Our use of zebra finch as a template for aligning and assembling the amakihi genome is justified, in part, by the relatively recent divergence (33.5 million years) of the species . In fact, the Ficedula albicollis genome shows remarkably strong synteny with chicken despite perhaps 100 million years of evolutionary distance . However, on a more localized scale, Ficedula flycatchers show many small rearrangements with respect to zebra finch . If similar rearrangements have occurred between zebra finch and amakihi, then our assembly could be different from the true amakihi genome sequence.
Alignment statistics for zebra finch and amakihi against amakihi genome
% of zebra finch sites aligned (non-N)
% of amakihi sites aligned (non-N)
Kimura two parameter model distance
Nucleotide diversity by chromosome
Because in birds females are the heterogametic sex (we sequenced a female) chromosome Z should in theory have no heterozygous sites except in pseudo autosomal regions. Our data show about 0.017% of the total sequence sites assigned to Z and Z random are heterozygous (9,906 heterozygous sites on Z and 2,162 on Z random) versus 0.417% for sites on autosomal chromosomes. These false positives on the Z could be attributed to mismapping of paralagous reads or misassignment of autosomal segments to the Z and Z random chromosomes. The false positive rate on Z/Z random is an approximate indicator of the false positive rate elsewhere in the genome because mismapping of paralagous sites could have occurred for autosomal chromosomes as well.
The RAD tag method involves digesting genomic DNA with a restriction enzyme and sequencing fragments (tags) of DNA adjacent to restriction sites . We sequenced RAD tags for six individuals of four honeycreeper species in addition to the same amakihi for which we obtained the genome. This method yielded a wide range of sequences per individual, with an average of 7,596,336 post quality filtering (range: 319,559 – 24,263,032; see Additional File 1). We attribute the large range of number of reads to stochastic factors and variable sample DNA quality, as all other parameters (DNA quantity, library preparation protocol, pooled for sequencing in equimolar ratios) were the same between samples. RAD sequences were analyzed following two protocols: without a reference genome, using the Stacks pipeline, or utilizing the amakihi sequence as a reference for assembly and genotype calling. Raw reads for each individual in FASTQ format have been uploaded to NCBI (BioProject 252695) and will be available after publication of this article.
Stacks results after quality filtering
Number of stacks loci
Number of SNPs
Number of variable loci
Since we had both RAD and genome data for the same individual amakihi, we compared genotype calls from Stacks to known values from the genome sequence. With a minimum stack depth requirement of nine, only 0.8% of Stacks SNP calls differed from the genome value.
RADs with a reference
SNP sites discovered by comparison to the honeycreeper reference. Filtered for qual > 30 and depth >6
Positions with known genotype
Sites with non-reference allele
Private non-reference alleles
Compared to analyzing without a reference, the BWA-GATK pipeline resulted in more SNPs identified for Nihoa finch, fewer for iiwi, about the same for palila, and fewer for amakihi.
Applications of honeycreeper genomic resources
Herein, we describe a draft genome sequence for the Hawaii amakihi and associated genomic resources for Hawaiian honeycreepers including approximately 3.9 million SNPs within the amakihi genome and over 150,000 SNPs within and between amakihi and four other honeycreeper species. Honeycreepers are an important model system for many questions in evolutionary biology, and the SNP markers will facilitate a wide range of future studies in ongoing and new research areas. Being genome-enabled both enhances the resolution of current research methods (for example, fully resolving the honeycreeper phylogeny) and also opens up new analyses that weren’t possible before (such as GWAS for malaria tolerance). Some of the important questions which may be addressed include: how do rates of sequence evolution vary among different classes of DNA; what genes or genome regions are involved in speciation, adaptation or evolution of tolerance or resistance to disease; and how much adaptive potential exists in a population after demographic decline or fragmentation?
Studies of the evolutionary relationships of honeycreepers (for example [41–43]) have been limited by available technology and methods, as well as by rapid speciation and low levels of sequence divergence. Early molecular studies used allozyme electrophoresis [14, 44], restriction fragment length polymorphism of mitochondrial DNA , and relatively short DNA sequences [14, 46, 47] to only marginally resolve nucleotide substitution rates and relationships within the honeycreepers. Larger molecular datasets, such as one with entire mitochondrial genomes and 13 nuclear loci (>15 Kb) more adequately resolved the phylogeny, and estimated rates of sequence evolution and a split from a cardueline finch lineage at 5.7 Mya . Re-evaluating the honeycreeper phylogeny with a larger, more comprehensive dataset will allow researchers to investigate the pattern and tempo of evolution in this radiation. With genome-wide markers, it will be possible to connect genomic regions with specific adaptive traits across the phylogeny. Because precise geological information about the Hawaiian Islands provides a framework for dating evolutionary events, the honeycreeper radiation can provide unique insights into the evolutionary process. What is learned from honeycreepers can also be compared with other avian adaptive radiations such as Darwin’s finches  to further our understanding of the evolutionary process overall.
The ability to use analytical tools that connect genotypes to traits, such as GWAS [48, 49]) is a key benefit of the honeycreeper genomic marker set. These methods require large numbers of markers and were previously only useful for genome-enabled model organisms. Such techniques may allow identification of genes or regions implicated in disease resistance or specific adaptive traits; when such information is combined with results in other taxa, it contributes to our overall understanding of molecular mechanisms. This is also a first step towards investigating what happens to the genetic diversity in adaptively important genes or regions when species decline and become endangered. Identifying key genomic regions for disease resistance or adaptation could help focus conservation efforts towards preserving genetic variation in those areas and provide guidance for genetically-based population management decisions.
Hawaiian honeycreepers are also a model to investigate the response of genetic variation to human caused population decline, fragmentation and founder effects. For example, the Hawaii akepa (Loxops coccineus coccineus) occupies < 10% of its historical range in fragmented habitat and is a magnitude less populous than before its decline, yet contemporary samples show the same level of mitochondrial genetic diversity as in specimens sampled > 100 years ago and no significant differentiation between fragmented populations is detected . In another case, several founder populations of Laysan finch (Telespiza cantans) have been established on Pearl & Hermes reef and microsatellite data reveal that these have become genetically differentiated from the Laysan population and, to some extent, from each other . Finally, Hawaii amakihi, which have a relatively large population size, exhibit a rather unique elevational structuring, with populations from high elevation genetically differentiated from those at low elevation; data from museum skins suggest that this was also true historically. This elevational pattern is not found in contemporary iiwi (Vestiaria coccinea) or apapane (Himatione sanguinea) populations . Using the more comprehensive SNP marker set will provide the power to start looking at selection and adaptation to anthropogenic caused change in these species.
Our results provide a set of genomic resources for Hawaiian honeycreepers that will facilitate research on disease interactions, metapopulation dynamics, adaptive radiations, and genome evolution. The amakihi genome sequence will enable comparative studies of avian genomes and is an important contribution as it represents one of the more than 5,000 passeriform species, a group for which there are currently only three other genomes available in the literature [2, 4, 6]. The results yield a large number of genome wide markers, both from heterozygous sites in the sequenced individual and discovered using RAD tags with other honeycreeper species. We have demonstrated their potential phylogenetic utility based on a tree of relationships between honeycreeper species used in our RAD analysis that matches expectation based on previous molecular phylogenetic analyses . Heterozygosity measures for the individual sequenced, a malaria-resistant amakihi, indicate some regions of potential selective sweeps that could be of interest for study of malaria resistance. These regions are being targeted for resequencing in populations of malaria resistant and susceptible amakihi. The markers could also be used to identify regions of divergence among honeycreeper species to help elucidate the speciation process .
A single female amakihi (Hemignathus virens) was sequenced for genome assembly (USGS aluminum band 1771-10606, sampled 22 February 2002 at Nanawale, Hawaii Island). Although it has been typically preferred to use an inbred individual for genome sequencing to simplify assembly, the possibility of high-coverage sequencing-by-synthesis makes it possible to assemble even with potentially high levels of variation . Indeed, when SNP discovery is a major goal it is typically preferred to use an outbred individual. Seven Hawaiian honeycreeper samples were selected for RAD tag sequencing: one iiwi (Vestiaria coccinea; female RCF 2682, sampled 8 March 1987 at Kokee State Park, Kauai), two palila (Loxioides bailleui; bands 8031-75515 and 8031-75622, sampled in 1993 at Puu Laau, Hawaii Island), one apapane (Himatione sanguinea; 1540-45550 sampled at Waikimoi Preserve, Maui), one Hawaii amakihi (the same individual used for genome assembly), and two Nihoa finches (Telespiza ultima; bands 1381-62204 and 1381-62194 sampled on Nihoa Island, HI). This selection of honeycreepers covers much of the Drepanidine tree, and includes two redbird species (iiwi, apapane), two finchbill species (Nihoa finch, palila) and a greenbird (amakihi). Samples used in this study were obtained under appropriate USFWS and Hawaii DLNR-DOFAW permits, and IACUC approvals. For a recent phylogeny of Hawaiian honeycreepers, see Lerner et al. Current Biology 2011, 21:1838-1844.
Genomic DNA was extracted from whole blood using proteinase K digestion followed by phenol:chloroform extraction and either ethanol precipitation (Nihoa finches one palila) or Amicon® Ultra-4 (Millipore, Billerica, MA) centrifugal dialysis  (amakihi). Alternately, for iiwi, apapane, and the other palila, DNA was extracted using a Qiagen DNeasy Blood and Tissue kit (Qiagen, Germantown, MD). DNA quality and concentration were visualized using agarose gel electrophoresis and quantified using a NanoDrop 1000 spectrophotometer (NanoDrop, Wilmington, DE).
454 Library construction and sequencing
For 454 sequencing, ~10 ug of genomic DNA was fragmented using a HydroShear apparatus from Genomic Solutions Ltd, and 454 library preparation was done following manufacturer recommended protocols using the Titanium Rapid Library Preparation Kit, with insert sizes greater than 1000 bp. The libraries were then processed for shotgun Roche FLX+ sequencing in 4 lanes, to a total of 2.5X coverage. Average read length was 458 bp.
Illumina library construction and sequencing
A total of 5 ug of input DNA was sheared by sonication (Covaris) and size-selected using a Pippin Prep (Sage Science). The fragmented DNA was end-repaired and ligated to Illumina adapters using a SPRI-TE robot and reagents (Beckman Coulter, Inc.). Illumina indexes were then added using 10-cycle PCR reaction performed in duplicate. The amplified library products were pooled and subjected to two rounds of Agencourt AMPure XP (Beckman Coulter, Inc.) bead clean up. The library was run on an Illumina MiSeq (v1 reagents) and two lanes of an Illumina HiSeq2000 (v3 reagents). The insert size of the library was subsequently determined by paired-end read mapping back to the genome assembly to be 392 +/- 29 bp.
RAD tag library construction and sequencing
For the samples involved in RAD tag development, DNA samples were prepared for RAD tag sequencing generally following the protocol of Baird et al. (2008) , with modifications. These included the use of directional TruSeq-style adapters with 10 bp unique indices, and selecting a restriction enzyme with indeterminate bases at the cut site to accommodate requirements of Illumina HiSeq chemistry . Briefly, 2 ug of genomic DNA for each sample was digested with the BseXI enzyme, ligated to an adapter with a unique 10 bp index sequence, and sheared to approximately 300 – 500 bp fragments. A second adapter also containing the index sequence was ligated to the other end of the sheared fragments. Adapters were designed so that only fragments with adapters ligated to both ends would amplify. Each library was amplified using Phusion master mix (New England Biolabs, Ipswich, MA) for 15 – 18 cycles of PCR. Magnetic beads (Sera-Mag Speed Beads, Thermo Fisher Scientific, Waltham, MA) were used to purify libraries after amplification and filter out small fragments. Libraries were assessed for correct size and concentration using an Agilent BioAnalyzer. Samples were pooled in equimolar ratios and sequenced on an Illumina HiSeq with 100 bp paired-end reads (amakihi, iiwi, apapane and one palila) or MiSeq with 150 bp paired-end reads (both Nihoa finches and one palila). Paired-end sequencing generates two reads for each fragment, each starting from opposite ends of the fragment.
Genome assembly and comparative analysis
Quality filtered Illumina reads (>80% of bases in the read pair had quality scores > 20) corresponding to ~19-fold coverage (assuming a 1 Gb genome) and filtered 454 reads (reads with at least 300 bp of Q20 bases) corresponding to ~2-fold coverage were used for a genome assembly with phusion . Chromosome level scaffolds were generated from the assembled contigs by merging position and orientation information about a subset of the reads in the amakihi contigs with their orthologous position in the zebra finch genome (taeGut1)  as determined by a megablast  search. The amakihi chromosome level scaffolds were aligned to the zebra finch genome with Pecan  using the default settings. The consensus sequences for each chromosome have been uploaded to NCBI (BioProject 252695) and will be available upon publication of this article.
SNP discovery in the amakihi genome
The Illumina reads were mapped to the amakihi genome assembly with Novoalign V2.08.02 (Novoalign short read mapper: http://www.novocraft.com/), duplicate read-pairs were removed using SAMtools  and variants detected using MPG . For genome-wide statistics, single-nucleotide variants were filtered to include only heterozygous sites with an MPG score > =10 and a MPG score to read-depth ratio > = 0.5, and sites that had a read-depth less than approximately 2-fold the mean depth of coverage, i.e. <=100x on the autosomes and < =50x on the Z chromosome.
Sequence processing using RAD tags without a reference
Raw reads were evaluated for quality using FastQC . Reads were trimmed at the point where per-base quality score inter-quartile range dropped below a quality score of 20. The quality of most read two sequences deteriorated near the beginning of the read, so these sequences were not used. All read one sequences were trimmed to a length of 75 bp, the shortest length of any of the libraries before quality score dropped below 20. All reads were trimmed to this length because the Stacks RAD tag analysis software requires reads from all samples to be the same length. After they were trimmed, reads were filtered for quality using a python script (QualityFilterFastQ.py ) (amakihi, iiwi, apapane, both palila) or fastq_quality_filter from the FastX-toolkit  (both Nihoa finches), both of which removed any read that had any base pair with a quality score below 20.
Stacks  was used to assemble and call SNPs from RAD loci using the denovo_map.pl pipeline for samples without a reference genome. Several samples were first run individually using the populations mode of Stacks. Next, all samples were analyzed together using superparent mode. This mode is designed for test crosses and creates a catalog of possible loci based on the loci present in the parents. For non-cross samples, read one sequences are concatenated into a ‘superparent’ from which a catalog of stacks loci is developed, followed by alignment and genotyping of each sample at each catalog locus. Default parameters were used except as follows: minimum of three identical raw reads to create a stack and three mismatches allowed between loci when building the catalog of possible loci. The apapane read one file became corrupted during the compression process and was not used in analyses subsequent to individual Stacks runs. After running Stacks, Python scripts were used to filter the output to remove stacks that were found in the superparent catalog but not found in any progeny (samples; no progeny filter) or where one or more individuals had more than two genotypes for a given locus (bad genotypes filter). Stacks representing repetitive regions of the genome were removed by assembling the stacks consensus sequences with minimum overlap 70 bp and maximum read difference of 5% and then discarding stacks that assembled into contigs composed of greater than two sequences.
Using the quality-filtered Stacks consensus sequences only, we compared Stacks SNP calls for the amakihi with genotypes from the genome assembly (same amakihi). BWA was used to align Stacks consensus sequences to the genome assembly. Next, custom Python and Perl scripts were used to match Stacks SNP genotypes with genome genotypes on a sample of 11 chromosomes selected to include various sizes (chromosomes 1, 5, 7, 9, 15, 20, 22, 23, 24, 26 and 28). These scripts are available upon request to the author.
Alignment of RAD reads to amakihi genome and SNP genotyping
Read one sequences from the RAD tag libraries were trimmed and quality filtered as for Stacks analysis, except reads from the MiSeq run (both Nihoa finch and one palila) were trimmed to 130 bp instead of 75 bp as there was no need to keep all sequences the same length for this part of the analysis. The amakihi genome assembly was indexed using the ‘bwtsw’ algorithm of BWA  and the trimmed, quality-filtered read one sequences were aligned to the indexed reference using the ‘samse’ algorithm  for single reads. The HaplotypeCaller function  of the Genome Analysis Toolkit (GATK ) was used to identify variable sites between the amakihi genome and aligned honeycreeper reads using the MalformedReadFilter and default parameters. The VariantFiltration function of GATK was used to filter variant sites, passing those with quality >30 and depth >6.
All RAD read one sequences were aligned to the amakihi reference sequence using Geneious and calls for each sample for all sites were generated using the GATK HaplotypeCaller function with the EMIT_ALL_CONFIDENT_SITES parameter. PyRAD v. 1.2  was used to identify RAD sequences with 10X or higher coverage present in three or more (out of seven) taxa. These were clustered based on similarity of 0.9 in USEARCH . The total number of aligned base-pairs was 12,847. A maximum likelihood analysis in Garli v2.0  was performed on these data with 100 search replicates.
Availability of supporting data
The data sets supporting the results of this article, including the amakihi genome sequence (each chromosome sequence in FASTA format) and raw RAD reads (FASTQ format), are available in the NCBI repository, BioProject: PRJNA252695.
The Smithsonian Institution provided funds to R.C.F. and T.C. for this research through the Pell Competitive Grants Program for Science and the Office of the Undersecretary for Science Next Generation Sequencing Small Grants Program. J.W.T., J.C.M. and the NISC Comparative Sequencing Program were funded by the NHGRI Intramural Research Program. Samples used in this study were obtained under appropriate USFWS and Hawaii DLNR-DOFAW permits, and IACUC approvals. Bhanu Rekepalli and Amit Upadhyay from the Joint Institute for Computational Sciences group at the University of Tennessee provided scripts for comparing Stacks and amakihi genome genotypes. We thank Jason Howard of the Jarvis lab for assistance in coordinating the Roche 454 Sequencing reactions, and Roche 454 and the Duke Genome center for help in conducting the reactions. We also thank Nancy Rotzel McInerney of the CCEG lab for facilitating this research, Helen James for discussion and comments on the manuscript, and Jack Jeffrey for use of his photographs in Figure 4.
- International Chicken Genome Sequencing Consortium. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432: 695-716. 10.1038/nature03154.Google Scholar
- Warren WC, Clayton DF, Ellegren H, Arnold AP, Hillier LW, Kunstner A, Searle S, White S, Vilella AJ, Fairley S, Heger A, Kong L, Ponting CP, Jarvis ED, Mello CV, Minx P, Lovell P, Velho TAF, Ferris M, Balakrishnan CN, Sinha S, Blatti C, London SE, Li Y, Lin Y-C, George J, Sweedler J, Southey B, Gunaratne P, Watson M, et al: The genome of a songbird. Nature. 2010, 464: 757-762. 10.1038/nature08819.PubMed CentralPubMedView ArticleGoogle Scholar
- Dalloul RA, Long JA, Zimin AV, Aslam L, Beal K, Ann Blomberg L, Bouffard P, Burt DW, Crasta O, Crooijmans RPMA, Cooper K, Coulombe RA, De S, Delany ME, Dodgson JB, Dong JJ, Evans C, Frederickson KM, Flicek P, Florea L, Folkerts O, Groenen MAM, Harkins TT, Herrero J, Hoffmann S, Megens H-J, Jiang A, de Jong P, Kaiser P, Kim H, et al: Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis. PLoS Biol. 2010, 8: e1000475-10.1371/journal.pbio.1000475.PubMed CentralPubMedView ArticleGoogle Scholar
- Rands CM, Darling A, Fujita M, Kong L, Webster MT, Clabaut C, Emes RD, Heger A, Meader S, Hawkins MB, Eisen MB, Teiling C, Affourtit J, Boese B, Grant PR, Grant BR, Eisen JA, Abzhanov A, Ponting CP: Insights into the evolution of Darwin’s finches from comparative analysis of the Geospiza magnirostris genome sequence. BMC Genomics. 2013, 14: 95-10.1186/1471-2164-14-95.PubMed CentralPubMedView ArticleGoogle Scholar
- Ganapathy G, Howard JT, Ward JM, Li J, Li B, Li Y, Xiong Y, Zhang Y, Zhou S, Schwartz DC, Schatz M, Aboukhalil R, Fedrigo O, Bukovnik L, Wang T, Wray G, Rasolonjatovo I, Winer R, Knight JR, Koren S, Warren WC, Zhang G, Phillippy AM, Jarvis ED: High-coverage sequencing and annotated assemblies of the budgerigar genome. GigaScience. 2014, 3: 11-10.1186/2047-217X-3-11.PubMed CentralPubMedView ArticleGoogle Scholar
- Ellegren H, Smeds L, Burri R, Olason PI, Backstrom N, Kawakami T, Kunstner A, Makinen H, Nadachowska-Brzyska K, Qvarnstrom A, Uebbing S, Wolf JBW: The genomic landscape of species divergence in Ficedula flycatchers. Nature. 2012, 491: 756-760.PubMedGoogle Scholar
- Lerner H, Fleischer R: Prospects for the Use of Next-Generation Sequencing Methods in Ornithology. Auk. 2010, 127: 4-15. 10.1525/auk.2010.127.1.4.View ArticleGoogle Scholar
- Griffin DK, Robertson LBW, Tempest HG, Skinner BM: The evolution of the avian genome as revealed by comparative molecular cytogenetics. Cytogenet Genome Res. 2007, 117: 64-77. 10.1159/000103166.PubMedView ArticleGoogle Scholar
- Hackett SJ, Kimball RT, Reddy S, Bowie RCK, Braun EL, Braun MJ, Chojnowski JL, Cox WA, Han K-L, Harshman J, Huddleston CJ, Marks BD, Miglia KJ, Moore WS, Sheldon FH, Steadman DW, Witt CC, Yuri T: A Phylogenomic Study of Birds Reveals Their Evolutionary History. Science. 2008, 320: 1763-1768. 10.1126/science.1157704.PubMedView ArticleGoogle Scholar
- Kawakami T, Smeds L, Backström N, Husby A, Qvarnström A, Mugal CF, Olason P, Ellegren H: A high-density linkage map enables a second-generation collared flycatcher genome assembly and reveals the patterns of avian recombination rate variation and chromosomal evolution. Mol Ecol. 2014, 23: 4035-4058. 10.1111/mec.12810.PubMed CentralPubMedView ArticleGoogle Scholar
- Barker FK, Cibois A, Schikler P, Feinstein J, Cracraft J: Phylogeny and diversification of the largest avian radiation. Proc Natl Acad Sci U S A. 2004, 101: 11040-11045. 10.1073/pnas.0401892101.PubMed CentralPubMedView ArticleGoogle Scholar
- Lerner H, Meyer M, Hofreiter M, Fleischer R: Multilocus resolution of the phylogeny and timescale in the extant adaptive radiation of Hawaiian honeycreepers. Curr Biol. 2011, 21: 1838-1844. 10.1016/j.cub.2011.09.039.PubMedView ArticleGoogle Scholar
- James H, Olson S: Descriptions of thirty-two new species of birds from the Hawaiian Islands: Part II. Passeriformes. Ornithol Monogr. 1991, 46: 1-88.View ArticleGoogle Scholar
- Fleischer RC, McIntosh CE, Tarr CL: Evolution on a volcanic conveyor belt: using phylogeographic reconstructions and K–Ar-based ages of the Hawaiian Islands to estimate molecular evolutionary rates. Mol Ecol. 1998, 7: 533-545. 10.1046/j.1365-294x.1998.00364.x.PubMedView ArticleGoogle Scholar
- Price JP, Clague DA: How old is the Hawaiian biota? Geology and phylogeny suggest recent divergence. Proc R Soc Lond B Biol Sci. 2002, 269: 2429-2435. 10.1098/rspb.2002.2175.View ArticleGoogle Scholar
- Banko WE, Banko PC: Historic Decline and Extinction. Conserv Biol Hawaii For Birds. 2009, New Haven, CT: Yale University Press, 25-58.Google Scholar
- van Riper CI, van Riper SG, Goff ML, Laird M: The epizootiology and ecological significance of malaria in Hawaiian land birds. Ecol Monogr. 1986, 56: 327-344. 10.2307/1942550.View ArticleGoogle Scholar
- Atkinson CT, Woods KL, Dusek RJ, Sileo LS, Iko WM: Wildlife disease and conservation in Hawaii: Pathogenicity of avian malaria (Plasmodium relictum) in experimentally infected Iiwi (Vestiaria coccinea). Parasitology. 1995, 111: S59-S69. 10.1017/S003118200007582X.PubMedView ArticleGoogle Scholar
- Atkinson CT, Samuel MD: Avian malaria Plasmodium relictum in native Hawaiian forest birds: epizootiology and demographic impacts on apapane Himatione sanguinea. J Avian Biol. 2010, 41: 357-366. 10.1111/j.1600-048X.2009.04915.x.View ArticleGoogle Scholar
- Van Riper CI, Scott J: Limiting factors affecting Hawaiian native birds. Stud Avian Biol. 2001, 22: 221-233.Google Scholar
- Woodworth BL, Atkinson CT, LaPointe DA, Hart PJ, Spiegel CS, Tweed EJ, Henneman C, LeBrun J, Denette T, DeMots R, Kozar KL, Triglia D, Lease D, Gregor A, Smith T, Duffy D: Host population persistence in the face of introduced vector-borne diseases: Hawaii amakihi and avian malaria. Proc Natl Acad Sci U S A. 2005, 102: 1531-1536. 10.1073/pnas.0409454102.PubMed CentralPubMedView ArticleGoogle Scholar
- Atkinson C, Dusek R, Woods K, Iko W: Pathogenicity of avian malaria in experimentally-infected Hawaii Amakihi. J Wildl Dis. 2000, 36: 197-204. 10.7589/0090-3558-36.2.197.PubMedView ArticleGoogle Scholar
- Jarvi S, Atkinson C, Fleischer R: Immunogenetics and resistance to avian malaria in Hawaiian honeycreepers (Drepanidinae). Stud Avian Biol. 2001, 22: 254-263.Google Scholar
- Foster JT, Woodworth BL, Eggert LE, Hart PJ, Palmer D, Duffy DC, Fleischer RC: Genetic structure and evolved malaria resistance in Hawaiian honeycreepers. Mol Ecol. 2007, 16: 4738-4746. 10.1111/j.1365-294X.2007.03550.x.PubMedView ArticleGoogle Scholar
- Teer JK, Bonnycastle LL, Chines PS, Hansen NF, Aoyama N, Swift AJ, Abaan HO, Albert TJ, Margulies EH, Green ED, Collins FS, Mullikin JC, Biesecker LG: Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing. Genome Res. 2010, 20: 1420-1431. 10.1101/gr.106716.110.PubMed CentralPubMedView ArticleGoogle Scholar
- Paten B, Herrero J, Beal K, Fitzgerald S, Birney E: Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs. Genome Res. 2008, 18: 1814-1828. 10.1101/gr.076554.108.PubMed CentralPubMedView ArticleGoogle Scholar
- Burt DW, Bruley C, Dunn IC, Jones CT, Ramage A, Law AS, Morrice DR, Paton IR, Smith J, Windsor D, Sazanov A, Fries R, Waddington D: The dynamics of chromosome evolution in birds and mammals. Nature. 1999, 402: 411-413. 10.1038/46555.PubMedView ArticleGoogle Scholar
- Backström N, Karaiskou N, Leder EH, Gustafsson L, Primmer CR, Qvarnström A, Ellegren H: A Gene-Based Genetic Linkage Map of the Collared Flycatcher (Ficedula albicollis) Reveals Extensive Synteny and Gene-Order Conservation During 100 Million Years of Avian Evolution. Genetics. 2008, 179: 1479-1495. 10.1534/genetics.108.088195.PubMed CentralPubMedView ArticleGoogle Scholar
- Jetz W, Thomas GH, Joy JB, Hartmann K, Mooers AO: The global diversity of birds in space and time. Nature. 2012, 491: 444-448. 10.1038/nature11631.PubMedView ArticleGoogle Scholar
- Aslam M, Bastiaansen J, Elferink M, Megens H-J, Crooijmans R, Blomberg L, Fleischer R, Van Tassell C, Sonstegard T, Schroeder S, Groenen M, Long J: Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo). BMC Genomics. 2012, 13: 391-10.1186/1471-2164-13-391.PubMed CentralPubMedView ArticleGoogle Scholar
- Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA: Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. PLoS ONE. 2008, 3: e3376-10.1371/journal.pone.0003376.PubMed CentralPubMedView ArticleGoogle Scholar
- Davey JW, Cezard T, Fuentes-Utrilla P, Eland C, Gharbi K, Blaxter ML: Special features of RAD Sequencing data: implications for genotyping. Mol Ecol. 2013, 22: 3151-3164. 10.1111/mec.12084.PubMed CentralPubMedView ArticleGoogle Scholar
- McQueen HA, Fantes J, Cross SH, Clark VH, Archibald AL, Bird AP: CpG islands of chicken are concentrated on microchromosomes. Nat Genet. 1996, 12: 321-324. 10.1038/ng0396-321.PubMedView ArticleGoogle Scholar
- Smith J, Bruley CK, Paton IR, Dunn I, Jones CT, Windsor D, Morrice DR, Law AS, Masabanda J, Sazanov A, Waddington D, Fries R, Burt DW: Differences in gene density on chicken macrochromosomes and microchromosomes. Anim Genet. 2000, 31: 96-103. 10.1046/j.1365-2052.2000.00565.x.PubMedView ArticleGoogle Scholar
- Federico C, Cantarella C, Scavo C, Saccone S, Bed’Hom B, Bernardi G: Avian genomes: different karyotypes but a similar distribution of the GC-richest chromosome regions at interphase. Chromosom Res. 2005, 13: 785-793. 10.1007/s10577-005-1012-7.View ArticleGoogle Scholar
- Nikolajewa S: Common patterns in type II restriction enzyme binding sites. Nucleic Acids Res. 2005, 33: 2726-2733. 10.1093/nar/gki575.PubMed CentralPubMedView ArticleGoogle Scholar
- Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.PubMed CentralPubMedView ArticleGoogle Scholar
- McKenna A, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo M: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20: 1297-1303. 10.1101/gr.107524.110.PubMed CentralPubMedView ArticleGoogle Scholar
- Eaton DAR: PyRAD: assembly of de novo RADseq loci for phylogenetic analyses. Bioinformatics. 2014, 30: 1844-1849. 10.1093/bioinformatics/btu121.PubMedView ArticleGoogle Scholar
- Zwickl D: Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. PhD Dissertation. 2006, The University of Texas at AustinGoogle Scholar
- Amadon D: The Hawaiian honeycreepers (Aves, Drepaniidae). Bull AMNH. 1950, 95: 151-262.Google Scholar
- Richards LP, Bock WJ: Functional Anatomy and Adaptive Evolution of the Feeding Apparatus in the Hawaiian Honeycreeper Genus Loxops (Drepanididae). Ornithol Monogr. 1973, 15: 1-173.Google Scholar
- Raikow R: The origin and evolution of the Hawaiian honeycreepers (Drepanididae). Living Bird. 1977, 15: 95-117.Google Scholar
- Johnson NK, Marten JA, Ralph CJ: Genetic Evidence for the Origin and Relationships of Hawaiian Honeycreepers (Aves: Fringillidae). Condor. 1989, 91: 379-396. 10.2307/1368317.View ArticleGoogle Scholar
- Tarr CL, Fleischer RC: Mitochondrial-DNA Variation and Evolutionary Relationships in the Amakihi Complex. Auk. 1993, 110: 825-831. 10.2307/4088636.View ArticleGoogle Scholar
- Reding D, Freed L, Cann R, Fleischer R: Spatial and temporal patterns of genetic diversity in an endangered Hawaiian honeycreeper, the Hawaii Akepa (Loxops coccineus coccineus). Conserv Genet. 2010, 11: 225-240. 10.1007/s10592-009-0025-8.View ArticleGoogle Scholar
- Reding DM, Foster JT, James HF, Pratt HD, Fleischer RC: Convergent evolution of “creepers” in the Hawaiian honeycreeper radiation. Biol Lett. 2009, 5: 221-224. 10.1098/rsbl.2008.0589.PubMed CentralPubMedView ArticleGoogle Scholar
- Orr N, Back W, Gu J, Leegwater P, Govindarajan P, Conroy J, Ducro B, Van Arendonk JAM, MacHugh DE, Ennis S, Hill EW, Brama PAJ: Genome-wide SNP association–based localization of a dwarfism gene in Friesian dwarf horses. Anim Genet. 2010, 41: 2-7.PubMedView ArticleGoogle Scholar
- Jones FC, Grabherr MG, Chan YF, Russell P, Mauceli E, Johnson J, Swofford R, Pirun M, Zody MC, White S, Birney E, Searle S, Schmutz J, Grimwood J, Dickson MC, Myers RM, Miller CT, Summers BR, Knecht AK, Brady SD, Zhang H, Pollen AA, Howes T, Amemiya C, Lander ES, Di Palma F, Lindblad-Toh K, Kingsley DM: The genomic basis of adaptive evolution in threespine sticklebacks. Nature. 2012, 484: 55-61. 10.1038/nature10944.PubMed CentralPubMedView ArticleGoogle Scholar
- Tarr CL, Conant S, Fleischer RC: Founder events and variation at microsatellite loci in an insular passerine bird, the Laysan finch (Telespiza cantans). Mol Ecol. 1998, 7: 719-731. 10.1046/j.1365-294x.1998.00385.x.View ArticleGoogle Scholar
- Slikas B, Jones IB, Derrickson SR, Fleischer RC: Phylogenetic relationships of Micronesian white-eyes based on mitochondrial sequence data. Auk. 2000, 117: 355-365. 10.1642/0004-8038(2000)117[0355:PROMWE]2.0.CO;2.View ArticleGoogle Scholar
- Faircloth BC, Glenn TC: Not All Sequence Tags Are Created Equal: Designing and Validating Sequence Identification Tags Robust to Indels. PLoS ONE. 2012, 7: e42543-10.1371/journal.pone.0042543.PubMed CentralPubMedView ArticleGoogle Scholar
- Mullikin JC, Ning Z: The Phusion Assembler. Genome Res. 2003, 13: 81-90. 10.1101/gr.731003.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhang Z, Schwartz S, Wagner L, Miller W: A greedy algorithm for aligning DNA sequences. J Comput Biol J Comput Mol Cell Biol. 2000, 7: 203-214. 10.1089/10665270050081478.View ArticleGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinforma Oxf Engl. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.View ArticleGoogle Scholar
- Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data. [http://www.bioinformatics.babraham.ac.uk/projects/fastqc/]
- Kircher M: Analysis of High-Throughput Ancient DNA Sequencing Data. Anc DNA. Volume 840. Edited by: Shapiro B, Hofreiter M. 2012, Humana Press: Totowa, NJ, 197-228.View ArticleGoogle Scholar
- FASTX-Toolkit. [http://hannonlab.cshl.edu/fastx_toolkit/index.html]
- Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH: Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences. G3 Genes Genomes Genet. 2011, 1: 171-182.Google Scholar
- DePristo M, Banks E, Poplin R, Garimella K, Maguire J, Hartl C, Philippakis A, del Angel G, Rivas M, Hanna M, McKenna A, Fennell T, Kernytsky A, Sivachenko A, Cibulskis K, Gabriel S, Altshuler D, Daly MJ: A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Rev Genet. 2011, 43: 491-498. 10.1038/ng.806.View ArticleGoogle Scholar
- Edgar RC: Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010, 26: 2460-2461. 10.1093/bioinformatics/btq461.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.