Identification of avian W-linked contigs by short-read sequencing
© Chen et al.; licensee BioMed Central Ltd. 2012
Received: 12 December 2011
Accepted: 25 April 2012
Published: 14 May 2012
The female-specific W chromosomes and male-specific Y chromosomes have proven difficult to assemble with whole-genome shotgun methods, creating a demand for new approaches to identify sequence contigs specific to these sex chromosomes. Here, we develop and apply a novel method for identifying sequences that are W-specific.
Using the Illumina Genome Analyzer, we generated sequence reads from a male domestic chicken (ZZ) and mapped them to the existing female (ZW) genome sequence. This method allowed us to identify segments of the female genome that are underrepresented in the male genome and are therefore likely to be female specific. We developed a Bayesian classifier to automate the calling of W-linked contigs and successfully identified more than 60 novel W-specific sequences.
Our classifier can be applied to improve heterogametic whole-genome shotgun assemblies of the W or Y chromosome of any organism. This study greatly improves our knowledge of the W chromosome and will enhance future studies of avian sex determination and sex chromosome evolution.
KeywordsSex chromosomes Next-generation sequencing
While whole-genome shotgun and short-read assemblies are rather effective at reconstructing single-copy euchromatic genes, repetitive regions remain a major challenge. Short-read sequencing eliminates issues related to low cloning efficiency of interspersed repeats, but the assembly process remains problematic for both repeats and segmental duplications, as high sequence homogeneity among copies of a given repeat or duplication limit the potential to reconstruct sequence order [1, 2]. The inability to assemble repetitive regions can also pose difficulties for reconstructing large scaffolds from contigs , and the resulting gene fragmentation complicates gene assembly and annotation . The assembly of repeats and duplications therefore remains a major challenge in genome sequencing and is only possible by focused and concerted efforts [4, 5].
In species with chromosomal sex determination, the male-specific Y (in species with XX/XY sex determination) and female-specific W chromosomes (in species with ZZ/ZW sex determination) present special challenges to whole genome shotgun assembly. Sex-specific chromosomes are enriched for interspersed repeats and segmental duplications, on which whole genome shotgun methods perform poorly . The absence of crossing-over outside the pseudoautosomal region makes it impossible to take advantage of the genetic map for scaffolding the assembly . An additional hindrance is the lower sequence coverage of the sex chromosomes when sequencing heterogametic individuals, which reduces the average length of assembled contigs. Sex chromosomes receive half the coverage of autosomes when sequencing heterogametic individuals (the strategy used for chicken and turkey), and just a quarter of the autosomal coverage if sequencing a 50:50 mix of heterogametic and homogametic individuals (the strategy adopted for Drosophila melanogaster). Even in organisms like Drosophila melanogaster, where the quality of the whole genome shotgun assembly is extremely high, the Y chromosome remains a collection of unassembled contigs [7–9]. In the case of humans and chimpanzee, the Y chromosome assemblies are nearly complete, because these were sequenced by a painstaking BAC-by-BAC effort [5, 10].
There is considerable interest in assembling the female-specific avian W chromosome, not only to expand our understanding of sex-determination mechanisms, but also to address many questions about sex chromosome evolution. The exact mechanism of avian sex determination remains controversial: though the Z-linked DMRT1 gene is required for testis development (which is consistent with the Z dosage hypothesis), female sex determination may still involve a dominant, W-linked gene (analogous to Y-linked dominant sex determination in mammals) [11, 12]. More information about the W chromosome will contribute to our understanding of the evolution of female heterogamety as well as the dynamics of sex chromosome degradation and differentiation .
The chicken genome, which contains 38 autosomes and a pair of sex chromosomes, was sequenced in 2004 from a single female Red Junglefowl . About 70% of the heterochromatic chicken W chromosome consists of XhoI-, EcoRI-, and SspI-family repetitive sequences, and some known genes on the W are tandemly duplicated (e.g., Wpkci), leaving an estimated 10–15 Mb of non-redundant sequence . The chicken genome was sequenced to 6.6x coverage and assembled from whole-genome shotgun reads, as well as plasmid, fosmid, and bacterial artificial chromosome (BAC)-end read pairs . Of the 1.05 Gb of assembled sequence, only 933 Mb were anchored to a specific chromosome, leaving 121 Mb in unmapped sequence fragments, collectively called chrUn . Assembly of the W chromosome is especially poor: only 0.5% of the W (based on its estimated size of 50–55 Mb) has been successfully mapped. To date, only a handful of genes have been identified on the W: CHD1W, ATP5A1W, ASW/Wpkci/HINT1W[15, 19], SPINW, SMAD2, UBAP2W/ADO12W, ZNF532W, ZFRW, MIER3W, hnRNPKW, SSC2W/NIPBLW, and KCMFW (first identified in Build 2.1 and then cited by ).
Given the challenges in producing an assembly of the Y and W chromosomes by traditional shotgun-sequencing methods, new tools are required to identify sex-specific sequences generated by heterogametic shotgun sequencing projects. Here, we adapt a method devised by Carvalho and colleagues (unpublished; the original approach was aimed toward discovering Y-linked contigs in Drosophila) and identify female-specific sequences by contrasting male-derived, short-read shotgun genomic sequences and unmapped sequence fragments (chrUn) from the female-derived chicken genome. This method relies on the fact that the W chromosome is female-limited. By sequencing the genome of the homogametic sex (in our case, the ZZ male) to high depth and aligning the reads to the genome of the heterogametic sex (the ZW female), we were able to identify regions of the genome that are underrepresented in males and are therefore likely to be female-specific.
W-specific contigs have distinct coverages and read depths
Contig length influences alignment results
Evaluation of performance
We present a framework to identify W-specific sequences in the chicken genome. The approach is generalizable to identify any genomic sequences that are present uniquely in one sex (e.g., Y or W chromosomes within other animal species), and is potentially useful for characterizing the genomes of non-model organisms. Our method is based on the fact that sequences unique to the W chromosome are not present in the genome of a male. We mapped male-derived sequence fragments to the genome of a female and developed a naïve Bayes classifier using the alignment results (summarized by coverage and read depth). As predicted, contigs specific to the W chromosome had significantly lower coverages and read depths.
The accuracy of our method can be improved with deeper sequencing. Many of the false positive contigs probably had low coverages and read depths due to low sequence depth. We generated 367.2 Mbp of high quality sequence, which translates to only 0.45x coverage of the masked genome. It is therefore not surprising to find portions of the genome misleadingly underrepresented in the data set. At half this coverage, 40% of contigs of length 1 kb have very few reads aligning, making it more difficult to distinguish true female-specific contigs. However, this depth of sequencing was sufficient for proof of concept. We show that, even at low coverage, the approach was successful at identifying a focal set of candidate sequences for subsequent verification by targeted PCR.
Unlike traditional sequence mapping methods, our approach is not severely hindered by the lower sequence coverage of the W chromosome during shotgun sequencing of heterogametic individuals. While lower coverage results in W contigs that on average are shorter in length (and therefore more difficult to classify), we greatly improve performance by conditioning on contig length in the classification method. However, our method cannot fully overcome the challenges posed by repetitive regions. All interspersed repeats and segmental duplications were masked out of the genome before performing the alignments, thereby eliminating much of the W chromosome from consideration. It is possible to relax the stringency of the filtering step in further iterations of the classifier to identify euchromatic repeats that do not resemble genome typical repeats. Furthermore, this method cannot exhaustively find all non-repetitive W contigs – it can only detect unique regions specific to the W. Sequences in the pseudoautosomal region will produce the same read depth as autosomal regions, and recent gene duplication events may produce W-linked sequences with enough similarity to autosomal or Z-linked sequence to be represented in male genomes.
Because our method searches for regions in the male genome that are underrepresented in female-derived genome sequences, any male-specific deletions could lead to an inappropriate assignment of contigs to the W chromosome. Deletions in the White Leghorn genome compared to the Red Junglefowl genome are not an issue because all our PCR validations used males and females of the same species. Our method would classify a deletion in the White Leghorn genome as W-specific, but such a region would not show a female-specific amplification pattern in our PCR validation step. Misclassifications due to male-specific deletions can be detected by screening a larger set of individuals and by BAC screening and sequencing.
Despite the limitations of our approach, we were still able to identify more than 62 new W-specific contigs. Note that this number is an underestimate, as contigs that fail to produce a female-specific marker may still be located on the W chromosome. These new markers will greatly improve the assembly and annotation of the W chromosome. A more complete annotation of genes on the chicken W chromosome will accompany the BAC-based sequencing and assembly of the chromosome.
There is particular interest in fully annotating the avian W because the sex-determining mechanism in birds has yet to be completely characterized. DMRT1 is known to be required for testis development , though studies on triploid and chimeric chickens suggest there may be a female-determining gene that interacts with a male-determining locus on the Z [26, 27]. Evidence supporting the popular W-linked candidate, HINTW, is mixed: though HINTW is functionally different from its Z chromosome paralog , mis-expression of HINTW in male (ZZ) embryos resulted in normal testes development . Further annotation of the W may unearth other candidate ovary-determining genes.
Sequence information of the W chromosome would benefit several different evolutionary studies besides avian sex determination, from sex chromosome evolution to sexual conflict and sex-biased mutation rate . For example, birds are good subjects for the study of sex chromosome evolution because different bird groups exhibit parallel divergence of the W as well as variation in the degree of W chromosome degradation (from a largely undifferentiated state in ratites to a highly degenerate state in passerines) [13, 28]. The scope for genetic conflict is increased in ZW species because the W is expressed in both sexes in the form of maternal effects, and the accumulation of sexually antagonistic maternal effect genes could contribute to the decay of the non-recombining W . The W chromosome may be a magnet for female-specific fertility genes. Evolutionary theory indicates that male fertility genes are expected to be retained on the Y chromosome because they are free from the influence of selection in females [30, 31]. By symmetry, this same evolutionary theory leads to the expectation that the W chromosome may concentrate genes that are uniquely necessary for female fertility [30, 31]. Finally, ZW systems may be more appropriate than XY systems for studying sex-specific mutation rates: while higher mutation on the Y may be due to male-biased mutation or suppressed mutation on the X chromosome to minimize exposure of deleterious recessives in the hemizygote male, these hypotheses can be distinguished in ZW sex chromosomes .
The availability of more W-specific sequences also facilitates the development of additional sex-specific primers for unambiguous molecular sexing techniques. The ability to sex individuals is critical for answering several questions in evolution and ecology, and morphological identification of sex is often difficult in birds . The commonly used universal primer sets for avian molecular sexing depend on length differences between CHD-Z and CHD-W introns [34–36], which may be problematic in certain species due to CHD-Z polymorphisms  and heteroduplex molecule formation . Thus the new W-specific sequences identified here can help advance several different avenues of research.
Here we describe a novel approach for identifying sequences specific to a heterogametic sex chromosome. We performed a proof-of-concept experiment by aligning shotgun sequence reads from a male (ZZ) chicken to the genome of a female (ZW) chicken, and our classifier successfully identified >60 confirmed novel W-specific contigs despite low coverage. We believe that our method is widely applicable and can benefit future genome assembly efforts. While there have been significant investments in lowering sequencing costs and increasing sequencing throughput, little investment has been made in techniques to cope with the limitations of whole-genome shotgun sequencing strategies, particularly the challenges specific to sex chromosomes: low coverage, resolution of interspersed repeats and segmental duplications, inability to map, etc. In addition, de novo assemblies generated using only next-generation sequencing technologies are especially prone to collapsing segmental duplications and large repeats . The approach described here can quickly identify candidate W or Y chromosome markers, and these contigs can be extended by probing BAC libraries. A full assembly of the W chromosome still requires substantial BAC sequencing efforts, but this method can greatly facilitate the process of designing W-specific probes. A combination of our method with traditional BAC screening and sequencing would provide a powerful approach to assembling the W or Y chromosome in any organism.
Genomic DNA was extracted from the blood of a White Leghorn rooster using the Qiagen DNeasy kit. We generated 10.5 million 36 bp reads using the Illumina Genome Analyzer (GA-IIx). Duplicate and low-complexity reads were removed before alignment, resulting in a total of 10.2 million unique and high quality reads. The sequence data generated in this study have been submitted to the NCBI Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) under accession SRP008449.
We obtained chicken genome sequences (Build 2.1) and known W chromosome BAC sequences. The chicken genome assembly includes 18 scaffolds mapped to the W chromosome, and 1044 autosomal or Z-linked scaffolds. The 25,378 unmapped contigs (chrUn) had lengths ranging from 54 to 48,370 bp. Low complexity sequences and repeats were masked with RepeatMasker (http://ftp.genome.washington.edu/cgi-bin/RepeatMasker/). After removing segments less than 50 bp in length, this resulted in 920.7 Mbp of sequence and 20,069 unmapped contigs. However, because our method relies on the unique mapping of reads, any sequences that occur in multiple locations in the genome could lead to spurious results. Thus, more stringent filtering of the reference genome was required. We aligned the masked contigs to themselves in MUMMER  and masked any duplicate regions larger than 50 bp. After this more stringent filtering step, we were left with a total 823.7 Mbp of unique sequence, with 6,905 unmapped contigs.
Reads were aligned to the masked and filtered reference genome using MAQ . We allowed some mismatches in the alignment process to account for sequence divergence between White Leghorn and Red Junglefowl . Alignment results were summarized for each contig using two statistics: coverage and read depth (Figure1B). Here we define coverage as the fraction of unmasked bases in a contig that is covered by one or more reads. Read depth is the number of reads aligning to a contig, normalized by the total number of locations a read could align to that contig. Our measure of read depth is analogous to the widely used measure of gene expression, reads per kilobase of exon model per million mapped reads (RPKM). Because we used only one library, there was no reason to calculate RPKM, which standardizes among libraries.
Confirmation of predictions
Because a large portion of the initial chicken W chromosome assembly was later discovered to be misassigned [14, 42], we used genomic BLAST to ensure that the W contigs in our reference genome are representative of W-specific sequence. In addition, we confirmed any outliers in the initial W-specific set by comparing features of each W contig to features of the known set of autosomal and Z-linked contigs. We used 1000 bootstrap replicates to estimate confidence intervals of mean coverage and read depth for known autosomal or Z-linked contigs, which were then compared to the coverage and read depth values, respectively, of each putative W-specific contig.
Our method is based on the assumption that very few ZZ reads should align to W-specific contigs, which as a result should have significantly lower coverage and read depth compared to autosomal or Z-linked contigs (Figure1C). To confirm the predictions of our method, we compared the coverage and read depth for contigs of known location. We used nonparametric bootstrapping methods to determine whether known W and known autosomal or Z-linked contigs had different distributions of coverage and read depth. For each of the 1000 bootstrap replicates, we calculated the difference between the 100th quantile of the W bootstrap distribution and the 0th quantile of the non-W-specific bootstrap distribution. This difference should be positive if the distribution of coverage or read depth of autosomal/Z-linked contigs is distinctly greater than that of W-specific contigs.
Simulations to determine effect of contig length
Because the length of unmapped contigs varied greatly (from 50 to 44,574 unmasked bp), we tested the effect of length by simulating genomes consisting of different-sized contigs. Contigs were sorted by length into 500 bp bins. We fragmented the mapped portion of the reference genome into contigs of length 500 bp, 1 kb, etc. For each fragmented genome, we redid the alignments and compared the distributions of coverage and read length for W- and non-W-specific contigs.
We developed a naïve Bayes classifier to identify W-specific contigs. A naïve Bayes classifier uses a set of training data to calculate the probability that a given example belongs to a certain class based on a set of features. It simplifies the learning process by assuming that the features are independent, although in practice it performs well even if that assumption is violated. We will refer to each contig by its feature vector X = (x 1 , x 2 ), where x 1 is coverage and x 2 is read depth. The goal is to find the class C that maximizes the likelihood: P(X|C) = P(x 1 ,x 2 |C). C can be either W or non-W. Since we assume that x 1 and x 2 are conditionally independent, we can simplify this conditional probability to P(X|C) = P(x 1 |C)P(x 2 |C).
We assessed the performance of our test using Receiver Operating Characteristic (ROC) curves. ROC curves plot the true positive rate and false positive rate of a classifier over a range of threshold values, and the area under the curve (AUC) is a traditionally used statistic for model comparison. We generated ROC curves and calculated the AUC using the package ROCR in the R statistical package (http://www.r-project.org). A series of cross-validation tests using the previously-mapped contigs was used to fine-tune the bin sizes of classifier feature distributions and evaluate the effects of training set size and sample imbalance.
Validation and follow-up
W-specific candidates were verified using PCR. Genomic DNA was extracted from the blood of two female and two male White Leghorn chickens using the Qiagen DNeasy kit. Primers were designed for each candidate contig, and amplification was attempted in all four individuals (see Additional file 1 for primer sequences and PCR conditions). If a given contig amplified successfully in both females but not in either male, then it was considered female-specific. Some candidates were verified via PCR in two female and two male Red Junglefowl (UCD 100 Red Jungle Fowl, from M.E. Delany, University of California, Davis). Primer pairs were scored for their ability to produce bands from both female templates that differed from the bands produced from both male templates. Primer pairs with identical results on male and female templates were scored as non-specific. The validation results were used in additional tests of performance. We used independent tests to further investigate the effects of contig length and sample imbalance on the predictive accuracy of our classifier. Validated W-specific candidates will be annotated in Bellott et al. in prep.
We thank Karel A. Schat for generously providing White Leghorn (N-2 line) tissue samples, as well as Alex Coventry and Wes Hochachka for helpful statistical discussions. Thanks to Clement Chow, Tim Connallon, Angela Early, and Scott Edwards for valuable comments on the manuscript. Grace Chi helped with labwork for some validations. NC was supported by a National Science Foundation Graduate Research Fellowship. This work was supported in part by NIH grant R01 GM64590 to AGC and AB Carvalho.
- Mardis ER: The impact of next-generation sequencing technology on genetics. Trends Genet. 2008, 24 (3): 133-141.View ArticlePubMedGoogle Scholar
- Alkan C, Sajjadian S, Eichler EE: Limitations of next-generation genome sequence assembly. Nat Methods. 2011, 8 (1): 61-65.PubMed CentralView ArticlePubMedGoogle Scholar
- Green ED: Strategies for the systematic sequencing of complex genomes. Nat Rev Genet. 2001, 2 (8): 573-583.View ArticlePubMedGoogle Scholar
- Schueler MG, Higgins AW, Rudd MK, Gustashaw K, Willard HF: Genomic and genetic definition of a functional human centromere. Science. 2001, 294 (5540): 109-115.View ArticlePubMedGoogle Scholar
- Skaletsky H, Kuroda-Kawaguchi T, Minx PJ, Cordum HS, Hillier L, Brown LG, Repping S, Pyntikova T, Ali J, Bieri T, Chinwalla A, Delehaunty A, Delehaunty K, Du H, Fewell G, Fulton L, Fulton R, Graves T, Hou S-F, Latrielle P, Leonard S, Mardis E, Maupin R, McPherson J, Miner T, Nash W, Nguyen C, Ozersky P, Pepin K, Rock S, Rohlfing T, Scott K, Schultz B, Strong C, Tin-Wollam A, Yang S-P, Waterston RH, Wilson RK, Rozen S, Page DC: The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature. 2003, 423 (6942): 825-837.View ArticlePubMedGoogle Scholar
- Foote S, Vollrath D, Hilton A, Page DC: The human Y chromosome: overlapping DNA clones spanning the euchromatic region. Science. 1992, 258 (5079): 60-66.View ArticlePubMedGoogle Scholar
- Hoskins RA, Carlson JW, Kennedy C, Acevedo D, Evans-Holm M, Frise E, Wan KH, Park S, Mendez-Lago M, Rossi F, Villasante A, Dimitri P, Karpen GH, Celniker SE: Sequence finishing and mapping of Drosophila melanogaster heterochromatin. Science. 2007, 316 (5831): 1625-1628.PubMed CentralView ArticlePubMedGoogle Scholar
- Hoskins RA, Smith CD, Carlson JW, Carvalho AB, Halpern A, Kaminker JS, Kennedy C, Mungall CJ, Sullivan BA, Sutton GG, Yasuhara JC, Wakimoto BT, Myers EW, Celniker SE, Rubin GM, Karpen GH: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly. Genome Biol. 2002, 3 (12): research0085.0081-0085.0016.View ArticleGoogle Scholar
- Carvalho AB, Vibranovski MD, Carlson JW, Celniker SE, Hoskins RA, Rubin GM, Sutton GG, Adams MD, Myers EW, Clark AG: Y chromosome and other heterochromatic sequences of the Drosophila melanogaster genome: how far can we go?. Genetica. 2003, 117 (2): 227-237.View ArticlePubMedGoogle Scholar
- Hughes JF, Skaletsky H, Pyntikova T, Graves TA, van Daalen SKM, Minx PJ, Fulton RS, McGrath SD, Locke DP, Friedman C, Trask BJ, Mardis ER, Warren WC, Repping S, Rozen S, Wilson RK, Page DC: Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature. 2010, 463 (7280): 536-539.PubMed CentralView ArticlePubMedGoogle Scholar
- Smith CA, Roeszler KN, Ohnesorg T, Cummins DM, Farlie PG, Doran TJ, Sinclair AH: The avian Z-linked gene DMRT1 is required for male sex determination in the chicken. Nature. 2009, 461 (7261): 267-271.View ArticlePubMedGoogle Scholar
- Ellegren H: Evolution of the avian sex chromosomes and their role in sex determination. Trends Ecol Evol. 2000, 15 (5): 188-192.View ArticlePubMedGoogle Scholar
- Mank JE, Ellegren H: Parallel divergence and degradation of the avian W sex chromosome. Trends Ecol Evol. 2007, 22 (8): 389-391.View ArticlePubMedGoogle Scholar
- Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432 (7018): 695-716.
- Hori T, Asakawa S, Itoh Y, Shimizu N, Mizuno S: Wpkci, encoding an altered form of PKCI, is conserved widely on the avian W chromosome and expressed in early female embryos: implication of its role in female sex determination. Mol Biol Cell. 2000, 11 (10): 3645-3660.PubMed CentralView ArticlePubMedGoogle Scholar
- Itoh Y, Mizuno S: Molecular and cytological characterization of SspI-family repetitive sequence on the chicken W chromosome. Chromosome Res. 2002, 10 (6): 499-511.View ArticlePubMedGoogle Scholar
- Ellegren H: First gene on the avian W chromosome (CHD) provides a tag for universal sexing of non-ratite birds. Proc R Soc Lond B Biol Sci. 1996, 263 (1377): 1635-1641.View ArticleGoogle Scholar
- Fridolfsson A-K, Cheng H, Copeland NG, Jenkins NA, Liu H-C, Raudsepp T, Woodage T, Chowdhary B, Halverson J, Ellegren H: Evolution of the avian sex chromosomes from an ancestral pair of autosomes. Proc Natl Acad Sci USA. 1998, 95 (14): 8147-8152.PubMed CentralView ArticlePubMedGoogle Scholar
- O’Neill M, Binder M, Smith C, Andrews J, Reed K, Smith M, Millar C, Lambert D, Sinclair A: ASW: a gene with conserved avian W-linkage and female specific expression in chick embryonic gonad. Dev Genes Evol. 2000, 210 (5): 243-249.View ArticlePubMedGoogle Scholar
- Itoh Y, Hori T, Saitoh H, Mizuno S: Chicken spindling genes on W and Z chromosomes: transcriptional expression of both genes and dynamic behavior of spindlin in interphase and mitotic cells. Chromosome Res. 2001, 9 (4): 283-299.View ArticlePubMedGoogle Scholar
- Axelsson E, Smith NGC, Sundström H, Berlin S, Ellegren H: Male-biased mutation rate and divergence in autosomal, Z-linked and W-linked introns of chicken and turkey. Mol Biol Evol. 2004, 21 (8): 1538-1547.View ArticlePubMedGoogle Scholar
- Wahlberg P, Strömstedt L, Tordoir X, Foglio M, Heath S, Lechner D, Hellström AR, Tixier-Boichard M, Lathrop M, Gut IG, Andersson L: A high-resolution linkage map for the Z chromosome in chicken reveals hot spots for recombination. Cytogenet Genome Res. 2007, 117 (1–4): 22-29.View ArticlePubMedGoogle Scholar
- Nam K, Ellegren H: The chicken (Gallus gallus) Z chromosome contains at least three nonlinear evolutionary strata. Genetics. 2008, 180 (2): 1131-1136.PubMed CentralView ArticlePubMedGoogle Scholar
- Alkan C, Coe BP, Eichler EE: Genome structural variation discovery and genotyping. Nat Rev Genet. 2011, 12 (5): 363-376.PubMed CentralView ArticlePubMedGoogle Scholar
- Medvedev P, Stanciu M, Brudno M: Computational methods for discovering structural variation with next-generation sequencing. Nat Methods. 2009, 6 (11s): S13-S20.View ArticlePubMedGoogle Scholar
- Smith CA, Roeszler KN, Sinclair AH: Genetic evidence against a role for W-linked histidine triad nucleotide binding protein (HINTW) in avian sex determination. Int J Dev Biol. 2009, 53: 59-67.View ArticlePubMedGoogle Scholar
- Smith CA, Sinclair AH: Sex determination: insights from the chicken. Bioessays. 2004, 26 (2): 120-132.View ArticlePubMedGoogle Scholar
- Shetty S, Griffin DK, Graves JAM: Comparative painting reveals strong chromosome homology over 80 million years of bird evolution. Chromosome Res. 1999, 7 (4): 289-295.View ArticlePubMedGoogle Scholar
- Miller PM, Gavrilets S, Rice WR: Sexual conflict via maternal-effect genes in ZW species. Science. 2006, 312 (5770): 73-View ArticlePubMedGoogle Scholar
- Fisher RA: The evolution of dominance. Biol Rev. 1931, 6 (4): 345-368.View ArticleGoogle Scholar
- Roldan ERS, Gomendio M: The Y chromosome as a battle ground for sexual selection. Trends Ecol Evol. 1999, 14 (2): 58-62.View ArticlePubMedGoogle Scholar
- Ellegren H: Molecular evolutionary genomics of birds. Cytogenet Genome Res. 2007, 117 (1–4): 120-130.View ArticlePubMedGoogle Scholar
- Ellegren H, Sheldon BC: New tools for sex identification and the study of sex allocation in birds. Trends Ecol Evol. 1997, 12 (7): 255-259.View ArticlePubMedGoogle Scholar
- Fridolfsson A-K, Ellegren H: A simple and universal method for molecular sexing of non-ratite birds. J Avian Biol. 1999, 30 (1): 116-121.View ArticleGoogle Scholar
- Griffiths R, Double MC, Orr K, Dawson RJG: A DNA test to sex most birds. Mol Ecol. 1998, 7 (8): 1071-1075.View ArticlePubMedGoogle Scholar
- Kahn NW, St John J, Quinn TW: Chromosome-specific intron size differences in the avian CHD gene provide an efficient method for sex identification in birds. Auk. 1998, 115 (4): 1074-1078.View ArticleGoogle Scholar
- Dawson DA, Darby S, Hunter FM, Krupa AP, Jones IL, Burke T: A critique of avian CHD-based molecular sexing protocols illustrated by a Z-chromosome polymorphism detected in auklets. Molecular Ecology Notes. 2001, 1 (3): 201-204.View ArticleGoogle Scholar
- Casey AE, Jones KL, Sandercock BK, Wisely SM: Heteroduplex molecules cause sexing errors in a standard molecular protocol for avian sexing. Mol Ecol Resour. 2009, 9 (1): 61-65.View ArticlePubMedGoogle Scholar
- Kurtz S, Phillippy A, Delcher A, Smoot M, Shumway M, Antonescu C, Salzberg S: Versatile and open software for comparing large genomes. Genome Biol. 2004, 5 (2): R12-PubMed CentralView ArticlePubMedGoogle Scholar
- Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008, 18 (11): 1851-1858.PubMed CentralView ArticlePubMedGoogle Scholar
- A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms. Nature. 2004, 432 (7018): 717-722.
- Stiglec R, Ezaz T, Graves JAM: Reassignment of chicken W chromosome sequences to the Z chromosome by fluorescence in situ hybridization (FISH). Cytogenet Genome Res. 2007, 116 (1–2): 132-134.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.