Correction: Transcriptome analysis of pigeon milk production – role of cornification and triglyceride synthesis genes

Background: The pigeon crop is specially adapted to produce milk that is fed to newly hatched young. The process of pigeon milk production begins when the germinal cell layer of the crop rapidly proliferates in response to prolactin, which results in a mass of epithelial cells that are sloughed from the crop and regurgitated to the young. We proposed that the evolution of pigeon milk built upon the ability of avian keratinocytes to accumulate intracellular neutral lipids during the cornification of the epidermis. However, this cornification process in the pigeon crop has not been characterised. Results: We identified the epidermal differentiation complex in the draft pigeon genome scaffold and found that, like the chicken, it contained beta-keratin genes. These beta-keratin genes can be classified, based on sequence similarity, into several clusters including feather, scale and claw keratins. The cornified cells of the pigeon crop express several cornification-associated genes including cornulin, S100-A9 and A16-like, transglutaminase 6-like and the pigeon ‘lactating’ crop-specific annexin cp35. Beta-keratins play an important role in ‘lactating’ crop, with several claw and scale keratins up-regulated. Additionally, transglutaminase 5 and differential splice variants of transglutaminase 4 are up-regulated along with S100-A10. Conclusions: This study of global gene expression in the crop has expanded our knowledge of pigeon milk production, in particular, the mechanism of cornification and lipid production. It is a highly specialised process that utilises the normal keratinocyte cellular processes to produce a targeted nutrient solution for the young at a very high turnover. Background Pigeon lactation was first noted in the literature in 1786 when John Hunter described pigeon milk as being like “..granulated white curd” [1]. This curd-like substance is produced in the crop of male and female pigeons and regurgitated to the young. Like the mammary gland, the pigeon crop undergoes significant changes to the tissue structure during lactation. Several histological studies have characterised these changes and determined that pigeon milk consists of desquamated, sloughed crop * Correspondence: Meagan.gillespie@csiro.au Australian Animal Health Laboratory, CSIRO Animal, Food and Health Sciences, 5 Portarlington Road, Geelong, Victoria, Australia School of Life and Environmental Sciences, Deakin University, Pigdons Road, Geelong, Victoria 3216, Australia Full list of author information is available at the end of the article © 2013 Gillespie et al.; licensee BioMed Centra Commons Attribution License (http://creativec reproduction in any medium, provided the or epithelial cells [2,3]. The process of pigeon milk production begins when the germinal cell layer of the crop rapidly proliferates in response to prolactin [4,5], and this results in a convoluted, highly folded epithelial structure that then coalesces as it out-grows the vasculature, to form the nutritive cell layer that is sloughed off to produce the milk. This nutritive cell layer contains lipidfilled vacuoles [2,3,5,6]. The lipid content of pigeon milk consists mainly of triglycerides, along with phospholipids, cholesterol, free fatty acids, cholesterol esters and diglycerides [7]. The triglyceride content decreases across the lactation period, from 81.2% of total lipid at day one, to 62.7% at day 19, whereas the other lipids increase, which suggests the cellular lipid content decreases towards the end of the lactation period, but the cell membrane-associated lipids remain constant [7]. l Ltd. This is an Open Access article distributed under the terms of the Creative ommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and iginal work is properly cited. Gillespie et al. BMC Genomics 2013, 14:169 Page 2 of 12 http://www.biomedcentral.com/1471-2164/14/169 Several studies have investigated the differences in gene expression between ‘lactating’ pigeon crop tissue and non‘lactating’ crop tissue [6,8,9]. Nearly three decades ago, Horseman and Pukac were the first to identify that mRNA species differ in response to prolactin injection in the crop [8]. Specifically, they identified and characterised gene expression and protein translation of the prolactinresponsive mRNA anxIcp35 and the non-prolactin-responsive isoform, anxIcp37 [9,10]. In addition, a recent global gene expression study in our laboratory [6] showed that genes encoding products involved in triglyceride synthesis and tissue signalling were up-regulated in the ‘lactating’ crop. We proposed that the evolution of the processes that result in the production of pigeon milk has built upon the more general ability of avian keratinocytes to accumulate intracellular neutral lipids during the cornification of the epidermis [11] in order to produce a nutritive substance for their young [6]. The mechanism of avian epidermal cornification and lipid accumulation is not well-characterised. However, studies have shown that antibodies against mammalian cornification proteins, which are relatively wellcharacterised, can cross-react with avian and reptilian species [12,13], which suggests similarities in cornification proteins amongst vertebrate species. Cultured chicken keratinocytes have been shown to express betakeratins (feather, scale and claw keratins), alpha-keratins (type I and II cytokeratins) and the cornified envelope precursor genes envoplakin and periplakin, as well as accumulating neutral lipids [11]. Mammalian keratinocytes differ from avian keratinocytes in that they are unable to accumulate intracellular neutral lipids [11], and can express alpha-keratins (cytokeratins) but not beta-keratins, which expanded from early archosaurians [14]. There are many cornification-associated proteins characterised from mammalian epidermal tissues. The proteins that form the cornified envelope include keratins, S100 proteins, small proline-rich proteins (SPRRs), late cornified envelope (LCE) proteins, annexins, involucrin, loricrin, filaggrin, desmoplakin, envoplakin, periplakin, trichohyalin, cystatin A, elafin and repetin [15]. Transglutaminase enzymes, some of which require cleavage by proteases and an increase in intracellular calcium concentration to become active, cross-link the cornified envelope proteins to form a ceramide lipid-coated protective barrier to the epidermis [16]. Many of the cornified envelope genes are present in the “epidermal differentiation complex” (EDC) which was first identified on chromosome 1q21 in humans [17]. Interestingly, the EDC region has been identified in an avian species (chicken), and is linked to the genes for beta-keratins, but lacks the LCE proteins [18]. Here we present an analysis of the pigeon crop transcriptome to show that pigeon milk production involves a specialised cornification process and de novo synthesis of lipids that accumulate intracellularly.


Background
Pigeon lactation was first noted in the literature in 1786 when John Hunter described pigeon milk as being like "..granulated white curd" [1]. This curd-like substance is produced in the crop of male and female pigeons and regurgitated to the young. Like the mammary gland, the pigeon crop undergoes significant changes to the tissue structure during lactation. Several histological studies have characterised these changes and determined that pigeon milk consists of desquamated, sloughed crop epithelial cells [2,3]. The process of pigeon milk production begins when the germinal cell layer of the crop rapidly proliferates in response to prolactin [4,5], and this results in a convoluted, highly folded epithelial structure that then coalesces as it out-grows the vasculature, to form the nutritive cell layer that is sloughed off to produce the milk. This nutritive cell layer contains lipidfilled vacuoles [2,3,5,6]. The lipid content of pigeon milk consists mainly of triglycerides, along with phospholipids, cholesterol, free fatty acids, cholesterol esters and diglycerides [7]. The triglyceride content decreases across the lactation period, from 81.2% of total lipid at day one, to 62.7% at day 19, whereas the other lipids increase, which suggests the cellular lipid content decreases towards the end of the lactation period, but the cell membrane-associated lipids remain constant [7].
Several studies have investigated the differences in gene expression between 'lactating' pigeon crop tissue and non-'lactating' crop tissue [6,8,9]. Nearly three decades ago, Horseman and Pukac were the first to identify that mRNA species differ in response to prolactin injection in the crop [8]. Specifically, they identified and characterised gene expression and protein translation of the prolactinresponsive mRNA anxI cp35 and the non-prolactin-responsive isoform, anxI cp37 [9,10]. In addition, a recent global gene expression study in our laboratory [6] showed that genes encoding products involved in triglyceride synthesis and tissue signalling were up-regulated in the 'lactating' crop. We proposed that the evolution of the processes that result in the production of pigeon milk has built upon the more general ability of avian keratinocytes to accumulate intracellular neutral lipids during the cornification of the epidermis [11] in order to produce a nutritive substance for their young [6].
The mechanism of avian epidermal cornification and lipid accumulation is not well-characterised. However, studies have shown that antibodies against mammalian cornification proteins, which are relatively wellcharacterised, can cross-react with avian and reptilian species [12,13], which suggests similarities in cornification proteins amongst vertebrate species. Cultured chicken keratinocytes have been shown to express betakeratins (feather, scale and claw keratins), alpha-keratins (type I and II cytokeratins) and the cornified envelope precursor genes envoplakin and periplakin, as well as accumulating neutral lipids [11]. Mammalian keratinocytes differ from avian keratinocytes in that they are unable to accumulate intracellular neutral lipids [11], and can express alpha-keratins (cytokeratins) but not beta-keratins, which expanded from early archosaurians [14]. There are many cornification-associated proteins characterised from mammalian epidermal tissues. The proteins that form the cornified envelope include keratins, S100 proteins, small proline-rich proteins (SPRRs), late cornified envelope (LCE) proteins, annexins, involucrin, loricrin, filaggrin, desmoplakin, envoplakin, periplakin, trichohyalin, cystatin A, elafin and repetin [15]. Transglutaminase enzymes, some of which require cleavage by proteases and an increase in intracellular calcium concentration to become active, cross-link the cornified envelope proteins to form a ceramide lipid-coated protective barrier to the epidermis [16]. Many of the cornified envelope genes are present in the "epidermal differentiation complex" (EDC) which was first identified on chromosome 1q21 in humans [17]. Interestingly, the EDC region has been identified in an avian species (chicken), and is linked to the genes for beta-keratins, but lacks the LCE proteins [18].
Here we present an analysis of the pigeon crop transcriptome to show that pigeon milk production involves a specialised cornification process and de novo synthesis of lipids that accumulate intracellularly.

Differentiation of the 'lactating' crop
Immunohistological analysis of the proliferating cells of the pigeon crop in its resting state and during nesting demonstrated the morphological changes that occur in preparation for pigeon milk production ( Figure 1). As the crop changed in preparation for lactation, the number and depth of rete pegs increased and the lamina propria became progressively more extended and narrow, which increased the surface area of the crop. During lactation the crop was highly proliferative, which resulted in the accumulation and sloughing of large tracts of cornified epithelium ( Figure 2). All lactating parents in this study (48 birds) fed their young every four hours over the lactation period. Histology revealed a cycle of production and turnover of cornified epithelium over the four-hour period ( Figure 2). The squabs milk intake reduced gradually toward the end of the lactation period, which lasted approximately fourteen days.
Analysis of transcriptional changes over the lactation period compared to non-'lactating' crop revealed no differentially expressed probes at pre-hatch (time points −8 and −2), large differences at hatch (time point 0) and 2 days post-hatch (time point +2) (17.2 and 48.8% of all probes differentially expressed, respectively), and no difference above what could be expected by chance (5%) at 10 days post-hatch (time point +10) (2.7% of probes differentially expressed) (Additional file 1: Table S1). Any effect of sex was ruled out by comparing males to females at non-'lactating' and 'lactating' time points. There was no difference above what could be expected by chance (Additional file 1: Table S2).
Cornification genes are differentially expressed in the 'lactating' pigeon crop Analysis of cornification-associated genes in the draft pigeon genome identified an epidermal differentiation complex (EDC) on scaffolds 1246 and 683, respectively ( Figure 3). Transcriptional analysis of these EDC genes and other cornification-associated genes in the pigeon crop at time points 0 and +2 revealed differential expression of 43 genes in 0, +2 or both 'lactating' pigeon crops compared with non-'lactating' crop ( Table 1). Thirteen of these genes were up-regulated and 30 were downregulated. Notably, the majority of cornification-associated genes up-regulated in the 'lactating' crop were keratins, constituting eight of the thirteen up-regulated genes. Five of these eight keratins were beta-keratins and three were alpha-keratins. Conversely, eight of the 30 down-regulated cornification-associated genes were alpha-keratins, and none were beta-keratins. Phylogenetic analysis of the beta-keratins ( Figure 4), which were all part of the pigeon EDC ( Figure 3) separates them into several groups. Feather, claw and scale keratins share a common ancestor related to chicken beta-keratin, from which feather keratins (none differentially expressed in 'lactating' crop) formed their own clade, and claw (ns and 10.5 to 12-fold up-regulated) and scale keratins (ns and 3.2-fold up-regulated) formed another monophyletic clade. Putative pigeon keratins formed three more clades not containing a chicken homolog, and ORF 683_38 formed a clade of its own. GenBank IDs of keratins with the highest amino acid identity to the pigeon keratins are found in Additional file 2.
Phylogenetic analysis of the alpha-keratins separates them into type I and type II ( Figure 4). Seven type I keratins and two type II keratins were down-regulated, and two type II keratins were up-regulated in 'lactating' crop (Table 1). Notably, all of the type I putative pigeon keratins were constrained to scaffold988, whereas the type I Figure 1 The pigeon crop differentiates during nesting. The non-'lactating' cropsac (A) differentiates during nesting (B&C), such that the lamina propria (*) becomes progressively more extended and narrow, and the number and depth of rete pegs (^) increases as the cropsac further differentiates. Proliferating cells are stained red with an antibody to proliferating cell nuclear antigen, and non-proliferating cells are counterstained blue with hematoxylin. Scale bar = 100 μm. keratins included 15 putative genes on scaffold748 and two on scaffold988 ( Figure 4). All of the chicken alphakeratins had a closely related putative pigeon homolog.
The epithelial cell-derived antimicrobial peptide encoding gene beta defensin 5 was up-regulated by 2.3 to 4.8-fold in 'lactating' crop at timepoints 0 and +2, respectively (Additional file 3).
Triglyceride synthesis is up-regulated in the 'lactating' pigeon crop Examination of lipid droplets in 'lactating' pigeon crop showed that lipid was present throughout the differentiated epithelium, and was perinuclear ( Figure 5). To investigate whether lipids could potentially be synthesised de novo, the expression of genes linked to milk lipid synthesis in the mouse mammary gland [19] were examined in the 'lactating' pigeon crop. Thirty-four mouse mammary glandlinked lipid synthesis genes were differentially expressed in the pigeon crop, including 7 variants of genes investigated in the mouse study. Expression patterns of milk lipid synthesis genes were similar in pigeon crop and mouse mammary gland, although pigeon crop expressed different variants of many genes of the triglyceride synthesis and fatty acid synthesis pathways in comparison to the mouse mammary gland. In particular, the triglyceride synthesis genes Agpat1 and Dgat1 were up-regulated in the lactating mouse mammary gland compared to pregnant mouse mammary gland [19], whereas Agpat3, Agpat9 and Dgat2 were up-regulated in the 'lactating' pigeon crop compared to non-'lactating' crop. The fatty acid synthesis gene Elovl1 was up-regulated in lactating mouse [19], whereas Elovl6 was up-regulated in 'lactating' pigeon crop. The lactating Figure 3 The pigeon epidermal differentiation complex. The pigeon EDC is located on (A) scaffolds 1246 and (B) 683 of the draft pigeon genome. It is bound by nicastrin and cathepsin S, and contains putative genes for the cornified envelope precursors repetin, cornulin, involucrin and filaggrin. In addition, the S100 genes S100A11, S100A4 and two copies of S100A9 are present. Two putative keratin-associated proteins (KAPs) are present, and clusters of beta keratins, feather keratins, scale keratins and claw keratins.

Discussion
This is the first genome-wide pigeon crop transcriptome study to investigate the molecular mechanism of pigeon milk production. Here we show that differential expression of cornification-associated proteins and de novo lipid synthesis genes in the pigeon crop during lactation contribute to a highly specialised process that leads to the production of pigeon milk.
In preparation for lactation, the pigeon crop increases in surface area through an increase in rete pegs and extension of the lamina propria ( Figure 1). This hyperplasia followed by desquamation results in large numbers of lipid-rich differentiated cells accumulating in the crop lumen, in the form of a curd-like substance, which provides nourishment for the young (Figure 2). Although the process of terminal differentiation, from the basal layer through to the desquamated layer takes days in mammals [20], it appears that the epidermal cells of the pigeon crop undergo a terminal differentiation program within the space of four hours. We have previously described this histologically [6]. The 1004-fold up-regulation of cornulin and 15-fold up-regulation of transglutaminase 6 ( Table 2), both late epidermal differentiation markers [21], in the cornified cell layer of the 'lactating' crop demonstrates the presence of terminally differentiated cells in the lactating pigeon crop epithelium.
Up-regulation of several beta keratins and three alpha keratins in the 'lactating' crop (Table 1) suggests an important function for keratin in the formation of pigeon milk. Beta-keratins are specific to archosaurians [22], and are found in the pigeon EDC (Figure 3), whereas alpha-keratins are ubiquitously expressed in eukaryotes. Phylogenetic  analysis of the putative pigeon beta-keratins places the majority of up-regulated beta keratins in claw and scale beta keratin groups (Table 1, Figure 4). Beta-keratins have been suggested to have evolved from alpha-keratins to form a new class of matrix proteins that have a structural role in cornification [22]. Hence, it appears that in addition to alpha-keratins, beta-keratins have an important structural role in 'lactating' pigeon crop cells. Unlike alpha-keratins, beta-keratins form their own filament-matrix structures [23] which negates the need to express matrix proteins to form cornified beta-keratin epidermis. The downregulation in 'lactating' crop of the typical mammalian cornified envelope precursors desmoplakin, envoplakin, periplakin, sciellin and cystatin A (Table 1) suggests that beta-keratins could play an alternative role to these matrix proteins in the 'lactating' crop. Alpha-keratins are cross-linked to matrix proteins by transglutaminase enzymes, which are activated by proteolytic cleavage and increased intracellular calcium concentration [15]. S100 proteins play a role in the establishment of the calcium gradient in epithelial cell layers [24] and can also be substrates for transglutaminase themselves [25]. Both transglutaminase and S100 protein-encoding  genes are up-regulated in 'lactating' crop (Table 1). Interestingly, prostate transglutaminase (transglutaminase 4) is differentially expressed in 'lactating' crop. Putative exons 7, 10 and 11 were down-regulated, while seven other putative exons were up-regulated, ( Table 1) which suggests there could be multiple splice variants; this is the case for human transglutaminase 4 in cancer tissues [26]. In addition, transglutaminase 5, which is expressed in mammalian cornifying epithelium [27] is up-regulated (Table 1). This is in contrast to mammalian cornifying tissues that express transglutaminases 1 and 3 in addition to transglutatminase 5 [28]. The up-regulation of the proteases calpain-15 and calpain 9 isoform 1 in the 'lactating' crop (Table 3) could suggest a role for these enzymes in the proteolytic activation of transglutaminases 4 and 5, as calpains are thought to activate transglutaminase 1 [15]. Additionally, cathepsin D has been suggested as an activator of transglutaminase, but this does not appear to be the case in the pigeon crop, as cathepsin D and six other cathepsin genes are downregulated (Table 3). In addition to being a substrate for transglutaminase, S100 proteins can interact with annexins and form part of the cornified envelope [29]. S100-A10 is up-regulated in 'lactating' crop, as are the pigeon lactation-specific annexin gene cp35 and its isoform cp37 (Table 1), which could indicate roles for these genes in the formation of the cornified envelope. Cp35 is expressed 20-fold higher in cornified cells of the 'lactating' crop, which suggests it has a function in the cornified cells, along with the S100 protein-encoding genes S100-A16-like and S100-A9-like ( Table 2).
Accumulation of neutral lipids in keratinocytes is a unique trait of avian species [11]. The pigeon makes use of this ability in the crop to produce a lipid-rich milk [30] for the young. It was suggested by Garrison and Scow [31] that the lipids in pigeon milk were sequestered from another organ due to the increase in lipoprotein lipase activity in the prolactin-stimulated crop. We have also shown previously that there is up-regulation of genes involved in the oxidation of imported triglycerides [6], and in this study we found that lipoprotein lipase was up-regulated in 'lactating' crop (Table 4). However, the current study showed that lipid synthesis in the 'lactating' pigeon crop is a combination of importation and de novo synthesis of lipids, and results in the perinuclear accumulation of neutral lipid droplets ( Figure 5). Table 4 shows that genes involved in triglyceride synthesis in the mouse mammary gland during lactation [19] are also differentially expressed in the 'lactating' crop. The majority of genes involved in de novo lipid synthesis in the mouse are also expressed in the pigeon, but there are three  gene variants that are expressed in the pigeon and not in the mouse. The pigeon expresses Agpat3, Agpat9 and Dgat 2 (Table 4), whereas the mouse expresses Agpat1 and Dgat1 [19], which suggests that both the mechanism of lipid synthesis and crop cornification in the pigeon varies from that of mammals. The differences in the specific combinations of genes expressed may be reflected in the differences in triglycerides produced by each species. Amongst mammalian species there are differences in the fatty acid composition of milk triglycerides [32]. However, a comparison of the major fatty acid components of pigeon milk; oleic acid, linoleic acid and palmitic acid [30], reveals these are also the major fatty acid components of mammalian milk fat. There is a difference in the expression of ELOVL genes involved in fatty acid synthesis in the mouse mammary gland and in the pigeon crop (Table 4).
In mouse and human, the ELOVL1 gene is up-regulated during lactation [19,33], whereas the pigeon crop upregulates ELOVL6 during lactation (Table 4). It has been shown that de novo synthesis of fatty acids in the mammary gland can change in response to dietary availability [34]. Therefore, the difference in ELOVL gene expression between mammals and pigeons could be due to differences in the dietary availability of triglycerides/fatty acids in the pigeon diet. ELOVL6, up-regulated in 'lactating' crop, has been shown to have high elongation activity on C16:0 long chain fatty acids, and also some activity on C18:1 and C18:2 long chain fatty acids [35], which are the major fatty acid components of pigeon milk. This suggests that a large proportion of pigeon milk fatty acids could be synthesised de novo in the crop. One of the major differences between pigeon milk fatty acids and mammalian milk fatty acids is the lack of very long chain fatty acids, which are synthesised de novo by ELOVL1 [35].
Here we have shown that pigeon milk is the result of a specialised cornification process that produced large numbers of lipid-laden, cornified cells with a very rapid four hour cycle of hyperplasia followed by desquamation in the 'lactating' pigeon crop.

Conclusions
This study has expanded our knowledge of pigeon milk production, in particular, the mechanism of cornification and lipid production in the crop. Pigeon lactation is a highly specialised process that utilises the normal keratinocyte cellular processes to produce a targeted nutrient solution for the young at a very high turnover rate.

Pigeon tissue sample collection
Thirty-two breeding pairs of King pigeons were purchased from Kooyong Squab Producers (Moama, New South Wales). They were housed in temperature-controlled cabinets (between 21-24°C) with a 12 hour light cycle (lights on 6 am), and supplied with nest bowls and nesting materials. Pigeons had ad libitum access to pigeon mix (pro-vitmin, Ivorsons, Geelong) and water. Control non-'lactating' pairs (ctrl, 13 birds) were culled prior to mating. Breeding pairs were culled at different lactation time points whereby squab hatch was designated as time zero. Time points prehatch have the prefix '-' and post-hatch have the prefix '+'. Specifically, breeding pairs were euthanised at 8 days pre-hatch (−8, n = 10 birds), 2 days pre-hatch (−2, n = 10 birds), at hatch (0, n = 14 birds), 2 days post-hatch (+2, n = 10 birds), and 10 days post-hatch (+10, n = 4 birds). Whole crop tissue samples were snap frozen in liquid nitrogen and separate samples of all crops were fixed in 10% neutral buffered formalin or snap frozen in optimal cutting temperature (OCT) compound for histology. Samples of pigeon crop from a time 0 pair were fixed in PaxGene (Qiagen) fixative according to the manufacturer's instructions for laser dissection microscopy, to investigate gene expression differences between basal and proliferating cell types. Samples of other whole tissues (brain, pituitary, thymus, esophagus, trachea, proventriculus, gizzard, heart, kidney, duodenum, ileum, jejunum, pancreas, spleen, cecum, bone marrow, muscle and skin) were snap frozen in liquid nitrogen and used for the construction of a pooled tissues cDNA library. The blood and spleen of the ten day old squab were removed; the blood into vacutainers coated with EDTA dipotassium salt and the spleen into sterile media (DMEM with 10% FCS, 100 U/mL penicillin, 100 μg/mL streptomycin, 500 μg/mL fungizone).
All work using animals was conducted in accordance with the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes (7th edition), and in accordance with institutional animal ethics guidelines (CSIRO AAHL Animal Ethics Committee).

Pigeon splenocyte stimulation
The squab spleen was minced through a 70 μm filter using a syringe plunger into 15 mL phosphate buffered saline (PBS). The blood was diluted in PBS, and the cell suspensions were layered slowly over the same volume of Lymphoprep (Axis-Shield, Oslo, Norway). After centrifugation the cells were removed from the interface of the gradient and washed twice with 50 mL PBS + 10% FCS. The cells were seeded on a 24-well plate (Nunc) at 5 × 10 5 cells/mL. To each well 10 μg/mL concanavalin A (Astral Scientific) was added and the cells were incubated at 37°C in 5% CO 2 . After 24 hours the cells were pelleted and re-suspended in 1 mL TRIreagant RT (Molecular Research Center) for RNA extraction and synthesis of the immune library. High throughput sequencing, assembly, microarray design and annotation Sequencing libraries were prepared from the dscDNA libraries using a Rapid Library Preparation Kit (Roche). Sequencing beads were generated with a SV-emPCR Kit (Roche) and sequenced on a 454 GS FLX using the titanium chemistry (Roche). Each sample was sequenced in a separate region. The raw reads from all regions were combined (430654) and assembled with Newbler v2.3 shotgun assembler (Roche). The resulting contigs (10463) and remaining Singletons (71997) were used to design unique microarray probes with OligoArray 2.1 [36]. The microarray probes were annotated via the source contigs or read sequences by a series of BLAST searches using an E-value of 10 -3 as cut-off for all searches. The first search used BLASTX [37] with all sequences against a local copy of the non-redundant protein database (dated 11 April 2012). All non-matched sequences were then used in a BLASTN query [38] against the non-redundant nucleotide database (dated 13 April 2012). Finally, the remaining unmatched sequences were used as queries in a TBLASTX search [37] against the nucleotide database.

Identification of putative pigeon cornification-associated full-length genes
Cornification-associated genes were identified by literature search, and Raw 454 reads or assembled contigs were used as local megaBLAST [38] queries against the Columba livia draft genome [39] to identify in which scaffold each gene of interest was present. The scaffolds of interest were then submitted to a Hidden Markov Model gene prediction program (FGENESH, Softberry; http://linux1.softberry.com/berry.phtml) using parameters for chicken (aves) to identify predicted full-length gene sequences. Where scaffolds were too large to be processed by FGENESH, the region of the scaffold with the BLAST match was submitted. Microarray probes were mapped to the predicted gene sequence by local BLAST. Non-redundant, non-overlapping microarray probes matching predicted gene coding sequences were identified by megaBLAST against the predicted gene sequences and the Columba livia draft genome sequence.

Phylogenetic analysis of pigeon alpha and beta keratins
Phylogenetic trees were constructed separately for alphaand beta-keratins. The evolutionary relatedness was inferred using the Minimum Evolution method [40]. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) was calculated [41]. The tree was drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the JTT matrix-based method [42] and are in the units of the number of amino acid substitutions per site. The ME tree was searched using the Close-Neighbor-Interchange (CNI) algorithm [43] at a search level of 1. The Neighbor-joining algorithm [44] was used to generate the initial tree. All positions containing alignment gaps and missing data were eliminated in pairwise sequence comparisons (Pairwise deletion option). Phylogenetic analyses were conducted in MEGA4 [45] after alignment in ClustalX [46].

Laser dissection microscopy and RNA amplification
PaxGene fixed time 0 pigeon crop of a female and male breeding pair were dehydrated through fresh ethanol and xylene using an automated processor (Leica), and embedded in paraffin according to the PaxGene manufacturer's instructions. Sections of 4 μm were cut by microtome and floated on to laser dissection slides (Leica #11505158 membrane slides PEN-membrane 2 um). The cornified crop epithelial cells and the basal cells of 5 serial sections of each crop were laser dissected using a Leica LMD6000 machine and collected by gravity into 500 μl PCR tubes. The dissected cells were dissolved in QIAzol by pipetting up and down, and RNA was extracted using the RNeasy Lipid Tissue kit according to the manufacturer's instructions, and eluted in 30 μl water. RNA was quantified using a Bioanalyzer RNA Pico chip, and an equal amount of RNA of each of the four samples was used for two rounds of RNA amplification using an Ambion MessageAmp II aRNA Amplification Kit, according to the manufacturer's instructions.
Microarray hybridisation, scanning and data preprocessing RNA was extracted from whole frozen pigeon crop tissue according to the manufacturer's instructions (Qiagen RNeasy Lipid Tissue kit). RNA quality and quantity was measured using a Bioanalyser RNA Pico chip and 5 μg of this RNA was used to synthesise first-strand cDNA with oligo dt primer according to the manufacturer's instructions (Invitrogen SuperScript SuperMix) which was then purified using a PCR purification kit (Qiagen). cDNA was synthesised and purified from whole crop RNA and from amplified laser dissected sample RNA. All cDNA samples were labelled with Cy3 using a Roche One-Color DNA Labelling Kit according to the manufacturer's instructions. The labelled microarray probes were re-suspended with a sample tracking control and hybridisation buffer and loaded on 12-plex 135 k custom pigeon microarrays (ArrayExpress ID A-MEXP-2257). These were hybridised for 20 hours in a NimbleGen Hybridisation Station (Roche) at 42°C and then washed using the NimbleGen wash buffer kit (Roche) according to the manufacturer's instructions. Each subarray was scanned at 2 μm on autogain with a NimbleGen MS200 microarray scanner (Roche). Sample tracking controls and control spots were used to autoalign a grid over each subarray using NimbleGen MS200 Software (Roche).

Microarray normalisation and statistical analysis
Robust Multichip Average (RMA) analysis [47] was used to background correct and normalise spot signal intensity. To compare datasets hybridised to different slides, the data were subjected to the non-parametric CombatR algorithm to remove batch effects [48]. The datasets were exported into GeneSpring (Agilent) and differentially expressed genes were identified using an unpaired Welch t-test assuming unequal variances with a Benjamini and Hochberg post-hoc test, with a false discovery rate of p = 0.05. The comparison of cell layers from laser dissected RNA omitted the post-hoc test as there were only two samples per group. All microarray data has been deposited into ArrayExpress (accession numbers E-MTAB-1317 for whole crop and E-MTAB-1318 for laser dissected cell layers).