Transcriptome of the dead: characterisation of immune genes and marker development from necropsy samples in a free-ranging marine mammal
© Hoffman et al.; licensee BioMed Central Ltd. 2013
Received: 31 October 2012
Accepted: 23 January 2013
Published: 24 January 2013
Transcriptomes are powerful resources, providing a window on the expressed portion of the genome that can be generated rapidly and at low cost for virtually any organism. However, because many genes have tissue-specific expression patterns, developing a complete transcriptome usually requires a 'discovery pool' of individuals to be sacrificed in order to harvest mRNA from as many different types of tissue as possible. This hinders transcriptome development in large, charismatic and endangered species, many of which stand the most to gain from such approaches. To circumvent this problem in a model pinniped species, we 454 sequenced cDNA from testis, heart, spleen, intestine, kidney and lung tissues obtained from nine adult male Antarctic fur seals (Arctocephalus gazella) that died of natural causes at Bird Island, South Georgia.
After applying stringent quality control criteria based on length and annotation, we obtained 12,397 contigs which, in combination with 454 data previously obtained from skin, gave a total of 23,096 unique contigs. Homology was found to 77.0% of dog (Canis lupus familiaris) transcripts, suggesting that the combined assembly represents a substantial proportion of this species' transcriptome. Moreover, only 0.5% of transcripts revealed sequence similarity to bacteria, implying minimal contamination, and the percentage of transcripts involved in cell death was low at 2.6%. Transcripts with immune-related annotations were almost five-fold enriched relative to skin and represented 13.2% of all spleen-specific contigs. By reference to the dog, we also identified transcripts revealing homology to five class I, ten class II and three class III genes of the Major Histocompatibility Complex and derived the putative genomic distribution of 17,121 contigs, 2,119 in silico mined microsatellites and 9,382 single nucleotide polymorphisms.
Our findings suggest that transcriptome development based on samples collected post mortem may greatly facilitate genomic studies, not only of marine mammals but also more generally of species that are of conservation concern.
KeywordsTranscriptome Genomics Non-model organism Post mortem Immune gene Major Histocompatibility Complex (MHC) Microsatellite Single Nucleotide Polymorphism (SNP) Marine mammal Antarctic fur seal Arctocephalus gazella Pinniped
Massively parallel sequencing approaches such as Roche 454 and Illumina HiSeq are transforming the study of non-model organisms by dramatically increasing sequencing depth and coverage in return for a greatly reduced investment in time, labour and resources. This has facilitated the development of transcriptomes, which provide access to the coding sequences of literally thousands of genes and can be mined for genetic markers for use in genome scans, quantitative trait loci mapping and various other applications[2, 3].
mRNA sequencing is a particularly powerful approach for developing 'entry level' genomic resources for studying natural populations of non-model organisms, which are often compelling from ecological or evolutionary perspectives but poorly characterised genetically[4, 5]. The resulting transcriptomes can in turn be mined for Single Nucleotide Polymorphisms (SNPs) which have already proven powerful for gene mapping and are likely to become increasingly important in conservation genetics since they allow the characterisation of population structure and genetic diversity with unprecedented resolution. However, to exhaustively sample a given species' transcriptome usually requires animals to be sacrificed so that transcripts can be harvested from as many different organs as possible, with the spleen being a particularly important source of immune genes. Although this is generally considered acceptable for small, tractable and highly abundant organisms such as many insects, molluscs and even fish, it is less appropriate for species that are large, highly charismatic or threatened. One potential solution, supported by recent expression studies of human cadavers and slaughtered domestic pigs, is to sequence tissues obtained shortly after animals have died of natural causes.
Marine mammals provide an interesting case in point, being large and charismatic but extremely difficult to study given that they spend most, if not all, of their time at sea. Unfortunately, many marine mammal populations are also severely depleted due to a combination of historical exploitation and contemporary threats including bycatch and other fisheries interactions and climate change. These factors may explain why only two marine mammal EST libraries, in both cases developed from either skin or blood, have been published to date[17, 18]. Nevertheless, marine mammals are strong candidates for generating transcriptomes from tissues collected post mortem, since colonially breeding pinnipeds tend to suffer from high mortality rates and occasional mass mortality events[19, 20], while many cetacean species are routinely stranded ashore en masse.
The Antarctic fur seal (Arctocephalus gazella) is a highly sexually dimorphic pinniped that breeds in crowded rookeries where adult males compete fiercely for access to females. On Bird Island, South Georgia, a colony has been studied since the early 1980s with an aerial walkway providing unprecedented access for tissue sampling and the collection of detailed daily behavioural observations including metre-resolution locations of every adult male sighted ashore. An ongoing molecular study spanning almost two decades has shown that most, if not all, pups are conceived on land and that heterozygosity measured at nine microsatellites correlates with virtually every component of male reproductive success so far analysed including territory holding ability, body size and attractiveness to females. However, a paucity of genomic resources for this species, as well as for pinnipeds in general, constrains our ability to understand the underlying mechanisms.
To circumvent this problem, we recently developed a partial transcriptome assembly by 454 sequencing non-destructively obtained skin biopsy samples from twelve individuals of this species. In a subsequent pilot study, we showed that it was possible to obtain polymorphic microsatellites targeted towards candidate genes related to immunity and growth by selecting loci that appear variable in silico. We also exploited homology to the dog (Canis lupus familiaris) genome to map transcripts to specific chromosomes, allowing the development of a genome-wide distributed panel of 104 validated, polymorphic SNPs.
Although our initial transcriptome was more than adequate for many purposes, immune-related transcripts were not as numerous as originally hoped for, probably due to our having been restricted to the use of skin samples. Moreover, many of the immune-related contigs that we were able to construct had too little depth of sequence coverage to allow SNPs to be called with high confidence. This hampers the development of SNPs within candidate immune genes, a classic example of which, the Major Histocompatibility Complex (MHC), was recently found to be a remarkably strong predictor of survivorship to adulthood in the closely related grey seal, Halichoerus grypus.
Here, we 454 sequenced tissues obtained from six different organs (testis, heart, spleen, intestine, kidney and lung) obtained at necropsy from nine adult male fur seals that died of natural causes. We constructed tissue-specific assemblies to compare the representation of immune-related transcripts, including those showing homology to the MHC, and exploited comparative genomics with the dog to map the putative locations of transcripts and in silico derived genetic markers.
RNA yields from necropsy samples
Tissue samples collected after death from nine adult male Antarctic fur seals
Approximate time between death and necropsy (hours)
Whole RNA concentration (ng/μl)
Sequence data generated
Summary of 454 sequence data obtained from the Antarctic fur seal
Number of reads
Mean read length
Total base pairs (bp)
Assembly of the necropsy data consecutively with Newbler and CAP 3 to minimise redundancy resulted in a total of 14,362 contigs. Selection based on a minimum contig length of 500bp or, in the case of smaller contigs, for annotation e-values below 1e-10 resulted in 12,397 contigs (86.3%) being retained for further analysis. Added to these reads was another set of sequencing on skin biopsy samples described previously. These data were assembled using the same bioinformatic pipeline into 20,330 contigs, of which 14,271 (70.2%) were retained after filtering on size and annotation. Besides the assemblies of these two sets of sequences, allowing for direct comparison, a combined transcriptome using both sets of reads was constructed. This comprised 30,765 contigs, of which 23,096 (75.1%) were retained for the final transcriptome database. Contig lengths were qualitatively very similar among the three assemblies (Figure1) and the combined assembly had an average contig length of 971bp.
We next exploited GO annotation terms to extract a total of 521 immune-related contigs, which are listed in Additional file1: Table S1. These had a mean length of 854bp and contained a total of 1,471 SNPs (see subsequent section for SNP discovery). Associated GO terms were diverse, including 'immune system process', 'immune response', chemokine activity', 'antigen processing and presentation', and 'T cell selection'. As anticipated, immune-related contigs were over three times more numerous in the necropsy assembly than in the skin assembly, despite the overall amount of 454 sequence data being lower (Figure5b). This is largely attributable to our having included spleen tissues in the necropsy assembly, which alone yielded 433 contigs with GO annotations relating to immunity, accounting for 13.2% of all spleen-specific contigs. Moreover, in contrast to the other tissues, four of the top ten most abundant transcripts identified in the spleen were immune related (Additional file2: Table S2), implying relatively high levels of expression.
Comparative genomics between the fur seal and dog
Another question that can be addressed through comparative genomics is the extent to which a given collection of transcripts represents an exhaustive transcriptome. To evaluate the completeness of the fur seal transcriptome, we therefore explored homology between the combined assembly and the complete set of dog transcripts using a standard e-value threshold of 1e-10. A total of 14,435 (62.5%) fur seal contigs showed homology to 19,691 of the 25,559 (77.0%) dog transcripts, suggesting that the combined assembly represents a substantial proportion of the fur seal transcriptome.
Numbers of reads per million mapping to known canine MHC genes, summarised for the various tissues plus the combined assembly (see Methods for details)
Chromosome in dog
DQ Alpha precursor
Total class I
Total class II
Total class III
Molecular marker discovery
Transcriptome sequencing has become an increasingly popular means of developing genomic resources for non-model organisms. However, to sample a transcriptome exhaustively usually requires animals to be sacrificed so that mRNA can be harvested from multiple tissues. To circumvent this problem in a natural population of Antarctic fur seals, we 454 sequenced cDNA from various tissues obtained at necropsy from nine adult males that died of natural causes. This appears to have been remarkably successful, leading to measurable increases in the number and diversity of transcripts characterised, a greater representation of transcripts involved in immunity, and many more genetic markers discovered.
It is important to acknowledge that this study is not the first to apply high throughput sequencing to tissues obtained after death. For example, Cherel et al. recently explored links between gene expression and meat quality traits in recently slaughtered pigs, and Kang et al. characterised the developmental progression of gene expression patterns in the human brain. However, such studies are few and far between, and tend to focus on humans and their companion species. Our study is novel in that we have harvested multiple organs from animals dying of natural causes in a natural population of a non-model organism in order to generate an exhaustive as possible de novo transcriptome assembly.
Given that some of our samples were obtained from adult males that had died up to thirty hours prior to necropsy (Table1), we initially held concerns over the quantity of the RNA that could be extracted and whether this could have downstream impacts on the amount of resulting 454 sequence data. As anticipated, significant variation was found in the amount of total RNA that could be extracted from the different tissues, the more fibrous heart yielding the lowest average concentration despite our having used a bead mill. However, there was no clear relationship between RNA yield and the time elapsed between death and necropsy. Moreover, 454 read lengths were actually longer on average than previously obtained for the skin transcriptome. Although this is almost certainly due to our having switched from FLX to FLX+ sequencing chemistry, it nevertheless appears that using RNA from recently dead animals did not have a major detrimental impact on sequence length.
An important caveat to the above is that organs were harvested shortly after death (1–30 hours, mean = 13.9 hours) in order to minimise the risk of RNA degradation. Moreover, the prevailing climate at Bird Island during the austral summer is cool (usually between 1.5 and 4.5 degrees centigrade). It remains unclear how much high-quality RNA will be retained in carcasses at more advanced stages of decomposition, nor how dependent RNA degradation will be upon ambient temperature, making it difficult to extrapolate to other study systems. To evaluate this in fur seals would require significant further effort, since carcasses would need to be repeatedly sampled at different time intervals, generating multiple cDNA pools for sequencing. Moreover, such an experiment would ideally evaluate the importance of factors such as carcass size and prevailing environmental conditions.
For the previous assembly based on skin samples, we employed the cDNA option within Roche Newbler assembler version 2.3. However, a recent paper comparing the merits of five different assemblers found that this program performed poorly, generating the assembly with the greatest sequence redundancy and lowest proportion of mappings to reference sequences. In contrast, a pre-release version of Newbler 2.5 was identified as the joint top performing assembler, with further improvements being obtained through the concurrent use of more than one assembler. To obtain the best possible quality transcriptome, we therefore assembled our data consecutively with Newbler 2.5 and CAP 3. After filtering on the basis of size and annotation, we obtained 23,096 contigs, which is considerably more than the equivalent value of 14,271 obtained when the original skin data were re-assembled and filtered based on the same criteria. Taken together with the limited degree of overlap between the necropsy and skin assemblies, this suggests that our improved transcriptome is larger and contains numerous transcripts that had not previously been described. This is consistent with previous studies that have found tissue-specific patterns of gene expression[35, 36].
Quality filtering for size and annotation led to many small contigs being discarded from the final assembly. However, this was reflected in a greatly improved rate of annotation (65.3% of contigs yielded BLAST hits with an e-value cutoff of 1e-10) relative to the original skin transcriptome (47.0% of contigs yielded BLAST hits with an e-value cutoff of 1e-4). As found previously, the majority of matches were to vertebrates, with the giant panda and the dog being the two most frequently represented species. Interestingly, more matches were obtained to the panda than the dog for the necropsy and skin assemblies, but this pattern was reversed for the combined assembly. This difference may reflect the fact that the combined assembly contains substantially more larger transcripts than either the necropsy or skin assemblies (number of transcripts >1,000bp in length = 7,780, 3,789 and 4,320 respectively) due to the greater total number of reads available.
Reassuringly, the proportion of BLAST matches to bacterial sequences was actually lower for the necropsy than the skin assembly (0.5% versus 2.1%). This not only implies that we managed to avoid contaminating the samples during the necropsy procedure, but may also indicate that the seals we necropsied were not suffering from heavy bacterial infections at the time of death. The difference between the two assemblies if genuine would also be consistent with a previous study that isolated a diverse bacterial assemblage from fight wounds and lesions in adult males of this species. Thus, the greater representation of bacterial sequences in skin could quite possibly reflect the relatively unsanitary conditions of life in a crowded fur seal breeding colony.
Although it was not critical that our transcriptome be perfectly representative of normal patterns of gene expression, we initially held concerns that a transcriptome derived from necropsy samples could be dominated by transcripts involved in cell death. To evaluate this possibility, we therefore interrogated each of the assemblies, including the tissue-specific assemblies, for GO terms relating to apoptosis. Contigs with apoptosis-related GO terms were around twice as numerous in the necropsy than in the skin assembly. However, their overall percentage was low, at only 2.6% for the combined assembly and rising to a maximum of 4.3% for the heart and lung tissues. Thus, although our data are consistent with a certain degree of up-regulation of genes involved in apoptosis, there is no suggestion of a major bias being present.
This study was partly motivated by the limited representation of immune-related transcripts in our original skin assembly, together with the fact that many of these had insufficient depth of sequence coverage to allow SNPs to be identified with high confidence. Using the same approach as above, but this time filtering for transcripts based on GO annotations relating to immunity, we identified a total of 521 immune-related contigs, almost four times more than in the skin assembly. This improvement appears largely attributable to the inclusion of samples from the spleen, which carried by far the greatest proportion of immune-related transcripts, at 13.2%. However, the main causes of death in adult male Antarctic fur seals are fight wounds and pneumonia, raising the possibility that at least some of these individuals could have been mounting an immune response to bacterial infection prior to death. This is difficult to judge given the low proportion of sequence matches to bacteria, but could potentially have contributed towards the elevated representation of immune-related transcripts in the necropsy assembly.
Through comparative genomics with the dog, we also analysed the tissue-specific representation of transcripts revealing homology to 21 different canine MHC genes. All but three of these genes, including class I, II and III genes, were represented in at least one tissue. Consistent with immune-related contigs being four times more common in the spleen, the total number of transcripts per million mapping to MHC class I and II genes was also highest for the spleen, followed by the lung. A contrasting pattern was obtained for the MHC class III, with the lung having almost twice as many transcripts per million mapping as the spleen. This may reflect the role of class III genes in mounting the immune response via innate immunity, inflammation and immunomodulation, processes that could conceivably be of greater importance in sensitive mucosal tissues such as the lung. Overall, we also found that MHC-related transcipts had the lowest representation in the skin, providing further justification for our having expanded and improved upon our original transcriptome.
Despite seals and dogs having diverged approximately 43 million years ago, the two genomes appear to have retained large syntenic blocks. Sequence homology is also strong enough to allow the flanking sequences of most pinniped microsatellites to be mapped to unique locations in the dog[28, 40], making the canine genome a powerful resource for comparative purposes. We therefore took previous seal studies a step further by inferring the genomic distribution of 1,521,212 reads (61.1%), 17,121 contigs (74.1%), 2,119 microsatellite loci (81.8%) and 9,382 SNPs (58.0%) by reference to the dog. Much as expected, the resulting distributions appear relatively even (Figure6), as supported by a strong positive correlation between the number of contigs mapping to a given chromosome and the length of that chromosome in the dog. Moreover, although the mapping locations should be regarded as putative, we have reasons to believe that a substantial proportion will be correct. For example, Hoffman et al. previously found that four microsatellite loci, described as putatively X-linked because they were homozygous in 84 males but carried the expected proportions of heterozygote genotypes in females, all mapped to the X chromosome in the dog. Similarly, a SNP recently developed within a transcript revealing homology to mitochondrial NADH dehydrogenase revealed a pattern in which all individuals were called as homozygotes but for different alleles. Nevertheless, to shed further light on synteny between the two genomes, it would be desirable to develop a high-density comparative linkage map. This is made feasible for the first time by the large number of genetic markers we have identified in this study (see below).
Mining the combined transcriptome assembly for microsatellites yielded marginally more markers than found in the original skin assembly (2,592 loci versus 2,271 loci respectively), although these numbers are not strictly comparable because different programs were used. A more obvious improvement was observed for SNPs, with SWAP454 identifying 2.5 times as many SNPs at the 'strict' level (1,585 versus 642) and 1.8 times more SNPs at the 'relaxed' level (11,454 versus 6,261). This is perhaps to be expected given the increased number and diversity of contigs and the improved depth of coverage, to which SNP discovery is particularly sensitive. Even more SNPs were identified by the Newbler mapping program (n = 14,574), with a clear peak in the parameter space corresponding to a MAF of around 0.3 and a depth of coverage of approximately 16x. This suggests that many of these SNPs may comfortably exceed the minimum selection criteria of at least three non-duplicate reads showing the variant and seven or more high-quality reads in total. One possible reason for the difference in the number of SNPs called by the two programs could be that SWAP454 relies on mapping the raw sequence reads back to a reference sequence, in this case the transcriptome assembly. This can lead to some loss of data due to incomplete mapping. Moreover, because the program only calls SNPs on the basis of reads that map reliably to a single contig, redundancy within the assembly, whether unintentional or due to the use of assembly methods that classify related contigs into 'isogroups' as constructed using the Newbler assembler, could potentially reduce the total number of SNPs called.
By 454 sequencing samples obtained at necropsy, we have developed a greatly improved transcriptome assembly, thereby facilitating future evolutionary genetic studies of an important pinniped species, the Antarctic fur seal. We have also demonstrated that post mortem sampling provides a viable alternative to sacrificing animals, with positive implications for developing transcriptomes for charismatic and / or threatened species.
Tissue samples were collected from nine freshly dead adult male Antarctic fur seals at Freshwater Inlet on Bird Island, South Georgia (54° 00Â´ S, 38° 02Â´ W) during the austral summer of 2010 / 2011. All specimens were known to have died within the last 30 hours by direct observation, and were transported immediately to the station laboratory to prevent scavenging by seabirds. The necropsies were conducted systematically within the shortest possible time window after death (see Table1 for details). Tissue samples from all six organs were available for all but one of the animals, Agaz1011, due to this individual's testes having been scavenged by giant petrels (Macronectes giganteus and M. halli). Sampling equipment was sterilised using 95% ethanol between uses. Samples were transferred to RNAlater® and stored individually at −20°C for up to one month before being placed in a −80°C freezer for transport back to the UK.
RNA isolation and cDNA generation
Approximately 10mg of each tissue sample was disrupted and homogenised by bead milling within a TissueLyserII (Qiagen). Total RNA was then extracted using a Qiagen RNeasy® mini kit following the manufacturer’s recommended protocols, with an optional on-column DNAse digestion step included. The resulting RNA pellets were resuspended in 50μl of RNAse-free water (Ambion) and quantified using PicoGreen (Invitrogen) fluorometry. RNA quantity and quality were also assessed visually by running a fraction of each isolate on a 1% agarose gel. Total RNA samples were pooled in equimolar ratios, as far as possible, for each of the tissue types. PolyA+ RNA was purified from total RNA using selection on Oligo-dT containing paramagnetic beads from the MicroPoly(A)PuristTM mRNA Purification Kit (Ambion, Life Technologies) according to the manufacturer’s instructions. 200ng mRNA of each tissue was used for cDNA Rapid Library construction according to 454/Roche FLX+ protocols. The individual tissue libraries were MID tagged, pooled and sequenced on a Roche Genome Sequencer FLX+ instrument using the GS FLX Titanium Sequencing Kit XL+ (Roche).
Skin transcriptome data
Sequence data were also available from a previously published skin transcriptome. This was based on a normalised cDNA library constructed from skin biopsy samples of twelve individuals and subjected to a full run on a Roche Genome Sequencer FLX Titanium, which yielded 1,443,397 454 reads of mean length 286bp.
The pooled necropsy data and the previously sequenced skin biopsy data were separately assembled using Newbler (version 2.5.3), with a large genome style assembly followed by CAP 3 using default parameters to reduce redundancy. A combined assembly using all the 454 data from both sets was then constructed in the same way to generate the best possible reference transcriptome for further analysis. After annotation using the Genbank non-redundant (nr), swissprot and dog transcript (ftp://ftp.ensembl.org/pub/release-67/fasta/canis_familiaris/ref) databases, we discarded all contigs less than 500bp in length that failed to reveal homology at an e-value less than 1e-10 in at least one of these databases.
BLAST mapping and sequence annotation
To determine homology to known genes, Basic Local Alignment Search Tool (BLAST) searches with a standard e-value cutoff of 1e-10 were used to query contig sequences against the GenBank nr and Swissprot databases. Gene Ontology (GO) mappings were determined using an in-house database taking the top five Swissprot matches. Immune and apoptosis related GO terms were then used to select fur seal transcripts with such matches.
Comparative genomics with the dog
Using the combined assembly, four approaches were employed to explore sequence homology between the fur seal and dog. First, individual 454 reads were mapped to the dog genome (build 2.0) using Roche gsMapper version 2.3. The dog genome sequences in fasta format were obtained fromftp://ftp.ncbi.nih.gov/genomes/Canis_familiaris and comprised chromosomes 1 to 38 and X. Secondly, BLAST searches were carried out with the fur seal transcripts against each of the dog genome chromosomes. The highest score for a given transcript determined its placement along a chromosome. This provided the basis for positioning microsatellite and SNP loci (see below for marker discovery). Thirdly, BLAST searches were conducted comparing the assembled fur seal contigs against the set of canine nucleotide and peptide gene sequences, both annotated and abinitio, with an e-value cutoff of 1e-10.
We next compared Canis familiaris MHC genes for representation to the fur seal transcripts. These comprised 5, 12 and 4 class I, II, and III genes respectively (see Table3 for details), identified from the literature[44, 45] and through searches of the dog genome at Ensembl (http://www.ensembl.com). Newbler was used to map the 454 reads from the separate tissues against the selected MHC genes, including the associated variant transcripts. Following Ekblom et al., we then used the number of transcripts mapping per million as a proxy for gene expression.
Mining for molecular markers
The combined assembly was interrogated for microsatellite motifs using the program Phobos to identify sequences containing perfect di-, tri- and tetranucleotide repeats with a minimum length of five repeat units. SNP detection was carried out using two programs. First, to provide a direct comparison with the numbers of SNPs previously obtained from the skin transcriptome, we applied the Swap454 pipeline. This program first maps the raw reads back to the assembled contigs and then determines, while taking into account an error model for the 454 data, which positions are called as SNPs according to two user-specified thresholds. The first of these, 'MIN_RATIO' corresponds to the percentage of reads that differ from the reference sequence at a given position and the second, 'MIN_READS' to the number of copies present of the minor allele. We applied a 'strict' criterion to minimize the possibility of false positives arising from sequencing error and a 'relaxed' criterion to maximize the discovery of relatively infrequent alleles. For the former, MIN_RATIO was set to 0.33 and MIN_READS to 8, and for the latter MIN_RATIO was set to 0.1 and MIN_READS to 2. For comparison, we also used the Newbler mapping program (454 Life Sciences;http://www.454.com) to call SNPs that were deemed high confidence. For a SNP to be called in this way, there must be at least three non-duplicate reads showing the variant, with these reads being represented in both the forward and reverse directions, and at least seven reads with Phred quality scores of at least 20.
Availability of supporting data
DNA sequences are available at the Genbank Sequence Read Archive (number: SRA064103).
We would like to acknowledge library preparation and 454 sequencing performed by Shilo Dickens of the DNA sequencing facility of the Department of Biochemistry, University of Cambridge. We are also grateful to Juliane Zelwies for assistance preparing Figure6. Fieldwork was approved by BAS and samples were collected and retained under permits issued by the Department for Environment, Food and Rural Affairs (DEFRA) and in accordance with the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). This work contributes to the British Antarctic Survey (BAS) Ecosystems (Polar Science for Planet Earth) programme. The research was supported by a Marie Curie FP7-Reintegration-Grant (PCIG-GA-2011-303618) within the 7th European Community Framework Programme, together with NERC core funding to the British Antarctic Survey Ecosystems programme. We also acknowledge support of the publication fee by the Deutsche Forschungsgemeinschaft and the Open Access Publication Funds of Bielefeld University.
- Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH: Rapid transcriptome characterization for a non-model organism using 454 pyrosequencing. Mol Ecol. 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.x.View ArticlePubMed
- Nagaraj SH, Gasser RB, Ranganathan S: A hitchhiker’s guide to expressed sequence tag (EST) analysis. Briefings in Bioinformatics. 2007, 8: 6-21.View ArticlePubMed
- Bouck A, Vision T: The molecular ecologist's guide to expressed sequence tags. Mol Ecol. 2007, 16: 907-924.View ArticlePubMed
- Hudson ME: Sequencing breakthroughs for genomic ecology and evolutionary biology. Mol Ecol Resour. 2008, 8: 3-17. 10.1111/j.1471-8286.2007.02019.x.View ArticlePubMed
- Wheat CW: Rapidly developing functional genomics in ecological model systems via 454 transcriptome sequencing. Genetica. 2010, 138: 433-451. 10.1007/s10709-008-9326-y.View ArticlePubMed
- Slate J, Santure AW, Feulner PGD, Brown EA, Ball AD, Johnston SE, Gratten J: Genome mapping in intensively studied wild vertebrate populations. Trends in Genetics. 2010, 26: 275-284. 10.1016/j.tig.2010.03.005.View ArticlePubMed
- Ouborg NJ, Pertoldi C, Loeschcke V, Bilksma R, Hedrick PW: Conservation genetics in transition to conservation genomics. Trends in Genetics. 2010, 26: 177-187. 10.1016/j.tig.2010.01.001.View ArticlePubMed
- Wang B, Ekblom R, Castoe TA, Jones EP, Kozma R, Bongcam-Rudloff E, Pollock DD, Hoglund J: Transcriptome sequencing of black grouse (Tetrao tetrix) for immune gene discovery and microsatellite development. Open Biology. 2012, 2: 120054-10.1098/rsob.120054.PubMed CentralView ArticlePubMed
- Vogel H, Altincicek B, Glockner G, Vilcinkas A: A comprehensive transcriptome and immune-gene repertoire of the lepidopteran model host Galleria mellonella. BMC Genomics. 2011, 12: 308-10.1186/1471-2164-12-308.PubMed CentralView ArticlePubMed
- Clark MS, Thorne MAS, Vieira FA, Cardoso JCR, Power DM, Peck LS: Insights into shell deposition in the Antarctic bivalve Laternula elliptica: gene discovery in the mantle transcriptome using 454 pyrosequencing. BMC Genomics. 2010, 11: 362-10.1186/1471-2164-11-362.PubMed CentralView ArticlePubMed
- Renaut S, Nolte AW, Bernatchez L: Mining transcriptome sequences towards identifying adaptive single nucleotide polymorphisms in lake whitefish species pairs (Coregonus spp. Salmonidae). Mol Ecol. 2010, 19: 115-131.View ArticlePubMed
- Kang HJ, Kawasawa YI, Cheng F, Zhu Y, Xu X, Li M, Sousa AM, Pletikos M, Meyer KA, Sedmak G: Spatio-temporal transcriptome of the human brain. Nature. 2011, 478: 483-489. 10.1038/nature10523.PubMed CentralView ArticlePubMed
- Cherel P, Herault F, Vincent A, Le Roy P, Damon M: Genetic variability of transcript abundance in pig skeletal muscle at slaughter: relationships with meat quality traits. J Anim Sci. 2011, 90: 699-708.View ArticlePubMed
- Hoffman JI, Grant SM, Forcada J, Phillips CD: Bayesian inference of a historical genetic bottleneck in a heavily exploited marine mammal. Mol Ecol. 2011, 20: 3989-4008. 10.1111/j.1365-294X.2011.05248.x.View ArticlePubMed
- Read AJ: The looming crisis: interactions between marine mammals and fisheries. J Mammal. 2008, 89: 541-548. 10.1644/07-MAMM-S-315R1.1.View Article
- Simmonds MP, Isaac SJ: The impacts of climate change on marine mammals: early signs of significant problems. Oryx. 2007, 41: 19-26. 10.1017/S0030605307001524.View Article
- Mancia A, Lundqvist ML, Romano TA, Peden-Adams M, Fair PA, Kindy MS, Ellis BC, Gattoni-Celli S, McKillen DJ, Trent HF: A dolphin peripheral blood leukocyte cDNA microarray for studies of immune function and stress reactions. Dev Comp Immunol. 2007, 31: 520-529. 10.1016/j.dci.2006.07.011.View ArticlePubMed
- Ierardi JL, Mancia A, McMillan J, Lundqvist ML, Romano TA, Wise JP, Warr GW, Chapman RW: Sampling the transcriptome of the North Atlantic right whale. Comp Biochem Physiol, Part D. 2009, 4: 154-158.
- Osterhaus ADME, Groen J, Spijkers HEM, Broeders HWJ, UytdeHaag FGCM, de Vries P, Teppema JS, Visser IKG, van de Bildt MWG, Vedder EJ: Mass mortality in seals caused by a newly discovered virus-like morbillivirus. Vet Microbiol. 1990, 23: 343-350. 10.1016/0378-1135(90)90165-R.View ArticlePubMed
- Mattlin RH: Pup mortality of the New Zealand fur seal (Arctocephalus forsteri lesson). N Z J Ecol. 1978, 1: 138-144.
- Pyenson ND: The high fidelity of the cetacean stranding record: insights into measuring diversity by integrating taphonomy and macroecology. Proc R Soc London, Ser B. online early
- McCann TS: Territoriality and breeding behaviour of adult male Antarctic fur seal, Arctocephalus gazella. J Zool (London). 1980, 192: 295-310.View Article
- Doidge DW, Croxall JP, Ricketts C: Growth rates of Antarctic fur seal Arctocephalus gazella pups at South Georgia. J Zool (London). 1984, 203 (MAY): 87-93.
- Hoffman JI, Boyd IL, Amos W: Male reproductive strategy and the importance of maternal status in the Antarctic fur seal Arctocephalus gazella. Evolution. 2003, 57 (8): 1917-1930.View ArticlePubMed
- Hoffman JI, Boyd IL, Amos W: Exploring the relationship between parental relatedness and male reproductive success in the Antarctic fur seal Arctocephalus gazella. Evolution. 2004, 58 (9): 2087-2099.View ArticlePubMed
- Hoffman JI, Forcada J, Amos W: Getting long in the tooth: a strong positive correlation between canine size and heterozygosity in the Antarctic fur seal Arctocephalus gazella. J Hered. 2010, 101: 527-538. 10.1093/jhered/esq045.View ArticlePubMed
- Hoffman JI, Forcada J, Trathan PN, Amos W: Female fur seals show active choice for males that are heterozygous and unrelated. Nature. 2007, 445: 912-914. 10.1038/nature05558.View ArticlePubMed
- Hoffman JI, Forcada J, Amos W: Exploring the mechanisms underlying a heterozygosity-fitnesss correlation for canine size in the Antarctic fur seal Arctocephalus gazella. J Hered. 2010, 101: 539-552. 10.1093/jhered/esq046.View ArticlePubMed
- Hoffman JI: Gene discovery in the Antarctic fur seal (Arctocephalus gazella) skin transcriptome. Mol Ecol Resour. 2011, 11: 703-710. 10.1111/j.1755-0998.2011.02999.x.View ArticlePubMed
- Hoffman JI, Nichols HJ: A novel approach for mining polymorphic microsatellite markers in silico. PLoS One. 2011, 6: e23283-10.1371/journal.pone.0023283.PubMed CentralView ArticlePubMed
- Hoffman JI, Tucker R, Clark MS, Forcada J, Slate J: Rates of assay success and genotyping error when single nucleotide polymorphism genotyping in non-model organisms: a case study in the Antarctic fur seal. Mol Ecol Resour. 2012, 12: 861-872. 10.1111/j.1755-0998.2012.03158.x.View ArticlePubMed
- da Assunção Soares Franco M, Hoffman JI, Harwood J, Amos W: MHC genotype is a dominant predictor of mortality in the grey seal. Scientific Reports. 2012, 2: 659-
- Huang X, Madan A: CAP 3: a DNA sequence assembly program. Genome Res. 1999, 9: 868-877. 10.1101/gr.9.9.868.PubMed CentralView ArticlePubMed
- Kumar S, Blaxter ML: Comparing de novo assemblers for 454 transcriptome data. BMC Genomics. 2010, 11: 571-10.1186/1471-2164-11-571.PubMed CentralView ArticlePubMed
- Ekblom R, Balakrishnan CN, Burke T, Slate J: Digital gene expression analysis of the zebra finch genome. BMC Genomics. 2010, 11: 219-10.1186/1471-2164-11-219.PubMed CentralView ArticlePubMed
- Santure AW, Gratten J, Mossman JA, Sheldon BC, Slate J: Characterisation of the transcriptome of a wild great tit Parus major population by next generation sequencing. BMC Genomics. 2011, 12: 283-10.1186/1471-2164-12-283.PubMed CentralView ArticlePubMed
- Baker JR, Doidge DW: Pathology of the Antarctic fur seal (Arctocephalus gazella) in South Georgia. Br Vet J. 1984, 140: 210-219. 10.1016/0007-1935(84)90084-8.View ArticlePubMed
- Acevedo-Whitehouse K, Cunningham AA: Is MHC enough for understanding wildlife immunogenetics?. Trends Ecol Evol. 2006, 21 (8): 433-438. 10.1016/j.tree.2006.05.010.View ArticlePubMed
- Higdon JW, Bininda-Edmonds ORP, Beck RMD, Ferguson SH: Phylogeny and divergence of the pinnipeds (Carnivora: Mammalia) assessed using a multigene dataset. BMC Evol Biol. 2007, 7: 216-10.1186/1471-2148-7-216.PubMed CentralView ArticlePubMed
- Osborne AJ, Brauning R, Schultz JK, Kennedy MA, Slate J, Gemmell NJ: Development of a predicted physical map of microsatellite locus positions for pinnipeds, with wider applicability to the Carnivora. Mol Ecol Resour. 2010, 11: 503-513.View ArticlePubMed
- Galperin MY, Cochrane GR: The 2011 nucleic acids research database issue and the online molecular biology database collection. Nucleic Acids Res. 2011, 39: D1-D6. 10.1093/nar/gkq1243.PubMed CentralView ArticlePubMed
- Bairoch A, Apweiler R: The SWISS-PROT protein sequence data bank and its new supplement TREMBL. Nucleic Acids Res. 1996, 24: 21-25. 10.1093/nar/24.1.21.PubMed CentralView ArticlePubMed
- Altschul SF, Gish W, Miller W, Myers EW, L DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.View ArticlePubMed
- Wagner JL: Molecular organization of the canine major histocompatibility complex. J Hered. 2003, 94 (1): 23-26. 10.1093/jhered/esg002.View ArticlePubMed
- Debenham SL, Hart EA, Ashurst JL, Howe KL, Quail MA, Ollier WER, Binns MM: Genomic sequence of the class I region of the canine MHC: comparison with the MHC of other mammalian species. Genomics. 2005, 85: 48-59. 10.1016/j.ygeno.2004.09.009.View ArticlePubMed
- Phobos 3.3.11.http://www.rub.de/spezzoo/cm/cm_phobos.htm,
- Brockman W, Alvarez P, Young S, Garber M, Giannoukos G, Lee WL, Russ C, Lander ES, Nusbaum C, Jaffe DB: Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res. 2008, 18: 763-770. 10.1101/gr.070227.107.PubMed CentralView ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.