Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes)
- Anna V Kukekova†1Email author,
- Jennifer L Johnson†1,
- Clotilde Teiling†2,
- Lewyn Li†3,
- Irina N Oskina4,
- Anastasiya V Kharlamova4,
- Rimma G Gulevich4,
- Ravee Padte1,
- Michael M Dubreuil1,
- Anastasiya V Vladimirova4,
- Darya V Shepeleva4,
- Svetlana G Shikhevich4,
- Qi Sun5,
- Lalit Ponnala5,
- Svetlana V Temnykh1,
- Lyudmila N Trut4 and
- Gregory M Acland1
© Kukekova et al; licensee BioMed Central Ltd. 2011
Received: 23 April 2011
Accepted: 3 October 2011
Published: 3 October 2011
Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed.
cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome.
Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information.
Until recently, genome wide sequencing has been cost limited to relatively few species. Newer sequencing technologies, offering rapid high throughput sequencing, reduction of sequencing cost, and complementary software have changed the situation dramatically, making these methods more readily considered as standard molecular tools for biology and medicine [1–9]. Transcriptome sequencing in particular can now be undertaken by individual research groups to generate genomic information for novel or "orphan" species of scientific interest [10–14]. One such species is the red fox (Vulpes vulpes), including its farm bred coat color variant, the silver fox.
The red fox is one of the most widely distributed mammalian species across the globe [15, 16] - inhabiting the Northern Hemisphere naturally and Australia by introduction. Foxes live in a wide range of temperature zones, geographical regions, wild and urban environments [16, 17] and are close relatives of the domestic dog (Canis lupus familiaris). The grey wolf (Canis lupus lupus) and the fox diverged from a common ancestor about 10 million years ago , and the dog was domesticated from the grey wolf at least 15,000 years ago [19–24]. In contrast, the red fox -- or, more precisely, the silver fox -- has only recently been domesticated as part of a deliberate scientific experiment at the Institute of Cytology and Genetics (ICG) of the Russian Academy of Sciences in the second part of the twentieth century [for review [25–28]].
Starting from a farm-bred population of silver foxes, the ICG selectively bred foxes for tameness for over 50 generations [25–27]. Foxes from this tame strain exhibit human-directed behavior resembling that of domestic dogs http://cbsu.tc.cornell.edu/ccgr/behaviour/Index.htm. In parallel, the ICG also undertook a selective breeding program to develop another strain of foxes that are aggressive and difficult to handle. Both strains of foxes are maintained at the ICG as outbred populations. The genetic basis of these tame and aggressive behavioral phenotypes was confirmed in experiments that included cross-fostering of pups, transplantation of embryos and cross-breeding experiments [30, 25, 26, 31, 32]. These fox strains present a robust model, intermediate between rodent and primate in biological complexity, for the study of genetics of affiliative versus aggressive behaviors.
By exploiting genomic resources developed for the domestic dog - the only canid species with a sequenced genome, to date - we recently began molecular genetic studies of these selectively bred tame and aggressive fox strains. To this end, we developed crossbred F1, back-cross and intercross pedigrees segregating behavioral differences derived from the founder strains; a meiotic linkage map of the fox genome based on 385 canine-derived microsatellite markers [33, 34, 32]; and, using these resources, mapped several quantitative trait loci (QTL) in the fox genome associated with behavior .
Although genetic mapping with canine derived microsatellite markers allowed identification of several behavioral loci in fox genome, this type of genetic analysis has inherent limitations in its mapping power. Single nucleotide polymorphism (SNP) markers would be much more powerful for fine mapping these behavioral loci, and identification of haplotypes cosegregating with QTLs that were under selection in the founder strains. Unfortunately, the majority of dog SNPs are not informative in foxes and fox-specific SNPs need to be identified to move this study forward.
To this end we undertook deep transcriptome sequencing of pre-frontal brain samples from one tame and one aggressive fox. Because pre-frontal cortex is a brain area relatively easy to identify, samples from different individuals can be readily collected in a relatively consistent manner. Furthermore, the role of this brain region in aspects of behavior is well established , and thus particularly relevant to the study. Comparison of gene expression profiles between a tame and an aggressive individual offers insights into functional differences in the pre-frontal brain cortex. Comparison of transcriptome sequences of the same genes between the tame and aggressive fox samples, has identified a large set of informative SNP markers and begun a catalogue of gene-specific sequence variants between the two strains.
454 GS FLX Titanium sequencing the transcriptome of the silver fox prefrontal cortex
Silver fox transcriptome: summary statistics of reads and assemblies.
Total number of reads
Number of filtered bases (Gb)
Average (median) read length in bp +/- S.D.
Fully aligned (%)*
Partially aligned (%)*
Number of Isogroups*
Number of Isotigs*
Average Isotig Length
N50 Isotig Length
Number of Contigs* (length >= 100 bp)
Total number of bases in contigs at least 100 bp in length
Too Short* (%)*
376 +/-148 (433)
358 +/-146 (403)
Assembly of fox transcriptome sequencing reads
Fox sequencing reads that passed quality filtering were assembled de novo in three different manners: i) all reads together; ii) reads from the tame sample alone; iii) reads from the aggressive sample alone (see Table 1). Over 70% of reads were fully assembled, and close to a further 10% of reads were partly assembled in both the assembly of all reads together and in each individual assembly run (Table 1).
BLAT of fox contigs to the dog genome
Among all contigs identified in the assembly of all reads together we selected the 96,461 contigs that were at least 100 bp long (Table 1). BLAT of these contigs against the canine genome sequence assembly (CanFam2) successfully located 91,262 contigs. The output file of BLAT results was then visualized on the UCSC Genome browser (Additional file 1, Figure S1). Alignment of fox contigs against the dog genome allowed quick visual examination of the genes represented in the fox transcriptome, revealed gene structure and possible splice variants.
BLASTX of fox isotigs against the SwissProt data base
To estimate the number of genes well represented in this fox transcriptome, all isotigs identified in the assembly of all fox reads together (Table 1) were aligned against the SwissProt database using BLASTX. Of these, 46,776 isotigs (53.52% of the total number of isotigs) yielded a hit to one or more proteins in the database. Only the best hit for each isotig was selected. Successfully aligned isotigs belonged to 21,307 isogroups (35.38% of the total number of isogroups).
In total, 13,624 genes were identified, of which 6,952 (51.03%) were in more than one isogroup. The total number of genes identified in the fox transcriptome was significantly lower than the number of isogroups identified in the assembly of fox reads.
Putative homologs of canine genes in the fox transcriptome
Summary statistics from mapping fox sequencing reads against the Canine RefSeq database.
Total number of reads
Too short to map
Completeness of the fox transcriptome
Identification of fox SNPs
Sequence differences between dog and fox and among fox chromosomes were identified by mapping fox reads against the canine genome sequence assembly, and contigs from the assembly of all fox reads together.
•SNPs identified by mapping fox sequencing reads against the dog genome
Summary statistics from mapping fox sequencing reads against the repeatmasked CanFam2 sequence assembly of the dog genome.
Total number of reads
Too short to map
Comparison of sequences from the four fox chromosomes represented in the present study with the one canine reference chromosome (i.e. CanFam2) identified a total of 991,041 sequence differences, including single nucleotide differences, indels and 2-3 nucleotide differences. For 3,511 of these differences only 1-5% of fox reads were in disagreement with the dog sequence. These latter differences were deemed less likely to be true nucleotide differences and were excluded from further analysis. This left 987,530 differences representing one difference (either between dog and fox or among fox chromosomes) for every 117 bases of dog-fox contigs. Of these 987,530 differences, 898,940 were fox-versus-dog differences, where at least 95% of fox reads were in agreement with each other, and differed from the dog sequence. This represented one difference per 129 mapped bases. An example of one such dog-fox nucleotide difference is presented in Supplementary Figure 2a (Additional file 3). In total, 88,590 differences (1 per 1,309 mapped bases) were identified among fox reads that aligned to the dog genome, an example of which is presented in Supplementary Figure 2b (Additional file 3).
The set of differences where the fox reads differed from each other were filtered to obtain 19,245 high confidence SNPs.
• SNPs identified by mapping fox reads against the assembled fox transcriptome sequence
A total of 96,461 fox contigs that were at least 100 bp long, from the assembly of all reads together (Table 1), were used as a reference for mapping fox reads. In total, 85,064 differences between fox reads were identified. These polymorphisms included SNPs, indels, and 2-3 nucleotide polymorphisms. After applying filtering criteria (see Methods section) 17,328 high-confidence fox SNPs were identified. Contigs containing high-confidence SNPs were selected and localized in the dog genome by BLAT, and the output file was visualized on the UCSC Genomic Browser.
• Non-redundant set of high-confidence fox-specific SNPs
Numbers of SNPs detected, and their informativeness, in tame and aggressive fox samples.
To establish parameters for filtering high-confidence fox SNPs (see Methods section) we selected a subset of SNPs for validation by Sanger sequencing. Three sets of SNPs were selected. The first set included 20 SNPs in which the minor allele was present in at least 25% of the sequencing reads per sample and each allele was present in more than three reads in one individual (Additional file 5, Table S2a); the second set included 17 SNPs in which the minor allele was present in at least 25% of sequencing reads per sample and by exactly three reads in one individual (Additional file 5, Table S2b); the third set included 15 SNPs in which the minor allele was present in at least three reads between the two individuals (Additional file 5, Table S2c). SNPs that mapped to the canine X chromosome and showed heterozygosity in at least one sample were filtered out (except for SNPs that mapped to the pseudoautosomal region of CFAX) and not included in the SNP validation study. Of 51 SNPs, all but one (SNP #3-15) were validated by Sanger sequencing (Additional file 5, Table S2). Allele heterozygosity, identified by Sanger and 454 sequencing for each SNP in each individual, differed for two SNPs in the first dataset (Additional file 5, Table S2a), five SNPs in the second dataset (Additional file 5, Table S2b), and two SNPs in the third data set (Additional file 5, Table S2c). For all these SNPs except SNP #3-6 one of the two individuals was homozygous by 454 sequencing, but heterozygous by Sanger sequencing (Additional file 5 Table S2). The allele identified by Sanger sequencing corresponded to the second allele observed in another individual. The opposite situation was observed for SNP #3-6, with two SNP alleles identified in the tame individual by 454 sequencing but only one by Sanger sequencing (Additional file 5, Table S2c).
All SNPs in the first two data sets were validated by Sanger sequencing, indicating that the selection parameters applied identified high-confidence SNPs reliably. Over 90% of SNPs in the third data set (selected with modestly lower stringency filtering parameters) were also validated. Sanger resequencing of SNPs did find, however, that a second SNP allele was not always identified in the 454 sequencing datasets at the coverage cutoff used for selection of high-confidence SNPs (Additional file 5, Table S2). Indeed, all the minor SNP alleles not identified by 454 sequencing were among those with the lowest read coverage depth (~3-5× per sample, see Additional file 5, Table S2).
Distribution of fox SNPs among different types of genomic regions
Mapping of SNPs, identified from the fox transcriptome, onto the dog genome sequence.
Number of SNPs
Type of genomic region
Comparison of gene expression between the two fox samples
Validation of gene expression by RT-qPCR
Eleven genes that showed expression differences between the two fox samples, in analysis of transcriptome data, were selected for validation by RT-qPCR. Genes were chosen that had more than one exon and did not belong to a highly conserved gene family. Differences in expression of selected genes varied between two samples in terms of number of reads, fold differences, and statistical significance of observed expression differences. Six genes in this set had higher expression levels (more sequencing reads) in the transcriptome of the aggressive fox, versus five in the tame individual. Primers were designed to successfully amplify unique products for nine of these genes, but not successfully for PKHD1L1 and BTNL9 - the latter two genes were thus excluded from further analysis.
RT-qPCR validation of expression differences identified by transcriptome sequence analysis.
Fold difference between two samples
Number of reads in the sample of tame fox
Number of reads in the sample of aggressive fox
Fold difference between two samples
This study was designed to expand genomic resources for the red (silver) fox, an emerging animal model for genetic research whose genome has not yet been sequenced. Using the 454 GS FLX Titanium platform we sequenced non-normalized cDNA libraries from brain samples of two farm-bred foxes with markedly different genetically determined behavioral phenotypes.
Samples for sequencing were selected from prefrontal cortex because this region is easily recognisable in the brain; can thus be collected in a consistent manner; and is related to behavior. The role of different subdivisions of prefrontal cortex in regulating various aspects of cognition, memory, emotions, and social behavior is reviewed in Clark et al., 2010 . In humans, a link between brain damage of prefrontal cortex and aggressive behavior has been reported .
A total of 0.96 Gb of transcriptome sequencing data for the tame fox and 1.21 Gb for the aggressive fox were obtained. The depth of sequencing chosen for this study was based on previous experience sequencing non-normalized cDNA libraries . The fox reads were assembled in three different combinations: i) only reads from the tame fox, ii) only reads from the aggressive fox; iii) all reads together. The percent of reads assigned into contigs in each of the three assemblies was very similar (Table 1).
The assembly software used, the 454 GS De Novo assembler ("Newbler"), takes into account the existence of gene splice variants. Newbler first assembles sets of self-consistent reads into contigs, which are the rough equivalent of exons. It then builds isotigs (the rough equivalent of transcripts) from contigs that are consistently connected by subpopulations of reads. When there are splice variants, contigs can be connected via different paths that are supported by read data, leading to multiple isotigs being generated. Isotigs are grouped into isogroups, which are the rough equivalent of genes. In the context of a transcriptome assembly by Newbler, contigs are the sections of isogroups where corresponding isotig sequences are the same. Consequently, the assembly produces two sets of sequences: a theoretically non-redundant set of contigs, and a set of isotigs which are candidate long- or full-length transcripts from multiple splice variants of genes, and contains some partially redundant sequence segments. Newbler is currently the only transcripome assembler that attempts this task, and is one of the mostly widely used assemblers for de novo assembly of organisms with few genomic recourses .
Comparison of transcriptome assemblies from different studies is inherently problematic because different assembly algorithms handle the complexities of the transciptome in a variety of ways. CAP3, which was used in the assembly of transcriptomes of two mammalian species -- the bank vole (Myodes glareolus) and the domestic ferret (Mustela putorious furo) [14, 10] -- does not split reads into multiple contigs as Newbler does . This makes the number of contigs non-comparable. One measure of comparison is the span of contigs, the total number of bases in the all contigs in an assembly. This number is available for the bank vole, as the number of contigs reported, 63,581, multiplied by the average length of the contigs, 418, for an assembly span of 30.6 Mb. Assembly of the fox transcriptome spans 84.1 Mb. Relative to the bank vole, the fox transcriptome was sequenced to approximately 6.2 times greater depth and the assembly span was higher by 2.7 times.
The number of isogroups, isotigs, and contigs identified in the assembly of all fox reads together was significantly higher than the number of isogroups, isotigs, and contigs identified in the separate assemblies of tame and aggressive reads (Table 1). Singleton reads in the individual assemblies, when assembled together, created new contigs, isotigs and isogroups. Furthermore, sequence variations between individuals increased the number of contigs and isotigs in isogroups, when these variations were sufficiently large for the assembly program to assume them to be different exons, or real differences in splice variants of some genes between the two individuals. The average isotig length and N50 isotig length were also longer in the assembly of all reads together, as would be expected from an increase in the amount of sequencing data (Table 1).
Because the fox genome has not yet been sequenced and a fox RefSeq database is thus not available, we used SwissProt and the dog RefSeq database to identify gene orthologs in the fox transcriptome. BLASTX of fox isotigs from the assembly of all reads together against SwissProt database and mapping of all fox reads against dog RefSeq database identified very similar number of genes (13,624 and 14,418 genes, respectively).
The number of identified genes in the fox transcriptome was significantly lower than the number of isogroups (59,713) found in the assembly of fox reads. The total number of isogroups was also larger than the total number of genes in mammalian genome. One hypothesis is that the amount of sequencing data used in the assembly was not sufficient to cover the full lengths of all fox gene transcripts, therefore some poorly covered genes might have been partitioned into two or more isogroups. Consistent with this hypothesis, BLASTX of fox isotigs against SwissProt identified 13,624 genes, of which 6,952 (51.03%) were found in more than one isogroup (see Results section). In addition, a subset of fox reads (Table 5) was mapped to non-gene regions in the dog genome. These regions are expected not to be present in the dog RefSeq database. We expect that in the fox transcriptome these regions will each be represented by an isogroup with only one isotig assigned to it.
Also, not all genes that are transcribed are described in the databases we used. The genes not in public databases are expected to be genes which would have the more rare, and less covered sequences, and subsequently less complex isogroups. This low level of expression would imply that they would have been less likely to be found and described previously. This may explain why only 28.31% (14,347) of all isogroups with a single isotig were successfully aligned using BLASTX. At the same time, 77.08% (6,960) of isogroups containing more than one isotig (9,030) yielded a hit to the SwissProt database.
The completeness of the fox transcriptome was tested by mapping fox reads combined variously into datasets of different sizes against the dog RefSeq database (Figure 2). One striking result of this experiment was the identification of a very similar number of dog gene orthologs in transcriptomes from tame and aggressive samples (13,618 and 13,855 genes, respectively). The total number of genes identified in both transcriptomes was only approximately 5% larger than a number of genes identified in individual samples (Figure 2). These data demonstrate that, without increasing the depth of sequencing by at least an order of magnitude, it is unlikely that a significantly higher number of rare transcripts would be identified in these transcriptome libraries. The number of genes identified, in this study, from fox pre-frontal brain transcriptome is similar to that identified from mouse cortex (14,787 genes at day E18 and 15,423 genes at day P7, respectively) .
For each accession number in the dog RefSeq database to which at least one fox read mapped, we estimated the breadth of coverage of dog transcripts by fox reads. Figure 3 demonstrates the bivariate coverage of identified transcripts: roughly 75% of dog transcripts had less than 90% coverage and ~25% of dog transcripts had coverage over 90%. We examined the depth of coverage of dog transcripts with relatively low coverage. The slight rise in the middle of the graph, from around 40-60% of coverage contains, as expected, mainly transcripts with a low depth of coverage, but it also contains a significant portion of the transcripts, responsible for the rise above a simple fit curve, that have a large number of fox reads mapped to them. It suggests that in many cases it is not the depth of coverage that is limiting the breadth of coverage. The transcripts in the canine RefSeq database can originate from different tissues and it is likely that for some genes, brain transcripts only partially overlap with transcripts from other tissues. Possible sequence differences between canine and fox transcripts would also reduce the coverage. The breadth of coverage of dog transcripts mapped by reads from two individual samples or from both samples together was almost identical. These data again suggest that a moderate further increase in the amount of sequencing data is unlikely to increase the breadth of coverage of dog transcripts.
The dog is the closest relative of the fox with a sequenced genome . The dog and fox, however, have very different karyotypes, the dog having 78 mostly acrocentric chromosomes, whereas the red fox has 34 metacentric chromosomes [42, 43]. The cytological relationship between the dog and fox genomes is well understood: cytogenetic methods [44–47] and alignment of the fox meiotic map against of the dog genome  have demonstrated that most fox chromosomes correspond to two or three canine chromosomes. The canine genome sequence assembly CanFam2 was used to localize fox sequencing reads and contigs in the dog genome. The approximate location of corresponding regions in the fox genome could be then inferred using the comparative dog-fox genetic and cytogenetic maps.
Mapping fox reads to the canine genome allowed the sequence divergence between the dog and fox genomes to be estimated. One sequence difference (single nucleotide difference, small indel, or difference in 2-3 adjacent nucleotides) between fox and dog was identified per 129 mapped bases. The number of differences between fox chromosomes was one difference per 1,309 mapped bases. The level of divergence between fox and dog coding regions is comparable with the level of divergence between sheep and cattle .
Mapping fox reads against both the canine genome and the assembled fox transcriptome sequence identified 30,491 high-confidence fox-specific SNPs. The filtering criteria were established by Sanger resequencing of SNPs with different numbers of reads per allele. The number of SNPs identified in this study significantly exceeded those identified in the bank vole transcriptome (19,114 SNPs), although filtering parameters in our study (minor allele present in at least three reads in one individual) significantly exceed the filtering parameters used in the vole transcriptome study (each allele present in at least two reads) . These comparisons demonstrate that a greater depth of transcriptome sequencing enables the discovery of a large number of SNP's with higher confidence.
Our filtering parameters yielded 100% confirmation of the polymorphisms in the validation study (Additional file 5, Table S2a,b) but differences in individual genotype (homozygosity vs heterozygosity) identified by 454 sequencing were different from Sanger sequencing in some cases. To validate SNPs by Sanger sequencing, we used cDNA prepared from exactly the same RNA samples which were used for 454 sequencing (Methods section; SNP validation). Therefore, we expect allele specific bias to be equal in the samples used in both 454 and Sanger sequencing experiments. All SNPs for which the second allele in the same sample was not identified by 454 sequencing were among SNPs with the lowest read coverage depth.
Over 90% of SNPs from those with the lowest sequence coverage included in the validation study (minor allele present in at least three reads between the individuals) were also validated by Sanger sequencing. This filter parameter is still stricter than that in the bank vole study . Therefore, for a small risk of false positives, we can also use some of these lower confidence SNP's to increase the total set of SNPs by about 50% to around 45,000 SNPs. Taking into account the significant cost difference between 454 and Sanger technologies, our data suggest that, at the small cost of obtaining a relatively small number of false polymorphisms, SNPs from this category would be worth including in a SNP array. A small number of false SNPs can be easily detected during post-processing analysis of genotyping calls and filtered out prior to further analysis. For regions in the fox genome associated with traits of interest it would also be useful to invest in validation of SNPs represented by an even lower number of reads. Although the false positive rate for these SNPs is expected to be higher, validation experiments can still provide a significant number of SNPs for these critical regions. The 454 technology has been successfully used for developing SNP resources for multiple organisms with limited genomic resources [49–51].
Initially, a large number of SNP's that mapped to the canine X chromosome were heterozygous in at least one male individual. To test whether this represented errors in our data or in the assembly of the canine X chromosome, the contigs bearing these SNPs were mapped onto the Human X chromosome assembly. All fox contigs containing SNPs that mapped to both the human and the dog X chromosome outside of the recognized pseudoautosomal region were homozygous for each individual fox. To gain a better understanding of the high heterozygosity of SNPs assigned to CFAX, we mapped approximately 100 bp region surrounding 50 randomly selected SNPs that were reported as heterozygous on the X back to the dog genome. We found that, in 34% of cases the area surrounding the SNP could not be located unambiguously to the X, or to a single area on the X; 50% of SNPs had high homology to the X but also had significant homology with another area of the genome. This casts doubt on the X chromosome SNPs showing heterozygosity in our male individuals. We subsequently filtered out all SNPs on the CFAX that were not in the pseudoautosomal region and were heterozygous in either individual.
Of all the fox high-confidence SNPs identified 97.8% were localised in the dog genome, and the inferred location in the fox genome was identified for 96.3% (Additional file 4, Figure S3; Figure 4). Heterozygosity of SNPs in the two fox samples was compared and several continuous regions along fox chromosomes where SNPs were fixed for opposite alleles between tame and aggressive sample were revealed. These regions warrant investigation in a larger number of samples from tame and aggressive fox strains for evaluation of putative selection sweeps. One extended region of reduced heterozygosity in the tame sample was observed in a region of VVU12 corresponding to part of a previously identified QTL interval . Overall, the number of high confidence SNPs and their locations on fox chromosomes indicated that now we have a set of SNPs sufficient for genome wide mapping in these populations.
Detailed evaluation of the location in the dog genome of a randomly selected set of 100 fox SNPs revealed that approximately 77% of SNPs are located in transcribed regions and distributed approximately equally between coding regions and UTRs. Approximately 9% of SNPs are located in introns and 14% in non-coding regions. Similar results were also identified in the bank vole transcriptome study . As has become apparent recently, the mammalian transcriptome is more complex than previously recognized [52–54].
Comparison of gene expression between the two fox samples revealed only a relatively small number of genes whose expression level differed by two-fold or more (355 genes p < 0.05). The biofunction groups over-represented in the set of differentially expressed genes differed between the two samples. Genes associated with neurological diseases were over represented in the tame sample. Interestingly, the second biofunction group over-represented in the aggressive sample represents genes associated with cardiovascular diseases. The links between the cardiovascular system and behavior have been increasingly recognized recently [55–58].
To validate differences in gene expression that were identified by transcriptome sequencing, we selected a subset of nine genes for expression analysis by RT-qPCR. Exactly the same two RNA samples (one from a tame and one from an aggressive fox) which had been used for transcriptome sequencing were used for RT-qPCR validation experiments. Eight of nine differentially expressed genes selected for the validation study showed differences in expression in the same direction by both methods. All genes that were validated by RT-qPCR showed differential expression between the two samples in transcriptome analysis with p < 0.04. Although the consistent validation by RT-qPCR of results obtained by transcriptome analysis increases confidence in the observed differences in expression between the two samples, it will clearly be necessary to test these observed differences in a larger set of samples before coming to conclusions regarding biological relevance. Among the genes that were validated by RT-qPCR, HTR2C is particularly noteworthy. HTR2C plays an important role in serotonergic and dopaminergic signalling [59–61]; and is differentially expressed in specific brain regions of two strains of rats - one of which was selected for and the other against aggressive behavior . Increased level of HTR2C mRNA in the frontal cortex and hippocampus of tame rats was observed . Notably, the direction of difference in HTR2C expression between the tame and aggressive fox individuals in the present study was consistent with that between rat strains in the Popova et al. study. None of the genes reported in other studies [63–65] investigating comparative gene expression in canid brain samples were significantly different between the two fox samples in the present study.
Further resequencing and deeper analysis of brain transcriptome, including identification of non-synonymous mutations, comparison of splice variants between tame and aggressive samples, and evaluation of differentially expressed genes in larger independent sample sets (biological replicates) will be necessary to characterize critical differences in these fox strains associated with markedly different behavioral phenotypes. In any experimental system there will be a certain amount of noise that might stem from a variety of sources. These biological replicates would let us explore the differences in the underlying biological system we are interested in and validate the differences in gene expression.
Using new sequencing technologies it is now possible to develop genomic resources for species without a sequenced genome. That dramatically facilitates the study of interesting but previously inaccessible biological phenomena and helps to establish new animal models for genetic research. Overall, transcriptome sequencing from the prefrontal cortex of two fox individuals identified over 30,000 high-confidence fox-specific SNPs, fox orthologs of over 14,000 dog genes, and yielded new insights into potentially important differences in expression of genes in the pre-frontal cortex between tame and aggressive foxes.
Fox samples were collected at the experimental farm of the Institute of Cytology and Genetics in Novosibirsk, Russia. All animal procedures complied with standards for humane care and use of laboratory animals by foreign Institutions. Samples of pre-frontal cortex were collected into RNAlater (PE Biosystems, Foster City, CA) and stored according to the manufacturer's protocol, from two 7 month old male foxes (one from the tame and one from the aggressive population) and subsequently used for transcriptome sequencing.
cDNA library preparation and sequencing
Total RNA was extracted using TRIZOL reagent (Invitrogen, Carlsbad, CA). cDNA libraries were prepared at Roche Life Sciences facility using Roche cDNA Rapid Library Preparation protocol (Method Manual, Roche, 2009) cDNA sequencing experiments require the conversion of mRNA into cDNA prior to preparation with the GS Rapid Library Prep Kit. Utilizing the cDNA Synthesis System Kit from Roche, transcription of RNA into cDNA is performed using 200 ng of RNA as starting material (OD 260/280 ≥ 1.8). mRNA is isolated and then treated with zinc chloride to fragment the mRNA into the desired 450 bp range. The sequencing read length is determined by the fragmentation of the RNA. RNA size varies.
Random hexamer primers were used, comprising single-stranded DNA containing every possible 6-base combination, to enable hybridization anywhere on the RNA. Once randomly fragmented, cDNA first-strand synthesis is primed utilizing the random hexamers, which diminishes the priming of the 3' poly(A) tail; it is then followed by second-strand synthesis. From this hybridization, reverse transcriptase utilizes the double-stranded sequence as a primer to start translation. The double-stranded cDNA is used directly as starting material for rapid library preparation, yielding blunt-ended cDNA with the addition of the overhanging A. The resulting cDNA fragments are then polished and prepared for 454 Adaptor ligation. Once ligated, the DNA is ready for emulsion PCR (emPCR TM). 454 pyrosequencing is then performed using standard GS FLX Titanium series reagent kits.
454 GS FLX Titanium sequencing
Amplified products were sequenced at Roche Life Sciences facility using the Roche 454 FLX Titanium platform. Samples were sequenced in parallel in 6 runs. Data for 5 half full runs for tame individual and 6 half full runs for aggressive individual were obtained.
Post-processing of sequencing reads
Image analysis, signal processing and data filtering were performed using the standard 454 Software with default settings. 454 reads were first quality filtered with standard parameters and raw reads were cleaned from adaptor sequences. Files containing 454 reads and their quality scores are available from the National Center for Biotechnology Information (NCBI) Sequence Read Archive [SRA029285.1]. All fox reads that passed quality filtering were retained for subsequent analysis (Table 1).
De novo assembly of fox reads
Fox sequencing reads were assembled using the 454 GS De Novo Assembler software (March 2010, R&D release) with the transcriptome (-cdna) option activated. The reads were assembled in three combinations, resulting in three separate assemblies of fox transcriptome: i) assembly of all sequencing reads from both samples; ii) assembly of sequencing reads from the sample of tame fox; iii) assembly of sequencing reads from the sample of aggressive fox. All three assemblies were generated using the same parameters: minimum overlap length of 40 bp, minimum identity of 90%, with cDNA mode set to true. A trimming FASTA file, which includes polyA sequences of 5 bp, 10 bp and 20 bp in length, was used in conjunction with the vector-trimming option (-vt) of the assembler software, in order to trim off the polyA tails among the sequencing reads prior to assembly.
For each assembly, GS De Novo Assembler generated a set of output files including: i) "454NewblerMetrics.txt", which reports numerous overall assembly metrics; ii) "454AllContigs.fna", which contains the nucleotide sequences of the assembled contigs generated by the assembly process; iii) "454Isotigs.fna", which contains the nucleotide sequences of the assembled isotigs and their corresponding isogroups, as generated by the assembly process.
Reads were analyzed into contigs and singletons, to generate sets of isotigs, and to classify isotigs into isogroups. In brief, a contig corresponds to the consensus sequence of a self-consistent set of overlapping reads, as determined by the algorithmic steps of the 454 GS De Novo Assembler software, which include pairwise overlap, multiple sequence alignment, detangling and basecalling. An isogroup is a collection of contigs containing reads that imply connections between them. An isotig is meant to be analogous to an individual transcript. Different isotigs within a given isogroup can be inferred to be splice-variants. The reported isotigs are the putative transcripts that can be constructed using overlapping reads provided as input to the assembler.
BLAT of fox contigs to the dog genome
All contigs at least 100 bp long were located on the dog genome (CanFam2) by BLAT and the result yielding the best score was kept for each. The results (.psl files) from using BLAT to locate fox contigs on the dog genome, and the output file from a similar BLAT run using only contigs with sequence differences between foxes, were uploaded to the web as custom tracks on the UCSC genome browser to visualize their locations on the dog genome. To identify fox X-chromosome sequences, all contigs at least 100 bp long were also mapped to the human genome by BLAT: those that mapped over more than 50% of their length to the human X, and also mapped to the dog X were scored as located on the X chromosome with high confidence.
BLASTX of fox isotigs to the protein database
BLASTX was run, comparing the isotigs.fna file to the SwissProt database. An e cutoff of 1e-05 was used. When more then one entry in the SwissProt database was hit, the one with the best e-value was used to assign isotigs to genes. In the case of a tie the one listed first in the file was used.
Mapping sequencing reads to the canine RefSeq database
Cleaned sequencing reads from the tame and aggressive individuals were mapped in separate runs to the canine RefSeq database using 454 GS Reference Mapper (Version 2.3), with a threshold of 95% identity over 90% of length, using the cDNA mode. For each mapping, GS Reference Mapper generated a series of output files including: i) "454RefStatus.txt", which reports the statistical information on the number of reads mapping to each reference sequence; ii) "454GeneStatus.txt", which reports statistics on the number of reads mapped exclusively to each gene; iii) "454AllDiffs.txt", which contains the list of variations (of at least two reads) relative to the reference sequence or to other reads aligned at a specific location; iv) "454HCDiffs.txt", which contains the list of high confidence variations relative to the reference sequence or to other reads aligned at a specific location. The canine RefSeq database was downloaded from NCBI http://www.ncbi.nlm.nih.gov/nuccore/limits on June 15, 2010, using the search parameters: Organism=Canis lupus familiaris, Database = RefSeq, Molecule=mRNA. 33,476 sequences representing 19,859 genes were downloaded. Because of redundancy in the RefSeq data set, caused by splice variants of genes having multiple accession numbers, many of the fox reads mapped to multiple references and were not reported in the 454RefStatus.txt file. To solve this problem, a rename file was created to assign multiple dog splice variants to the same gene using the -accno option of GS Reference Mapper. The 454GeneStatus.txt file was then used to determine the number of fox sequencing reads corresponding to each gene. To estimate the total percentage of genes from the dog genome in our data set, we calculated the percentage of genes from the canine RefSeq database that were represented by at least one sequence in our set.
Breadth of coverage of dog transcripts by fox reads was calculated as percent of homologous dog transcript covered by at least one uniquely mapped read as reported in the file 454RefStatus.txt.
Mapping fox sequencing reads against the canine genome sequence assembly
Fox sequencing reads were mapped to the repeat-masked, annotated canine genome sequence assembly (CanFam2, UCSC) with GS Reference Mapper (Version 2.3) at a threshold of 95% identity over 90% of length, using cDNA mode. Information about single nucleotide differences, indels, and sequence differences involving two or three adjacent nucleotides was extracted from GS Reference Mapper file "454AllDiffs.txt". To estimate the divergence of the dog from the fox, the number of nucleotide differences reported in the file "454AllDiffs.txt" and identified between dog and fox for which fox reads did not differ from each other, was compared to the number of nucleotide differences observed between fox reads.
Fox-specific nucleotide differences were extracted from the output file "454HCDiffs.txt" and filtered to create a set of higher confidence differences. Retained SNPs were selected by the following criteria: the minor allele had to be present in at least 25% of the sequencing reads per sample, each SNP allele had to be present in at least three reads per allele for an individual, and SNPs assigned to CFAX outside of pseudoautosomal region were homozygous for each individual.
Identification of fox SNPs using the fox assembly as reference
The set of contigs over 100 bp from the assembly of all reads (All together) was used as a reference to map all sequencing reads using GS Reference Mapper (Version 2.3) with default parameters using cDNA mode. The 454HCDiffs.txt file was used to find sequence differences between and within foxes: the information about single nucleotide differences, indels and 2-3 nucleotide polymorphisms (two or three adjacent nucleotides that differ from the reference sequence) was extracted from "454HCDiffs.txt" file. Single nucleotide differences were filtered to remove (i) those for which the minor allele in an individual accounted for less than 25% of the sequencing reads and (ii) those with less than three reads for an allele for an individual, (iii) those that were not SNPs, (iv) SNP's assigned to the canine X chromosome were further filtered to remove SNPs which had apparent heterozygosity in either male individual, and assigned to the part of CFAX outside of the recognized pseudoautosomal region. The list of differences was also classified according to heterozygosity/homozygosity in each individual. The contigs were assigned to the dog genome by BLAT and the results were used to locate the fox SNPs. This set of SNPs and the set of SNPs identified by mapping fox reads to the dog genome were combined and the duplicates between the sets were removed. The inferred location of these SNPs on fox autosomes was assigned using the comparative fox/dog map.
To establish parameters for selecting high-confidence SNPs identified by both methods - (i) mapping fox reads against the dog genome; and (ii) mapping fox reads against fox contigs - we selected a subset of SNPs for validation by Sanger sequencing. Three sets of SNPs for validation experiments were generated using the following parameters: 1) the minor allele had to be represented in at least 25% of the sequencing reads per sample and each SNP allele had to be represented in more then three reads per allele for an individual; 2) the minor allele had to be represented in at least 25% of the sequencing reads per sample and each SNP allele in at exactly three reads per allele for an individual; 3) the minor allele had to be represented in at least three reads between two individuals.
All sequencing reads corresponding to selected SNPs were extracted and aligned manually using Sequencher® 4.2.2 Software (Gene Codes Corporation, Ann Arbor, MI). Assembled contigs were used for primer design. Primers were designed using Primer3 to amplify across the SNPs (Additional file 6, Table S3).
PCRs were performed using fox cDNA as a template. Same RNA samples which were used for 454 sequencing were converted into cDNA and used in SNP validation study. Fox cDNA was synthesized as described in the section "Real-Time PCR". PCRs were performed using 0.5 μl of cDNA from the standard cDNA reaction recommended by Thermoscript kit (Invitrogen, Carlsbad, CA). PCR reactions were performed in 25 μl containing 10mM of each primer and 1× GoTAQ (Promega, Madison, WI) master mix. Standard PCR conditions were used: an initial 2 min denaturation at 96°C; then 30 cycles of 96°C (20 seconds), 58°C (20 seconds), 72°C (20 seconds); and a final extension step at 72°C for 10 minutes. PCR fragments were purified using Qiagen PCR purification kit (Qiagen, Valencia, CA) and sequenced from both directions. Sequencing was performed on an ABI3730 Genetic Analyzer (PE Biosystems, Foster City, CA) at Cornell sequencing facility. Sequencing reads were assembled and compared to the 454 reads using Sequencher®.
Analysis of SNP distribution among coding and non-coding regions
100 SNPs were chosen at random to estimate the overall percentage of SNPs in coding regions. Each of the 31,264 SNPs from the combined unique set were assigned a random number using the random function of FileMakerPro, version 7 (FileMaker Inc., Santa Clara, CA), and the 100 with the lowest number were chosen for further characterization. A short sequence (30-62 bp) surrounding the polymorphism was localized to the dog genome using BLAT, and the "RefSeq Genes" track and the "Non-Dog RefSeq Genes track" were used to judge if the location of the polymorphism was located in a coding region, in an intron that is between coding exons, in an area where the gene untranslated region (utr) is likely to be (either in a known utr, or in the boundaries of utr's from other species), in inter-gene space, or non-locatable.
Comparison of gene expression between two fox samples using transcriptome sequencing
To estimate the difference in gene expression between the two fox samples, the number of hits to a given RefSeq gene for each individual was first normalized to the total number of reads for the same individual, to thus reduce possible bias introduced by an overall greater number of total reads for one individual versus the other. To ensure that this normalization did not inappropriately distort the data, MA plots  were examined pre and post normalization to compare the patterns of data distribution (Additional file 7, Figure S4). The difference in this normalized count between individuals, was used as an estimate of expression difference between the individuals for the relevant gene. A Z-test was run on the data and a two-tailed p-value was computed for each gene. The Z-score (λ-value in ) was computed using test statistics defined by equation (2) in Reinartz et al. . The p-value (two-sided) was calculated in Microsoft Excel using the formula: p = 1-|(NORMDIST(λ)-NORMDIST(-λ))|.
A subset of genes with differences in expression between the two samples was selected for quantitative Real-Time PCR (RT-qPCR) validation. Sequencing reads corresponding to each gene were extracted and assembled manually using Sequencher® 4.2.2 Software (Gene Codes Corporation, Ann Arbor, MI). Assembled contigs were aligned against the canine genome (CanFam2) using BLAT (UCSC, CA). Intron-exon boundaries within fox contigs were identified by comparison of fox contigs to the corresponding canine genes. Primers were designed to amplify across intron-exon boundaries to minimize amplification of gDNA (Additional file 8, Table S4), using Primer3 (Broad Institute, Boston, MS) with parameters optimized for RT-qPCR conditions (primer size: min 18 bp, opt 20 bp, max 23 bp; primer Tm: min 58°C, opt 60°C, max 62°C; product size: 75-150 bp).
Fox cDNA was prepared using Thermoscript kit (Invitrogen, Carlsbad, CA). Aliquots of the same RNA samples that had been previously prepared for the transcriptome sequencing experiment were used for RT-qPCR experiments and SNP validation study. cDNA was synthesized using total RNA (1 mkg per reaction) and oligo dT primer according to the manufacture protocol (Invitrogen, Carlsbad, CA). RT-qPCR was performed using SYBR Green Fast (PE Biosystems, Foster City, CA) on an ABI machine 7500 Fast (PE Biosystems, Foster City, CA) following the manufacturer's instructions. RT-qPCR reactions were performed in 20 μl volumes containing gene-specific primer pairs (0.15 or 0.3uM), 2μl of cDNA and 10 μl of SYBR Green Fast (PE Biosystems, Foster City, CA). Reactions were performed in triplicate. Primer concentration was optimized for each primer pair. Fox gDNA and water were used as controls. All reactions were performed under standardized conditions: an initial denaturation at 95° (20 seconds); then 40 cycles of 95° (3 seconds), 60° (30 seconds); following by dissociation step: 95° (15 seconds), 60° (20 seconds), 95° (15 seconds), 60° (15 seconds). Results were analyzed using SDS1.4 software (PE Biosystems, Foster City, CA).
Relative quantification was performed using HPRT1 gene as an endogenous control. A genome-wide panel of canine reference genes has been evaluated by Brinkhof et al., (2006) . This study advocated using ribosomal protein S19 (RPS19), ribosomal protein S5 (RPS5), b-2-microglobulin (B2M), and hypoxanthine phosphoribosyltransferase (HPRT) as reference genes for RT-qPCR. HPRT has been used as a reference gene for RT-qPCR in multiple canine studies [[70–74], and others ]. HPRT was also found to be one of the most stable reference genes in the cerebral cortex in rats . HPRT1 primers were modified from dog-specific primers  using fox sequence (Additional file 8, Table S4). RT-qPCR reactions for target genes and HPRT1 were performed in different tubes. RT-qPCR for each tested gene and HPRT1 were run simultaneously on the same plate. Relative gene expression levels were calculated using 2-ddCt method .
Kukekova and Johnson;
Teiling and Li.
We are grateful to Irina V. Pivovarova, Tatyana I. Semenova, Vasiliy V. Ivaykin, Vera I. Vladimirova, Tatyana V. Konovalova, Vera L. Haustova, and all the animal keepers at the ICG experimental farm for research assistance. We want to thank Dr. Gordon K. Lark and Kevin Chase for insightful discussions, Donald Miller for advising on RT-qPCR experimental design, Laura Goodman for help with RT-qPCR data analysis, and three reviewers for their comments. Research was supported by MH077811, EY006855, NIH FIRCA R03TW008098, GM082910, the Foundation Fighting Blindness, and grants # 05-04-4837 and # 06-04-48142 from the Russian Fund for Basic Research, Program of the Russian Academy of Sciences: "Biodiversity and Genome Dynamics".
- Murchison EP, Tovar C, Hsu A, Bender HS, Kheradpour P, Rebbeck CA, Obendorf D, Conlan C, Bahlo M, Blizzard CA, Pyecroft S, Kreiss A, Kellis M, Stark A, Harkins TT, Marshall Graves JA, Woods GM, Hannon GJ, Papenfuss AT: The Tasmanian devil transcriptome reveals Schwann cell origins of a clonally transmissible cancer. Science. 2010, 327: 84-7. 10.1126/science.1180616.PubMed CentralPubMedView Article
- Gregg C, Zhang J, Butler JE, Haig D, Dulac C: Sex-specific parent-of-origin allelic expression in the mouse brain. Science. 2010, 329: 682-5. 10.1126/science.1190831.PubMed CentralPubMedView Article
- Gregg C, Zhang J, Weissbourd B, Luo S, Schroth GP, Haig D, Dulac C: High-resolution analysis of parent-of-origin allelic expression in the mouse brain. Science. 2010, 329: 643-8. 10.1126/science.1190830.PubMed CentralPubMedView Article
- Lupski JR, Reid JG, Gonzaga-Jauregui C, Rio Deiros D, Chen DC, Nazareth L, Bainbridge M, Dinh H, Jing C, Wheeler DA, McGuire AL, Zhang F, Stankiewicz P, Halperin JJ, Yang C, Gehman C, Guo D, Irikat RK, Tom W, Fantin NJ, Muzny DM, Gibbs RA: Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. N Engl J Med. 2010, 362: 1181-91. 10.1056/NEJMoa0908094.PubMed CentralPubMedView Article
- Worthey EA, Mayer AN, Syverson GD, Helbling D, Bonacci BB, Decker B, Serpe JM, Dasu T, Tschannen MR, Veith RL, Basehore MJ, Broeckel U, Tomita-Mitchell A, Arca MJ, Casper JT, Margolis DA, Bick DP, Hessner MJ, Routes JM, Verbsky JW, Jacob HJ, Dimmock DP: Making a definitive diagnosis: Successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet Med. 2010,
- Ng SB, Bigham AW, Buckingham KJ, Hannibal MC, McMillin MJ, Gildersleeve HI, Beck AE, Tabor HK, Cooper GM, Mefford HC, Lee C, Turner EH, Smith JD, Rieder MJ, Yoshiura K, Matsumoto N, Ohta T, Niikawa N, Nickerson DA, Bamshad MJ, Shendure J: Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat Genet. 2010, 42: 790-3. 10.1038/ng.646.PubMed CentralPubMedView Article
- Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, Shendure J, Bamshad MJ: Exome sequencing identifies the cause of a mendelian disorder. Nat Genet. 2010, 42: 30-5. 10.1038/ng.499.PubMed CentralPubMedView Article
- Twine NA, Janitz K, Wilkins MR, Janitz M: Whole transcriptome sequencing reveals gene expression and splicing differences in brain regions affected by Alzheimer's disease. PLoS One. 2011, 6 (1): e16266-10.1371/journal.pone.0016266.PubMed CentralPubMedView Article
- Ku CS, Naidoo N, Pawitan Y: Revisiting Mendelian disorders through exome sequencing. Hum Genet. 2011, 129: 351-70. 10.1007/s00439-011-0964-2.PubMedView Article
- Bruder CE, Yao S, Larson F, Camp JV, Tapp R, McBrayer A, Powers N, Granda WV, Jonsson CB: Transcriptome sequencing and development of an expression microarray platform for the domestic ferret. BMC Genomics. 2010, 11: 251-10.1186/1471-2164-11-251.PubMed CentralPubMedView Article
- Wall CE, Cozza S, Riquelme CA, McCombie WR, Heimiller JK, Marr TG, Leinwand LA: Whole transcriptome analysis of the fasting and fed Burmese python heart: insights into extreme physiologic cardiac adaptation. Physiol Genomics. 2010,
- Schwartz TS, Tae H, Yang Y, Mockaitis K, Vanhemert JL, Proulx SR, Choi JH, Bronikowski AM: A garter snake transcriptome: pyrosequencing, de novo assembly, and sex-specific differences. BMC Genomics. 2010, 11: 694-10.1186/1471-2164-11-694.PubMed CentralPubMedView Article
- Salem M, Rexroad CE, Wang J, Thorgaard GH, Yao J: Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches. BMC Genomics. 2010, 11: 564-10.1186/1471-2164-11-564.PubMed CentralPubMedView Article
- Babik W, Stuglik M, Qi W, Kuenzli M, Kuduk K, Koteja P, Radwan J: Heart transcriptome of the bank vole (Myodes glareolus): towards understanding the evolutionary variation in metabolic rate. BMC Genomics. 2010, 11: 390-10.1186/1471-2164-11-390.PubMed CentralPubMedView Article
- Sheldon JW: Wild dogs: The natural history of the nondomestic Canidae. 1992, San Diego, Academic Press
- Macdonald DW, Reynolds JC: Red Fox. Canids: Foxes, Wolves, Jackals and Dogs. Edited by: Sillero-Zubiri C, Hoffmann M, Macdonald DW. 2004, Cambridge, IUCN, 129-136.
- Doncaster CP, Maconald DW: Drifting territoriality in the red fox Vulpes vulpes. Journal of Animal Ecology. 1991, 60: 423-439. 10.2307/5288.View Article
- Wayne RK, Geffen E, Girman DJ, Koepfli KP, Lau LM, Marshall CR: Molecular systematics of the Canidae. Syst Biol. 1997, 46: 622-53. 10.1093/sysbio/46.4.622.PubMedView Article
- Nobis G: Der älteste Haushund lebte vor 14,000 Jahren. Umschau. 1979, 19: 610-
- Sablin MV, Khlopachev G: The earliest ice age dogs: Evidence from Eliseevichi 1. Curr Anthropol. 2002, 43: 795-10.1086/344372.View Article
- Skoglund P, Götherström A, Jakobsson M: Estimation of population divergence times from non-overlapping genomic sequences: examples from dogs and wolves. Mol Biol Evol. 2011, 28: 1505-17. 10.1093/molbev/msq342.PubMedView Article
- vonHoldt BM, Pollinger JP, Lohmueller KE, Han E, Parker HG, Quignon P, Degenhardt JD, Boyko AR, Earl DA, Auton A, Reynolds A, Bryc K, Brisbin A, Knowles JC, Mosher DS, Spady TC, Elkahloun A, Geffen E, Pilot M, Jedrzejewski W, Greco C, Randi E, Bannasch D, Wilton A, Shearman J, Musiani M, Cargill M, Jones PG, Qian Z, Huang W, et al: Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature. 2010, 464: 898-902. 10.1038/nature08837.PubMed CentralPubMedView Article
- Verginelli F, Capelli C, Coia V, Musiani M, Falchetti M, Ottini L, Palmirotta R, Tagliacozzo A, De Grossi Mazzorin I, Mariani-Costantini R: Mitochondrial DNA from prehistoric canids highlights relationships between dogs and South-East European wolves. Mol Biol Evol. 2005, 22: 2541-51. 10.1093/molbev/msi248.PubMedView Article
- Wayne RK, Ostrander EA: Origin, genetic diversity, and genome structure of the domestic dog. Bioessays. 1999, 21 (3): 247-57. 10.1002/(SICI)1521-1878(199903)21:3<247::AID-BIES9>3.0.CO;2-Z.PubMedView Article
- Trut LN: Early Canid domestication: The Farm Fox Experiment. American Scientist. 1999, 87: 160-169.View Article
- Trut LN: Experimental Studies of Early Canid Domestication. The Genetics of the Dog. Edited by: Ruvinsky A, Sampson J. 2001, London, CABI, 15-43.View Article
- Trut L, Oskina I, Kharlamova A: Animal evolution during domestication: the domesticated fox as a model. Bioessays. 2009, 31: 349-60. 10.1002/bies.200800070.PubMed CentralPubMedView Article
- Statham MJ, Trut LN, Sacks BN, Kharlamova AV, Oskina IN, Gulevich RG, Johnson JL, Temnykh SV, Acland GM, Kukekova AV: On the origin of a domesticated species: identifying the parent population of Russian silver foxes (Vulpes vulpes). Biol J Linn Soc Lond. 2011, 103: 168-175. 10.1111/j.1095-8312.2011.01629.x.PubMed CentralPubMedView Article
- Study of the Molecular Basis of Tame and Aggressive Behavior in the Silver Fox. [http://cbsu.tc.cornell.edu/ccgr/behaviour/Fox_Behavior.htm]
- Trut LN: The Genetics and Phenogenetics of Domestic Behaviour. Problems in General Genetics (Proceeding of the XIV International Congress of Genetics), Moscow. 1980, 2: 123-136. book 2
- Kukekova AV, Trut LN, Chase K, Shepeleva DV, Vladimirova AV, Kharlamova AV, Oskina IN, Stepika A, Klebanov S, Erb HN, Acland GM: Measurement of segregating behaviors in experimental silver fox pedigrees. Behavior Genetics. 2008, 38: 185-94. 10.1007/s10519-007-9180-1.PubMed CentralPubMedView Article
- Kukekova AV, Trut LN, Chase K, Kharlamova AV, Johnson JL, Temnykh SV, Oskina IN, Gulevich RG, Vladimirova AV, Klebanov S, Shepeleva DV, Shikhevich SG, Acland GM, Lark KG: Mapping loci for fox domestication: deconstruction/reconstruction of a behavioral phenotype. Behavior Genetics. 2011, 41: 593-606. 10.1007/s10519-010-9418-1.PubMed CentralPubMedView Article
- Kukekova AV, Trut LN, Oskina IN, Kharlamova AV, Shikhevich SG, Kirkness EF, Aguirre GD, Acland GM: A marker set for construction of a genetic map of the silver fox (Vulpes vulpes). Journal of Heredity. 2004, 95: 185-94. 10.1093/jhered/esh033.PubMedView Article
- Kukekova AV, Trut LN, Oskina IN, Johnson JL, Temnykh SV, Kharlamova AV, Shepeleva DV, Gulievich RG, Shikhevich SG, Graphodatsky AS, Aguirre GD, Acland GM: A Meiotic Linkage Map of the Silver Fox, Aligned and Compared to the Canine Genome. Genome Research. 2007, 17: 387-99. 10.1101/gr.5893307.PubMed CentralPubMedView Article
- Clark DL, Boutros NN, Mendez MF: The Brain and Behavior. An introduction to Behavioral Neuroanatomy. 2010, Cambridge University Press, ThirdView Article
- Ingenuity IPA. [http://www.ingenuity.com]
- Anderson SW, Bechara A, Damasio H, Tranel D, Damasio AR: Impairment of social and moral behavior related to early damage in human prefrontal cortex. Nat Neurosci. 1999, 2: 1032-7. 10.1038/14833.PubMedView Article
- Mane SP, Evans C, Cooper KL, Crasta OR, Folkerts O, Hutchison SK, Harkins TT, Thierry-Mieg D, Thierry-Mieg J, Jensen RV: Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing. BMC Genomics. 2009, 10: 264-10.1186/1471-2164-10-264.PubMed CentralPubMedView Article
- Kumar S, Blaxter ML: Comparing de novo assemblers for 454 transcriptome data. BMC Genomics. 2010, 11: 571-10.1186/1471-2164-11-571.PubMed CentralPubMedView Article
- Han X, Wu X, Chung WY, Li T, Nekrutenko A, Altman NS, Chen G, Ma H: Transcriptome of embryonic and neonatal mouse cortex by high-throughput RNA sequencing. Proc Natl Acad Sci USA. 2009, 106: 12741-6. 10.1073/pnas.0902417106.PubMed CentralPubMedView Article
- Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ, Zody MC, Mauceli E, Xie X, Breen M, Wayne RK, Ostrander EA, Ponting CP, Galibert F, Smith DR, DeJong PJ, Kirkness E, Alvarez P, Biagi T, Brockman W, Butler J, Chin CW, Cook A, Cuff J, Daly MJ, DeCaprio D, Gnerre S, et al: Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005, 438: 803-19. 10.1038/nature04338.PubMedView Article
- Makinen A: The standard karyotype of the silver fox (Vulpes fulvus Desm.). Committee for the standard karyotype of Vulpes fulvus Desm. Hereditas. 1985, 103: 289-297.
- Switonski M, Reimann N, Bosma AA, Long S, Bartnitzke S, Pienkowska A, Moreno-Milan MM, Fischer P: Report on the progress of standardization of the G-banded canine (Canis familiaris) karyotype. Chromosome Res. 1996, 4: 306-309. 10.1007/BF02263682.PubMedView Article
- Graphodatsky AS, Radjably SI: Comparative cytogenetics of three canids species (Carnivoar, Canidae). II. Distribution of C-heterochromatin. Genetika (Russ). 1981, 17: 1500-1504.
- Yang F, O'Brien PC, Milne BS, Graphodatsky AS, Solanky N, Trifonov V, Rens W, Sargan D, Ferguson-Smith MA: A complete comparative chromosome map for the dog, red fox, and human and its integration with canine genetic maps. Genomics. 1999, 62: 189-202. 10.1006/geno.1999.5989.PubMedView Article
- Yang F, Milne BS, Schelling C, Dolf G, Schlapfer J, Switonski M, Ladon D, Pienkowska A, Bosma AA, Sargan DR, Ferguson-Smith MA: Chromosome identification and assignment of DNA clones in the dog using a red fox and dog comparative map. Chromosome Research. 2000, 8: 93-100. 10.1023/A:1009251917072.PubMedView Article
- Graphodatsky AS, Yang F, O'Brien PC, Serdukova N, Milne BS, Trifonov V, Ferguson-Smith MA: A comparative chromosome map of the Arctic fox, red fox and dog defined by chromosome painting and high resolution G-banding. Chromosome Research. 2000, 8: 253-63. 10.1023/A:1009217400140.PubMedView Article
- Kijas JW, Menzies M, Ingham A: Sequence diversity and rates of molecular evolution between sheep and cattle genes. Anim Genet. 2006, 37: 171-4. 10.1111/j.1365-2052.2005.01399.x.PubMedView Article
- Han Y, Kang Y, Torres-Jerez I, Cheung F, Town CD, Zhao PX, Udvardi MK, Monteros MJ: Genome-wide SNP Discovery in Tetraploid Alfalfa Using 454 Sequencing and High Resolution Melting Analysis. BMC Genomics. 2011, 12: 350-10.1186/1471-2164-12-350.PubMed CentralView Article
- Broders KD, Woeste KE, San Miguel PJ, Westerman RP, Boland GJ: Discovery of single-nucleotide polymorphisms (SNPs) in the uncharacterized genome of the ascomycete Ophiognomonia clavigignenti-juglandacearum from 454 sequence data. Mol Ecol Resour. 2011, 11: 693-702. 10.1111/j.1755-0998.2011.02998.x.PubMedView Article
- Groenen MA, Megens HJ, Zare Y, Warren WC, Hillier LW, Crooijmans RP, Vereijken A, Okimoto R, Muir WM, Cheng HH: The development and characterization of a 60K SNP chip for chicken. BMC Genomics. 2011, 12: 274-10.1186/1471-2164-12-274.PubMed CentralPubMedView Article
- Carninci P, Yasuda J, Hayashizaki Y: Multifaceted mammalian transcriptome. Curr Opin Cell Biol. 2008, 20: 274-80. 10.1016/j.ceb.2008.03.008.PubMedView Article
- Costa V, Angelini C, De Feis I, Ciccodicola A: Uncovering the complexity of transcriptomes with RNA-Seq. J Biomed Biotechnol. 2010, 2010: 853916-PubMed CentralPubMedView Article
- Lindberg J, Lundeberg J: The plasticity of the mammalian transcriptome. Genomics. 2010, 95: 1-6. 10.1016/j.ygeno.2009.08.010.PubMedView Article
- Thayer JF, Hansen AL, Saus-Rose E, Johnsen BH: Heart rate variability, prefrontal neural function, and cognitive performance: the neurovisceral integration perspective on self-regulation, adaptation, and health. Ann Behav Med. 2009, 37: 141-53. 10.1007/s12160-009-9101-z.PubMedView Article
- Thayer JF, Lane RD: Claude Bernard and the heart-brain connection: further elaboration of a model of neurovisceral integration. Neurosci Biobehav Rev. 2009, 33: 81-8. 10.1016/j.neubiorev.2008.08.004.PubMedView Article
- Wager TD, Waugh CE, Lindquist M, Noll DC, Fredrickson BL, Taylor SF: Brain mediators of cardiovascular responses to social threat: part I: Reciprocal dorsal and ventral sub-regions of the medial prefrontal cortex and heart-rate reactivity. Neuroimage. 2009, 47: 821-35. 10.1016/j.neuroimage.2009.05.043.PubMed CentralPubMedView Article
- Gray MA, Rylander K, Harrison NA, Wallin BG, Critchley HD: Following one's heart: cardiac rhythms gate central initiation of sympathetic reflexes. J Neurosci. 2009, 29: 1817-25. 10.1523/JNEUROSCI.3363-08.2009.PubMed CentralPubMedView Article
- Drago A, Serretti A: Focus on HTR2C: A possible suggestion for genetic studies of complex disorders. Am J Med Genet B Neuropsychiatr Genet. 2009, 150B: 601-37. 10.1002/ajmg.b.30864.PubMedView Article
- Iwamoto K, Bundo M, Kato T: Serotonin receptor 2C and mental disorders: genetic, expression and RNA editing studies. RNA Biol. 2009, 6: 248-53. 10.4161/rna.6.3.8370.PubMedView Article
- Reynolds GP, Templeman LA, Zhang ZJ: The role of 5-HT2C receptor polymorphisms in the pharmacogenetics of antipsychotic drug treatment. Prog Neuropsychopharmacol Biol Psychiatry. 2005, 29: 1021-8. 10.1016/j.pnpbp.2005.03.019.PubMedView Article
- Popova NK, Naumenko VS, Kozhemyakina RV, Plyusnina IZ: Functional characteristics of serotonin 5-HT2A and 5-HT2C receptors in the brain and the expression of the 5-HT2A and 5-HT2C receptor genes in aggressive and non-aggressive rats. Neurosci Behav Physiol. 2010, 40: 357-61. 10.1007/s11055-010-9264-x.PubMedView Article
- Saetre P, Lindberg J, Leonard JA, Olsson K, Pettersson U, Ellegren H, Bergström TF, Vilà C, Jazin E: From wild wolf to domestic dog: gene expression changes in the brain. Brain Res Mol Brain Res. 2004, 126: 198-206.PubMedView Article
- Lindberg J, Björnerfeldt S, Bakken M, Vilà C, Jazin E, Saetre P: Selection for tameness modulates the expression of heme related genes in silver foxes. Behav Brain Funct. 2007, 3: 18-10.1186/1744-9081-3-18.PubMed CentralPubMedView Article
- Våge J, Bønsdorff TB, Arnet E, Tverdal A, Lingaas F: Differential gene expression in brain tissues of aggressive and non-aggressive dogs. BMC Vet Res. 2010, 6: 34-10.1186/1746-6148-6-34.PubMed CentralPubMedView Article
- NCBI. [http://www.ncbi.nlm.nih.gov/nuccore/limits]
- Dudoit S, Yang YH, Callow MJ, Speed TP: Statistical methods for identifying genes with differential expression in replicated cDNA microarray experiments. Stat Sin. 2002, 12: 111-139.
- Reinartz J, Bruyns E, Lin JZ, Burcham T, Brenner S, Bowen B, Kramer M, Woychik R: Massively parallel signature sequencing (MPSS) as a tool for in-depth quantitative gene expression profiling in all organisms. Brief Funct Genomic Proteomic. 2002, 1: 95-104. 10.1093/bfgp/1.1.95.PubMedView Article
- Brinkhof B, Spee B, Rothuizen J, Penning LC: Development and evaluation of canine reference genes for accurate quantification of gene expression. Anal Biochem. 2006, 356: 36-43. 10.1016/j.ab.2006.06.001.PubMedView Article
- Meyers-Wallen VN, Bowman L, Acland GM, Palmer VL, Schlafer D, Fajt V: Sry-negative XX sex reversal in the German shorthaired pointer dog. J Hered. 1995, 86: 369-74.PubMed
- Johnson CM, Yang S, Sellins KS, Frank GR: Selection of HPRT primers as controls for determination of mRNA expression in dogs by RT-PCR. Vet Immunol Immunopathol. 2004, 99: 47-51. 10.1016/j.vetimm.2004.01.008.PubMedView Article
- Etschmann B, Wilcken B, Stoevesand K, von der Schulenburg A, Sterner-Kock A: Selection of reference genes for quantitative real-time PCR analysis in canine mammary tumors using the GeNorm algorithm. Vet Pathol. 2006, 43: 934-42. 10.1354/vp.43-6-934.PubMedView Article
- Schlotter YM, Veenhof EZ, Brinkhof B, Rutten VP, Spee B, Willemse T, Penning LC: A GeNorm algorithm-based selection of reference genes for quantitative real-time PCR in skin biopsies of healthy dogs and dogs with atopic dermatitis. Vet Immunol Immunopathol. 2009, 129: 115-8. 10.1016/j.vetimm.2008.12.004.PubMedView Article
- Galac S, Kool MM, Naan EC, Daminet S, Mol JA, Kooistra HS: Expression of the ACTH receptor, steroidogenic acute regulatory protein, and steroidogenic enzymes in canine cortisol-secreting adrenocortical tumors. Domest Anim Endocrinol. 2010, 39: 259-67. 10.1016/j.domaniend.2010.07.001.PubMedView Article
- Cook NL, Vink R, Donkin JJ, van den Heuvel C: Validation of reference genes for normalization of real-time quantitative RT-PCR data in traumatic brain injury. J Neurosci Res. 2009, 87: 34-41. 10.1002/jnr.21846.PubMedView Article
- Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001, 25: 402-8. 10.1006/meth.2001.1262.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.