The complete swine olfactory subgenome: expansion of the olfactory gene repertoire in the pig genome
BMC Genomics volume 13, Article number: 584 (2012)
Insects and other animals can recognize their surrounding environments by detecting thousands of chemical odorants. Olfaction is a complicated process that begins in the olfactory epithelium with the specific binding of volatile odorant molecules to dedicated olfactory receptors (ORs). OR proteins are encoded by the largest gene superfamily in the mammalian genome.
We report here the whole genome analysis of the olfactory receptor genes of S. scrofa using conserved OR gene specific motifs and known OR protein sequences from diverse species. We identified 1,301 OR related sequences from the S. scrofa genome assembly, Sscrofa10.2, including 1,113 functional OR genes and 188 pseudogenes. OR genes were located in 46 different regions on 16 pig chromosomes. We classified the ORs into 17 families, three Class I and 14 Class II families, and further grouped them into 349 subfamilies. We also identified inter- and intra-chromosomal duplications of OR genes residing on 11 chromosomes. A significant number of pig OR genes (n = 212) showed less than 60% amino acid sequence similarity to known OR genes of other species.
As the genome assembly Sscrofa10.2 covers 99.9% of the pig genome, our analysis represents an almost complete OR gene repertoire from an individual pig genome. We show that S. scrofa has one of the largest OR repertoires, suggesting an expansion of OR genes in the swine genome. A significant number of unique OR genes in the pig genome may suggest the presence of swine specific olfactory stimulation.
Insects and other animals can recognize the world around them by detecting thousands of chemical odorants. In mammals, odorant molecules are detected by olfactory receptors (ORs), which are part of the G-protein-coupled receptor superfamily of proteins having seven transmembrane domains. This superfamily was first discovered in rodents about two decades ago. Olfaction is a complicated process; it begins in the olfactory epithelium with the specific binding of volatile odorant molecules to dedicated ORs expressed by olfactory sensory neurons (OSNs) [2–5].
OR proteins are encoded by the largest gene superfamily in the mammalian genome. Using the available genome sequences, several studies have been conducted to elucidate OR subgenomes in species such as mice [6–9], humans [10–13], dogs and rats [14–16], and other vertebrates [14, 17–19]. OR gene families can be grouped into the following two classes: the fish-like Class I ORs consisting of 17 families and the tetrapod-specific Class II ORs consisting of 14 families. The number of functional OR genes ranges from less than 100 in some fishes including fugu (n = 44) and tetraodon (n = 42) to ~1,200 in rats. A significant fraction of OR genes are pseudogenes, and the fraction of OR pseudogenes ranges from less than 20% in the opossum to more than 50% in humans and the platypus [14, 17]. Interestingly, in spite of the large number of genes that make up the OR subgenome, most olfactory sensory neurons express a single OR gene and, in fact, only a single allele [1, 21].
Pigs are an attractive animal model to study olfaction and its influence on animal behavior because of their agricultural importance and their strong reliance on their sense of smell in various behavioral contexts. The characterization of the swine OR gene repertoire is necessary to better understand the underlying biology of olfaction in pigs. In addition, the comparison of OR gene repertoires and the abilities to smell among evolutionarily important animals is an interesting subject. In this study, we analyzed the pig genome assembly Sscrofa10.2, constructed by the Swine Genome Sequencing Consortium (SGSC), to characterize OR genes in pigs. We report here the nearly complete porcine olfactory subgenome. In addition, we classified the pig OR genes into families and compared OR gene repertoires of humans, dogs, mice, and pigs.
Detection of OR genes from the pig genome
The swine draft genome sequences (Sscrofa10.2) were retrieved from the National Center for Biotechnology Information (NCBI). A translated basic local alignment search tool (TBLASTN) search was performed to identify regions containing OR related sequences that had at least two of the following conserved motifs: MAYDRYVAIC (TMIII), KAFSTCASH (TMVI), and PMLNPFIY (TMVII), or their variants with less than 50% sequence difference from the conserved motifs. From the identified regions, we selected the sequences in the region one kilobase (kb) upstream and downstream of the BLAST matches. From the analysis, we identified 1,644 OR candidate sequences that were 2 kb in length, and these were translated into amino acid sequences in all six frames. Then, we retrieved 24,809 OR protein sequences from 222 species from NCBI and performed a protein BLAST (BLASTP) analysis against the translated OR candidate sequences to determine the positions of the start and stop codons of the open reading frames (ORFs) on the basis of structural similarity to known OR proteins. For sequences that deviated from the sequences of reported OR proteins, the methionine and stop codon most similar in sequence context to those of the coding sequences of known OR proteins were selected as the start and end of the coding regions. We again performed TBLASTN analysis against the 1,644 sequences to evaluate the presence of all four conserved motifs [GN, MAYDRYVAIC (TMIII), KAFSTCASH (TMVI), and PMLNPFIY (TMVII)]. The candidate sequences were considered “functional ORs” if they were at least 300 amino acids long without any interrupting stop codons and/or frameshifts within the ORFs, “OR pseudogenes” if they were at least 300 amino acids long but contained stop codons or frameshifts within the ORFs, and “partial ORs” if they were shorter than 300 amino acids in length but matched the sequences of the known OR genes.
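The motif criterion above (at least two of the three longer motifs, allowing variants with less than 50% sequence difference) can be illustrated with a minimal sketch. It assumes a simple ungapped window comparison against a translated sequence; the study itself used TBLASTN, and the function names here are hypothetical.

```python
# Conserved OR motifs named in the text, keyed by transmembrane domain.
MOTIFS = {"TMIII": "MAYDRYVAIC", "TMVI": "KAFSTCASH", "TMVII": "PMLNPFIY"}


def best_difference(protein: str, motif: str) -> float:
    """Smallest fraction of mismatched residues over all ungapped windows."""
    width = len(motif)
    if len(protein) < width:
        return 1.0
    best = 1.0
    for start in range(len(protein) - width + 1):
        window = protein[start:start + width]
        mismatches = sum(a != b for a, b in zip(window, motif))
        best = min(best, mismatches / width)
    return best


def motifs_present(protein: str, max_difference: float = 0.5) -> list[str]:
    """Names of motifs matched with less than max_difference mismatch."""
    return [name for name, motif in MOTIFS.items()
            if best_difference(protein, motif) < max_difference]


def is_candidate(protein: str) -> bool:
    """Apply the 'at least two motifs' criterion (illustrative only)."""
    return len(motifs_present(protein)) >= 2
```

A real search would also need to handle gapped alignments and all six reading frames, which is what TBLASTN provides.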
Sequences similar to non-OR G-protein-coupled receptors or partial sequences were removed from our analyses, leaving 1,301 OR genes (including pseudogenes).
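The three-way labelling of candidates can be sketched as below. This is an illustration under stated assumptions, not the pipeline used in the study: the input is assumed to be a conceptual translation with `*` marking stop codons, frameshift detection is assumed to happen upstream and is passed in as a flag, and the length is counted without stop symbols.

```python
def classify_candidate(orf_protein: str, has_frameshift: bool = False,
                       min_length: int = 300) -> str:
    """Label a translated candidate as functional, pseudogene, or partial.

    orf_protein: translation from the chosen start methionine to the end of
    the coding region, with '*' marking stop codons (assumed input format).
    """
    body = orf_protein.rstrip("*")          # ignore the terminating stop
    length = len(body.replace("*", ""))     # residues only, stops excluded
    if length < min_length:
        return "partial"
    if "*" in body or has_frameshift:
        return "pseudogene"
    return "functional"
```

Applied to the counts in the text, 1,644 candidates minus 343 partial sequences leaves the 1,301 functional genes and pseudogenes that were analyzed further.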
Phylogenetic analysis and classification
The nucleotide sequences of 3,511 OR genes from human (457), mouse (908), dog (845), and pig (1,301; 1,644 putative ORs minus 343 partial ORs) were combined and aligned together using CLUSTALW. An unrooted phylogenetic tree was constructed after 1,000 rounds of bootstrapping. The tree was used for classifying OR gene families and subfamilies. Pig OR sequences that did not form a cluster with any reference ORs from the other three species were additionally classified using a sequence similarity matrix (data not shown) in which 40% and 60% amino acid similarity were used as the thresholds to distinguish between families and subfamilies, respectively, as previously described.
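The threshold rule for the similarity matrix can be sketched as follows. The sketch assumes percent identity over aligned, ungapped columns as a stand-in for the similarity measure, and assumes the cutoffs are inclusive; neither detail is specified in the text, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(a == b for a, b in pairs) / len(pairs)


def relationship(similarity: float, family_cutoff: float = 40.0,
                 subfamily_cutoff: float = 60.0) -> str:
    """Map a pairwise similarity to the 40%/60% family/subfamily thresholds."""
    if similarity >= subfamily_cutoff:
        return "same subfamily"
    if similarity >= family_cutoff:
        return "same family"
    return "different family"
```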
OR gene nomenclatures
For naming pig OR genes, we followed the OR gene classification system described by Glusman et al. Functional pig OR genes were named “sORmXn” whereas pseudogenes were named “sORmXnP”, where “s” stands for S. scrofa, “OR” is the root name indicating an olfactory receptor, “m” is an integer representing the family that the gene belongs to, “X” is a letter code denoting the subfamily of the gene, and “n” is an integer representing an individual family member. The names of the pig OR sequences were devised on the basis of their phylogenetic relationships. For example, sOR1A1 is an OR gene of family 1, subfamily A, and is the first member of this subfamily. In the case of pseudogenes, a name such as sOR7E12P indicates an OR pseudogene of family 7, subfamily E, that is the twelfth member of this subfamily. Duplicated genes with the exact same coding sequences were indicated by adding the suffix A, B, or C at the end of their names, e.g., sOR51N3A and sOR51N3B.
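The naming scheme lends itself to a small parser, sketched here for illustration. Two details are assumptions: the subfamily code is allowed to be one or two letters (names such as sOR5AT1 appear later in the text), and for a duplicated pseudogene the “P” is assumed to precede the copy suffix. The class and function names are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class PigORName(NamedTuple):
    family: int
    subfamily: str
    member: int
    pseudogene: bool
    duplicate: Optional[str]  # 'A', 'B', or 'C' for identical copies


# s + OR + family number + subfamily letters + member number [+ P] [+ copy]
_NAME = re.compile(r"^sOR(\d+)([A-Z]{1,2})(\d+)(P)?([ABC])?$")


def parse_pig_or_name(name: str) -> PigORName:
    """Split a pig OR gene name into its nomenclature fields."""
    match = _NAME.match(name)
    if match is None:
        raise ValueError(f"not a recognised pig OR name: {name!r}")
    family, subfamily, member, pseudo, duplicate = match.groups()
    return PigORName(int(family), subfamily, int(member),
                     pseudo is not None, duplicate)
```

For example, `parse_pig_or_name("sOR7E12P")` yields family 7, subfamily E, member 12, flagged as a pseudogene.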
Identification of pig specific OR genes
Multispecies OR gene clustering analysis was performed with OR protein sequences from humans, dogs, mice, and pigs using the OrthoMCL 3 software, in order to group them on the basis of their sequence similarity and divergence. In total, 706 clusters were formed from 3,511 sequences. The cutoff value for a cluster was 60% similarity at the level of the protein sequence, resulting in sequences with greater than 60% similarity being clustered together regardless of the species of origin.
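The effect of a 60% cutoff can be illustrated with a simplified stand-in for the clustering step. The sketch below uses single-linkage grouping with a union-find structure; it is not the Markov clustering that OrthoMCL performs, and the input format (a dictionary of pairwise similarities) is an assumption made for the example.

```python
from collections import defaultdict


def clusters_at_cutoff(names, similarities, cutoff=60.0):
    """Group sequences linked by pairwise similarity greater than cutoff.

    names: list of sequence identifiers.
    similarities: dict mapping (name_a, name_b) to percent similarity.
    """
    parent = {name: name for name in names}

    def find(item):
        # Follow parent links to the root, compressing the path on the way.
        while parent[item] != item:
            parent[item] = parent[parent[item]]
            item = parent[item]
        return item

    for (first, second), percent in similarities.items():
        if percent > cutoff:
            root_a, root_b = find(first), find(second)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = defaultdict(list)
    for name in names:
        groups[find(name)].append(name)
    return sorted(sorted(group) for group in groups.values())
```

Single linkage merges chains of similar sequences, so it can produce larger clusters than a graph-based method on the same data.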
Detection of conserved motifs and patterns
To detect conserved motifs in predicted OR protein sequences, sequence logos were generated from an alignment of functional OR gene sequences using the WebLogo program. The PRATT program from the Pattern Discovery Platform was used to define pig OR-specific patterns with the criteria listed in Additional file 1.
Composition of the pig OR gene repertoire
The four motif sequences, GN, MAYDRYVAIC, KAFSTCASH and PMLNPFIY, which are common to mammalian OR genes, were used to search the full repertoire of ORs in the pig genome (Figure 1A). We identified 1,301 OR gene-related sequences with lengths of 900–1,000 base pairs (bp). We also analyzed their ORFs and grouped them into the following two categories: functional genes and pseudogenes. In total, 1,113 OR sequences were identified as functional and 188 were identified as pseudogenes. Among the identified functional genes and pseudogenes, 91.19% of the sequences contained all three conserved OR motifs (TMIII, TMVI, and TMVII) and the rest were missing one of them (Figure 1B). The presence of the GN motif was difficult to evaluate because the motif is defined by only two amino acids and may also have sequence variations. Therefore, we did not include the result for this motif.
Chromosomal distribution of OR genes in the pig genome
The locations of the OR genes were analyzed on the basis of their relative positions in the pig genome by grouping them into positional regions according to their positional proximity. If the coding sequences of the OR genes were more than one megabase (Mb) apart, they were considered to be present in different regions. Of the 1,301 functional genes and pseudogenes, 1,290 were mapped to 46 different chromosomal regions across 16 pig chromosomes and the remaining 11 were located on chromosome U, which contains unmapped sequences (Figure 2). Except for chromosomes 11, 16, 17, and Y, which were devoid of OR genes, all the other chromosomes contained one to 406 OR genes (Table 1). Chromosome 2 had the largest number of OR genes (341), followed by chromosomes 7, 9, and 1. Accordingly, chromosome 2 contained the largest number of OR subfamilies with 121 subfamilies, while only a single subfamily was present on each of chromosomes 8 and 10 (Table 1).
We observed extensive variation in the number of OR genes at individual OR gene clusters, from one to 123 OR genes per locus/cluster (Table 2). Due to the presence of a large number of OR genes in the genome, the number of pseudogenes was also high (n = 188). The percentage of pseudogenes varied among clusters and ranged from 0 to 100% (Table 1). Of the 46 OR gene clusters, the locus “10-78” was the only one that consisted of a single pseudogene and no functional gene, while the other 45 clusters had at least one functional gene (Table 2). In the current swine genome assembly Sscrofa10.2, 11 OR genes (nine functional genes and two pseudogenes) were located on unmapped contigs without any chromosome information. Complete information on the distribution of all OR functional genes and pseudogenes in the pig genome is detailed in Additional file 2.
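The positional grouping rule can be sketched as follows. The sketch assumes that the distance is measured from the end of one coding sequence to the start of the next on the same chromosome; the tuple layout and function name are hypothetical and chosen only for the example.

```python
def group_into_regions(genes, max_gap=1_000_000):
    """Split genes into regions wherever neighbours are more than max_gap apart.

    genes: iterable of (chromosome, start, end, name) tuples.
    Returns a list of (chromosome, [gene names]) in positional order.
    """
    by_chromosome = {}
    for chromosome, start, end, name in genes:
        by_chromosome.setdefault(chromosome, []).append((start, end, name))

    regions = []
    for chromosome in sorted(by_chromosome):
        current = []
        last_end = None
        for start, end, name in sorted(by_chromosome[chromosome]):
            # A gap larger than max_gap closes the current region.
            if last_end is not None and start - last_end > max_gap:
                regions.append((chromosome, current))
                current = []
            current.append(name)
            last_end = end if last_end is None else max(last_end, end)
        regions.append((chromosome, current))
    return regions
```

With this rule, two genes 0.5 Mb apart fall into one region, while a gene a further 1.5 Mb downstream starts a new one.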
Classification of OR gene repertoires
Understanding the diversity of OR genes is important for elucidating the differences in their functional responses to various odorants. ORs with more than 60% identity in protein sequence are suggested to recognize odorants with related structures [29, 30]. To evaluate the diversity in the OR gene repertoire of pigs, the identified pig OR genes were classified into families and subfamilies according to the results of phylogenetic analyses (data available upon request) and their sequence similarity. Then, the results obtained after the classification were compared with those previously obtained for humans, dogs, mice, and rats [9, 13, 16]. Our analysis showed that the pig OR repertoire comprises 17 families and 349 subfamilies; this repertoire is the largest among the known repertoires of mammals (Additional file 3). This suggests that compared to other species, pigs may have a more sophisticated system to sense smell and may be able to distinguish more diverse odorants. Although humans and dogs have a relatively large number of OR subfamilies (300 each), humans have a higher pseudogene frequency (52%) than pigs, and dogs have a lower number of functional genes (n = 872) than pigs (n = 1,113). This supports the idea that the functional complexity of the pig olfactory system could be attributed, in part, to genetic complexity. Similar to the OR genes of other mammals, pig OR genes could also be classified into two classes, with three Class I families and 14 Class II families (Additional file 3).
The number of OR genes belonging to each subfamily may represent the importance of the specific subfamilies for the species, as the OR gene subfamilies that are important for the survival of the species are likely to expand in the genome through evolution. Therefore, we counted the number of ORs in each subfamily (Additional file 4). The size of pig OR subfamilies was extremely variable with one to 52 OR genes per subfamily. While most subfamilies had one to six members, six subfamilies had more than 20 genes each. The most common type of subfamily comprised only a single OR gene, accounting for 146 subfamilies. In contrast, subfamily sOR6A consisted of 52 genes (data not shown).
Distribution of OR subfamilies within the OR gene clusters
To study the possible associations between the subfamily structure and the chromosomal organization of OR genes in pigs, the chromosomal locations of all OR gene members of the 349 pig OR subfamilies were analyzed (Table 1). The largest OR cluster in the pig genome was the cluster “9-2” on chromosome 9, which contained 123 OR genes making up 52 subfamilies. We observed that 275 (78.8%) subfamilies were encoded by genes at a single chromosomal cluster, suggesting possible functional similarities among OR genes within a cluster. When we determined the subfamily composition of individual OR gene clusters, the number of subfamilies within a cluster ranged from one to 52 (Table 2). About 26% (12/46) of the OR clusters encoded only one OR subfamily, while 74% of clusters (34/46) encoded OR genes of two or more subfamilies. The general characteristics of the OR subgenome, including the number of functional OR genes within a cluster, the number of clusters within a subfamily, and the number of subfamilies within a cluster in the pig (Table 2), were consistent with those reported for other species such as mouse and human [9, 13].
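The tally of subfamilies confined to a single cluster can be sketched as below, assuming each gene has already been assigned a subfamily and a cluster label. The input format and the toy labels in the usage lines are hypothetical; only the percentages at the end use figures from the text.

```python
from collections import defaultdict


def single_cluster_subfamilies(assignments):
    """Find subfamilies whose genes all lie in one chromosomal cluster.

    assignments: iterable of (subfamily, cluster_id) pairs, one per gene.
    Returns (sorted list of confined subfamilies, total subfamily count).
    """
    clusters_of = defaultdict(set)
    for subfamily, cluster_id in assignments:
        clusters_of[subfamily].add(cluster_id)
    confined = sorted(name for name, clusters in clusters_of.items()
                      if len(clusters) == 1)
    return confined, len(clusters_of)


# Toy input: subfamily 6A spans two clusters, 1A and 7E stay in one.
toy = [("1A", "2-1"), ("1A", "2-1"), ("6A", "9-2"), ("6A", "7-3"), ("7E", "9-2")]
confined, total = single_cluster_subfamilies(toy)

# Percentages reported in the text, recomputed from the stated counts.
share_confined = round(100 * 275 / 349, 1)   # 78.8
share_one_subfamily = round(100 * 12 / 46)   # 26
share_multi_subfamily = round(100 * 34 / 46) # 74
```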
Analysis of OR gene duplication
Gene duplication plays an important role in establishing the biological characteristics or diversity of organisms during evolution. From our analysis to identify OR genes in the pig genome, we found 100% identical coding sequences of OR genes that mapped to different regions in the pig genome. Further analysis showed that the sizes of duplications ranged from 1.1 to 120 kb (data not shown). Duplicated OR genes were found for both functional genes (n = 166) and pseudogenes (n = 22) (Additional file 5), although most of the duplications were of functional genes. There are 80 functional genes and 11 pseudogenes that have one identical copy each, making 160 and 22 OR genes in total, and two OR genes, sOR7A6[ABC] and sOR5AT1[ABC], were found three times each in the pig genome assembly Sscrofa10.2 (Additional file 6). In total, 93 duplication events consisting of 87 intra- and six inter-chromosomal duplications (data not shown) were observed on 11 chromosomes, with duplication of two to 41 genes depending on the chromosome (Additional file 5). The most frequent duplication pattern was the presence of two identical OR coding sequences in the genome (Additional file 6). However, we were not able to entirely exclude the possibility that some of these duplications resulted from errors in the genome assembly. Although we reexamined the partial or duplicated OR genes with respect to assembly issues, such as their locations in the contigs and the relationships between individual members of identical duplicates, we did not find any evidence that a portion of the partial or duplicated OR genes was caused by assembly errors.
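The duplication counts above can be reconciled with a short tally. Two points are inferences rather than statements from the text: that both triplicated genes are functional (which the total of 166 implies), and that each set of identical copies corresponds to one duplication event.

```python
# Counts stated in the text.
functional_pairs = 80   # functional genes present as two identical copies
pseudogene_pairs = 11   # pseudogenes present as two identical copies
triplicated = 2         # sOR7A6[ABC] and sOR5AT1[ABC], three copies each

# Assumption: both triplicated genes are functional.
functional_copies = functional_pairs * 2 + triplicated * 3
pseudogene_copies = pseudogene_pairs * 2

# Assumption: one duplication event per set of identical copies.
duplication_events = functional_pairs + pseudogene_pairs + triplicated

print(functional_copies, pseudogene_copies, duplication_events)  # 166 22 93
```

The event total also matches the stated split of 87 intra- and six inter-chromosomal duplications.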
Patterns of characteristic amino acid motifs in pig OR proteins
Using the criteria in Additional file 1, we performed a pattern discovery analysis for pig OR genes. Table 3 shows five motif patterns identified from four conserved transmembrane domains of pig OR genes, TMII, TMIII, TMVI and TMVII, which are similar to those reported from other species including dogs, rats, and humans except for minor differences at variable amino acid sites. Analysis of the similarities and differences in conserved OR transmembrane motifs among different species could elucidate the functional importance of each site within the motifs.
Potential odorant specificity of OR subfamilies
To identify potential target specificity of pig OR subfamilies in odor perception, we compared the amino acid sequences of the 1,113 translated pig OR genes to those of other species with previously described information on odorant specificity, including two human ORs [32, 33] and 20 mouse ORs [29, 30, 34–38]. From the analysis, we found that 18 pig ORs matched ORs of known specificity from other species with at least 60% sequence identity, suggesting that these ORs may share similar olfactory specificities (Table 4). Three mouse ORs, Olfr672, Olfr586 and Olfr545, showed less than 60% sequence similarity to pig ORs; they are known to sense n-aliphatic acids, n-aliphatic alcohols, n-aliphatic dicarboxylic acids, and (−) citronellal. In addition, our analysis showed that no pig OR has sequence similarity to OR3A1; this human OR is known to perceive helional, which has a sweet, hay-like smell.
ORs in mammals are encoded by several hundred to more than a thousand genes in the genome, which together form the OR subgenome [7, 9, 10, 13, 15, 16, 18, 19]. With the availability of whole genome sequence information, several studies have been carried out to characterize the OR subgenomes of vertebrates [9, 13, 15, 16, 18, 19, 39] in an attempt to better understand the underlying biology of olfaction. In this study, we analyzed the current genome assembly of S. scrofa using conserved OR motifs and 24,809 OR protein sequences available from NCBI. We also identified and characterized 1,301 OR related sequences and their genomic distributions. Our study, as the first analysis of the OR gene repertoire in Artiodactyla, shows the presence of similarities and differences in the genetic make-up between the pig OR system and that of other animals.
The percentage of OR pseudogenes in the OR subgenome could be an important factor in determining the actual size of the OR repertoire and the number of OR genes present in the genome. Our analysis shows that the percentage of OR pseudogenes in the pig genome is 14%, which is the lowest fraction of pseudogenes reported for any species, followed by those of dogs and rats (Table 5). Pigs and rats have the largest functional OR repertoires with 1,113 and 1,201 genes, respectively. It is interesting to speculate that the olfactory capacity of pigs and rats could be superior to that of dogs, which have 872 functional OR genes, when only gene numbers and not the anatomical differences of the olfactory system are considered.
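The pig pseudogene percentage follows directly from the counts reported earlier (1,113 functional genes and 188 pseudogenes). A minimal sketch of the calculation, with a hypothetical function name:

```python
def pseudogene_percentage(functional: int, pseudogenes: int) -> float:
    """Pseudogenes as a percentage of all OR genes (functional + pseudogenes)."""
    total = functional + pseudogenes
    if total == 0:
        raise ValueError("no genes to summarise")
    return 100.0 * pseudogenes / total


pig = pseudogene_percentage(functional=1113, pseudogenes=188)
print(f"{pig:.1f}%")  # 14.5%, reported as 14% in the text
```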
The prevalence of pseudogenes in humans and nonhuman primates has been described in several studies as characteristic of these lineages [4, 41–44]. Because of the anatomical and physiological similarity between pigs and humans, the importance of pigs as biomedical models or donors for human xenotransplantation has recently been suggested. On the other hand, the genetic system of olfaction could be one of the major differences between humans and pigs; this is consistent with the concept of primates as visual mammals with reduced olfaction. Although detailed anatomical and functional studies on the olfactory system of pigs are not available, the general behavior of pigs and the size of the genetic content responsible for olfaction in pigs support the hypothesis of olfactory expansion in the pig.
When we compared the structural characteristics of OR gene clusters between pigs, humans, mice, rats, and dogs, we did not observe any distinctive trends or patterns that reflected the size of the OR gene repertoire (Additional file 7). However, the number of OR genes per cluster was related to the size of the OR gene repertoire, indicating that an increase in OR gene numbers in pigs during evolution was not due to an increase in the number of OR clusters, but more likely due to an increase in gene numbers within clusters. The number of nonfunctional OR clusters consisting of only OR pseudogenes without functional genes was limited to only one locus in the pig genome, while 13 such clusters were identified in humans.
MHC haplotypes and olfaction have been suspected to be related. Therefore, we determined the number of OR genes that were located on the same chromosome as the MHC region in humans, dogs, mice, rats, and pigs. While the number of OR genes on chromosome 7, which contains the MHC region in pigs, was very high (n = 253), the number of OR genes on the MHC-containing chromosomes in other species was much lower (data not shown). Further evaluation of the physical distance between OR genes and the MHC region among the five species showed that these clusters were not always physically proximal to each other. In dogs in particular, no ORs were found near the MHC region. Although functional relationships may exist between OR and MHC molecules, our analysis suggests that the physical linkage between OR clusters and MHC regions may not be strong in all species.
To understand the evolutionary relationships between OR genes from pigs, humans, mice, and dogs, we combined 3,511 OR gene sequences from these four species and performed clustering according to their protein sequence similarity (Figure 3). Using a cutoff of more than 60% sequence identity to group sequences together into a single cluster, 706 clusters were generated according to sequence similarity between pigs, humans, mice, and dogs. Intra-species OR subfamily genes that have more than 60% sequence homology have been indicated to bind to odorants with similar chemical structures [29, 30]. Similarly, OR genes with high sequence homology across different species could also recognize similar odorant substances.
We observed that 21% of the OR clusters (n = 148) had genes that were common to all four species, and this type of cluster was the most common (Table 6). The second most common type of cluster contained genes common among mice, dogs, and pigs but not humans; this is consistent with the preferential loss of OR genes in the human genome. We found that 171 of the 212 pig specific OR genes were functional genes, showing that the pig contains the largest number of unique OR genes among the species considered in this study. The number of clusters or subfamilies specific to pigs, humans, mice, and dogs was 61, 4, 39, and 19, respectively (Figure 3, Additional file 8). The presence of unique or common OR genes across different species reflects the maintenance and diversification of genes from common ancestors or the loss of genes within specific lineages during evolution, thus leading to OR subgenome diversity. Consistent with this, we found that the protein sequences of functional OR genes in pigs were highly similar (>70%) to those of OR pseudogenes of other species (Additional file 9).
A recent study in humans showed that a polymorphism in a region on chromosome 11 containing the OR genes OR51B5 and OR51B6 was associated with fetal hemoglobin concentration. This indicates that the elements within this OR gene cluster may play a regulatory role in gamma-globin gene expression. The stereotypical mating posture of an estrus female pig when exposed to a compound in the saliva of boars is also mediated by the olfactory system. Further studies on OR genes and their functional importance could elucidate phenotypes other than olfaction, such as reproductive or behavioral traits, that may be associated with OR gene clusters.
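The species composition of clusters can be tallied as in the sketch below. It assumes each cluster is given as a list of (species, gene) pairs; the toy clusters and gene labels are invented for illustration, and only the final percentage uses figures from the text.

```python
from collections import Counter


def composition_counts(clusters):
    """Count clusters by the set of species they contain."""
    counts = Counter()
    for members in clusters:
        species = tuple(sorted({name for name, _ in members}))
        counts[species] += 1
    return counts


def species_specific(clusters, species):
    """Clusters whose members all come from a single given species."""
    return [members for members in clusters
            if {name for name, _ in members} == {species}]


# Toy clusters (hypothetical gene labels).
toy_clusters = [
    [("pig", "p1"), ("human", "h1"), ("mouse", "m1"), ("dog", "d1")],
    [("pig", "p2"), ("pig", "p3")],
    [("mouse", "m2"), ("dog", "d2"), ("pig", "p4")],
]
counts = composition_counts(toy_clusters)
pig_only = species_specific(toy_clusters, "pig")

# Share of clusters common to all four species, from the stated counts.
shared_share = round(100 * 148 / 706)  # 21
```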
We performed a genome level analysis of OR genes in the pig genome using conserved motif sequences specific to OR genes. Since the current pig genome assembly covers 99.9% of the pig genome, our result represents almost the entire OR gene repertoire of an individual pig genome. The pig OR gene family consists of 1,301 genes including pseudogenes, thus making it one of the largest known OR repertoires and suggesting an expansion of OR genes in the pig genome. The large number of OR subfamilies in pigs could contribute to the functional diversity of the olfactory system of pigs and allow pigs to recognize more diverse odorants than other animals.
Buck L, Axel R: A novel multigene family may encode odorant receptors: a molecular basis for odor recognition. Cell. 1991, 65 (1): 175-187. 10.1016/0092-8674(91)90418-X.
Buck LB: Information coding in the vertebrate olfactory system. Annu Rev Neurosci. 1996, 19: 517-544. 10.1146/annurev.ne.19.030196.002505.
Touhara K: Odor discrimination by G protein-coupled olfactory receptors. Microsc Res Tech. 2002, 58 (3): 135-141. 10.1002/jemt.10131.
Firestein S: How the olfactory system makes sense of scents. Nature. 2001, 413 (6852): 211-218. 10.1038/35093026.
Mombaerts P: Seven-transmembrane proteins as odorant and chemosensory receptors. Science. 1999, 286 (5440): 707-711. 10.1126/science.286.5440.707.
Young JM, Friedman C, Williams EM, Ross JA, Tonnes-Priddy L, Trask BJ: Different evolutionary processes shaped the mouse and human olfactory receptor gene families. Hum Mol Genet. 2002, 11 (5): 535-546. 10.1093/hmg/11.5.535.
Zhang X, Firestein S: The olfactory receptor gene superfamily of the mouse. Nat Neurosci. 2002, 5 (2): 124-133.
Zhang X, Rodriguez I, Mombaerts P, Firestein S: Odorant and vomeronasal receptor genes in two mouse genome assemblies. Genomics. 2004, 83 (5): 802-811. 10.1016/j.ygeno.2003.10.009.
Godfrey PA, Malnic B, Buck LB: The mouse olfactory receptor gene family. Proc Natl Acad Sci USA. 2004, 101 (7): 2156-2161. 10.1073/pnas.0308051100.
Glusman G, Yanai I, Rubin I, Lancet D: The complete human olfactory subgenome. Genome Res. 2001, 11 (5): 685-702. 10.1101/gr.171001.
Niimura Y, Nei M: Evolution of olfactory receptor genes in the human genome. Proc Natl Acad Sci USA. 2003, 100 (21): 12235-12240. 10.1073/pnas.1635157100.
Zozulya S, Echeverri F, Nguyen T: The human olfactory receptor repertoire. Genome Biol. 2001, 2 (6): RESEARCH0018.
Malnic B, Godfrey PA, Buck LB: The human olfactory receptor gene family. Proc Natl Acad Sci USA. 2004, 101 (8): 2584-2589. 10.1073/pnas.0307882100.
Niimura Y, Nei M: Extensive gains and losses of olfactory receptor genes in mammalian evolution. PLoS One. 2007, 2 (8): e708. 10.1371/journal.pone.0000708.
Olender T, Fuchs T, Linhart C, Shamir R, Adams M, Kalush F, Khen M, Lancet D: The canine olfactory subgenome. Genomics. 2004, 83 (3): 361-372. 10.1016/j.ygeno.2003.08.009.
Quignon P, Giraud M, Rimbault M, Lavigne P, Tacher S, Morin E, Retout E, Valin AS, Lindblad-Toh K, Nicolas J, et al: The dog and rat olfactory receptor repertoires. Genome Biol. 2005, 6 (10): R83. 10.1186/gb-2005-6-10-r83.
Niimura Y, Nei M: Evolutionary dynamics of olfactory receptor genes in fishes and tetrapods. Proc Natl Acad Sci USA. 2005, 102 (17): 6039-6044. 10.1073/pnas.0501922102.
Freitag J, Krieger J, Strotmann J, Breer H: Two classes of olfactory receptors in Xenopus laevis. Neuron. 1995, 15 (6): 1383-1392. 10.1016/0896-6273(95)90016-0.
Freitag J, Ludwig G, Andreini I, Rossler P, Breer H: Olfactory receptors in aquatic and terrestrial vertebrates. J Comp Physiol A. 1998, 183 (5): 635-650. 10.1007/s003590050287.
Alioto TS, Ngai J: The odorant receptor repertoire of teleost fish. BMC Genomics. 2005, 6: 173. 10.1186/1471-2164-6-173.
Chess A, Simon I, Cedar H, Axel R: Allelic inactivation regulates olfactory receptor gene expression. Cell. 1994, 78 (5): 823-834. 10.1016/S0092-8674(94)90562-2.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680. 10.1093/nar/22.22.4673.
Glusman G, Bahar A, Sharon D, Pilpel Y, White J, Lancet D: The olfactory receptor gene superfamily: data mining, classification, and nomenclature. Mamm Genome. 2000, 11 (11): 1016-1023. 10.1007/s003350010196.
Li L, Stoeckert CJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13 (9): 2178-2189. 10.1101/gr.1224503.
Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14 (6): 1188-1190. 10.1101/gr.849004.
Jonassen I, Collins JF, Higgins DG: Finding flexible patterns in unaligned protein sequences. Protein Sci. 1995, 4 (8): 1587-1595. 10.1002/pro.5560040817.
Pattern Discovery Platform. http://genouest.org
Terje R, Bhanu PC: Cytogenetics and Chromosome Maps. The genetics of the pig. Edited by: Rothschild MF, Ruvinsky A. 2011, Chippenham: CABI Press, 136-
Malnic B, Hirono J, Sato T, Buck LB: Combinatorial receptor codes for odors. Cell. 1999, 96 (5): 713-723. 10.1016/S0092-8674(00)80581-4.
Kajiya K, Inaki K, Tanaka M, Haga T, Kataoka H, Touhara K: Molecular bases of odor discrimination: Reconstitution of olfactory receptors that recognize overlapping sets of odorants. J Neurosci. 2001, 21 (16): 6018-6025.
Taylor JS, Raes J: Duplication and divergence: the evolution of new genes and old ideas. Annu Rev Genet. 2004, 38: 615-643. 10.1146/annurev.genet.38.072902.092831.
Spehr M, Gisselmann G, Poplawski A, Riffell JA, Wetzel CH, Zimmer RK, Hatt H: Identification of a testicular odorant receptor mediating human sperm chemotaxis. Science. 2003, 299 (5615): 2054-2058. 10.1126/science.1080376.
Wetzel CH, Oles M, Wellerdieck C, Kuczkowiak M, Gisselmann G, Hatt H: Specificity and sensitivity of a human olfactory receptor functionally expressed in human embryonic kidney 293 cells and Xenopus Laevis oocytes. J Neurosci. 1999, 19 (17): 7426-7433.
Zhao H, Ivic L, Otaki JM, Hashimoto M, Mikoshiba K, Firestein S: Functional expression of a mammalian odorant receptor. Science. 1998, 279 (5348): 237-242. 10.1126/science.279.5348.237.
Krautwurst D, Yau KW, Reed RR: Identification of ligands for olfactory receptors by functional expression of a receptor library. Cell. 1998, 95 (7): 917-926. 10.1016/S0092-8674(00)81716-X.
Touhara K, Sengoku S, Inaki K, Tsuboi A, Hirono J, Sato T, Sakano H, Haga T: Functional identification and reconstitution of an odorant receptor in single olfactory neurons. Proc Natl Acad Sci USA. 1999, 96 (7): 4040-4045. 10.1073/pnas.96.7.4040.
Gaillard I, Rouquier S, Pin JP, Mollard P, Richard S, Barnabe C, Demaille J, Giorgi D: A single olfactory receptor specifically binds a set of odorant molecules. Eur J Neurosci. 2002, 15 (3): 409-418. 10.1046/j.0953-816x.2001.01871.x.
Bozza T, Feinstein P, Zheng C, Mombaerts P: Odorant receptor expression defines functional units in the mouse olfactory system. J Neurosci. 2002, 22 (8): 3033-3043.
Rouquier S, Taviaux S, Trask BJ, Brand-Arpon V, van den Engh G, Demaille J, Giorgi D: Distribution of olfactory receptor genes in the human genome. Nat Genet. 1998, 18 (3): 243-250. 10.1038/ng0398-243.
Niimura Y, Nei M: Evolutionary dynamics of olfactory and other chemosensory receptor genes in vertebrates. J Hum Genet. 2006, 51 (6): 505-517. 10.1007/s10038-006-0391-8.
Dulac C, Torello AT: Molecular detection of pheromone signals in mammals: from genes to behaviour. Nat Rev Neurosci. 2003, 4 (7): 551-562. 10.1038/nrn1140.
Brunet LJ, Gold GH, Ngai J: General anosmia caused by a targeted disruption of the mouse olfactory cyclic nucleotide-gated cation channel. Neuron. 1996, 17 (4): 681-693. 10.1016/S0896-6273(00)80200-7.
Rouquier S, Blancher A, Giorgi D: The olfactory receptor gene repertoire in primates and mouse: evidence for reduction of the functional fraction in primates. Proc Natl Acad Sci USA. 2000, 97 (6): 2870-2874. 10.1073/pnas.040580197.
Sharon D, Glusman G, Pilpel Y, Khen M, Gruetzner F, Haaf T, Lancet D: Primate evolution of an olfactory receptor cluster: diversification by gene conversion and recent emergence of pseudogenes. Genomics. 1999, 61 (1): 24-36. 10.1006/geno.1999.5900.
Min TS, Han HJ, Park SH: Porcine Xenotransplantation to Primates. Asian Austral J Anim. 2010, 23 (11): 1535-1542. 10.5713/ajas.2010.10299.
Barton RA: Olfactory evolution and behavioral ecology in primates. Am J Primatol. 2006, 68 (6): 545-558. 10.1002/ajp.20251.
Santos PSC, Uehara CJS, Ziegler A, Uchanska-Ziegler B, Bicalho MD: Variation and linkage disequilibrium within odorant receptor gene clusters linked to the human major histocompatibility complex. Hum Immunol. 2010, 71 (9): 843-850. 10.1016/j.humimm.2010.06.011.
Solovieff N, Milton JN, Hartley SW, Sherva R, Sebastiani P, Dworkis DA, Klings ES, Farrer LA, Garrett ME, Ashley-Koch A, et al: Fetal hemoglobin in sickle cell anemia: genome-wide association studies suggest a regulatory region in the 5' olfactory receptor gene cluster. Blood. 2010, 115 (9): 1815-1822. 10.1182/blood-2009-08-239517.
Dorries KM, Adkins-Regan E, Halpern BP: Sensitivity and behavioral responses to the pheromone androstenone are not mediated by the vomeronasal organ in domestic pigs. Brain Behav Evol. 1997, 49 (1): 53-62. 10.1159/000112981.
This work was supported by the Next-Generation BioGreen 21 Program (No. PJ0080892011) and the Cooperative Research Program for Agriculture Science & Technology Development (No. PJ009103) of the Rural Development Administration, Republic of Korea.
The authors declare that they have no competing interests.
Both DTN and KL carried out the bioinformatics analyses and classification of porcine OR genes, interpreted the data, and drafted the manuscript. HC, MC, MTL, and NS evaluated the results of the bioinformatics analyses. JHK, HGS, and JWO provided helpful ideas and discussion for the experiments. KL and THK were collaborators for the SGSC within Korea and played important roles in carrying out the project. CP was involved in project planning, discussion, and writing of the manuscript as a project principal investigator. This study was based on the swine genome sequencing results provided by the SGSC. All authors read and approved the final manuscript.
Dinh Truong Nguyen and Kyooyeol Lee contributed equally to this work.
Electronic supplementary material
Additional file 1: Criteria for pattern recognition of pig OR genes using the PRATT program. Table describing parameters and values for pattern recognition of pig OR genes. (XLS 34 KB)
Additional file 2: The pig OR gene coordinates in the pig genome assembly Sscrofa10.2. Table listing positions of functional OR genes and OR pseudogenes in the pig genome. (XLS 2 MB)
Additional file 3: Comparison of the family and subfamily diversity of OR genes among pigs, humans, dogs, mice, and rats. Table showing the results of comparative analysis of the number of classes, families, and subfamilies among five species including pigs, humans, dogs, mice, and rats. (XLS 19 KB)
Additional file 4: The number of OR gene members in OR subfamilies. Table showing the number of OR gene members in each OR subfamily. (XLS 34 KB)
Additional file 5: Distribution of OR gene duplications in the pig genome. Table showing the distribution of OR genes duplicated in the pig genome. (XLS 34 KB)
Additional file 6: The number of identical OR genes and their copy numbers in the pig genome. Table listing the number of OR gene duplications and their copy numbers in the pig genome Sscrofa10.2. The maximum number of identical genes was three. (XLS 18 KB)
Additional file 7: Comparison of structural characteristics of OR gene clusters among five mammalian species. Table listing the number of clusters, the number of genes per cluster, and the number of clusters containing only pseudogenes for pigs, humans, dogs, mice, and rats. (XLS 18 KB)
Additional file 8: Amino acid sequence similarity between the functional OR genes of pigs and OR pseudogenes of other species. Table listing 19 pairs of functional pig OR genes and pseudogenes of other species with high protein sequence homology (>70%). (XLS 21 KB)
Additional file 9: The uncompressed image of Figure 3. The figure shows the uncompressed image of 706 clusters from 3,511 sequences of pig, human, mouse and dog ORs. (XLS 164 KB)
Rights and permissions
Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Nguyen, D.T., Lee, K., Choi, H. et al. The complete swine olfactory subgenome: expansion of the olfactory gene repertoire in the pig genome. BMC Genomics 13, 584 (2012). https://doi.org/10.1186/1471-2164-13-584
- Olfactory receptor
- OR genes