Comprehensive search for intra- and inter-specific sequence polymorphisms among coding envelope genes of retroviral origin found in the human genome: genes and pseudogenes
- Nathalie de Parseval†1,
- Gora Diop†2,
- Sandra Blaise1,
- François Helle1,
- Alexandre Vasilescu2,
- Fumihiko Matsuda2 and
- Thierry Heidmann1Email author
© de Parseval et al; licensee BioMed Central Ltd. 2005
Received: 16 March 2005
Accepted: 09 September 2005
Published: 09 September 2005
The human genome carries a high load of proviral-like sequences, called Human Endogenous Retroviruses (HERVs), which are the genomic traces of ancient infections by active retroviruses. These elements are in most cases defective, but open reading frames can still be found for the retroviral envelope gene, with sixteen such genes identified so far. Several of them are conserved during primate evolution, having possibly been co-opted by their host for a physiological role.
To characterize further their status, we presently sequenced 12 of these genes from a panel of 91 Caucasian individuals. Genomic analyses reveal strong sequence conservation (only two non synonymous Single Nucleotide Polymorphisms [SNPs]) for the two HERV-W and HERV-FRD envelope genes, i.e. for the two genes specifically expressed in the placenta and possibly involved in syncytiotrophoblast formation. We further show – using an ex vivo fusion assay for each allelic form – that none of these SNPs impairs the fusogenic function. The other envelope proteins disclose variable polymorphisms, with the occurrence of a stop codon and/or frameshift for most – but not all – of them. Moreover, the sequence conservation analysis of the orthologous genes that can be found in primates shows that three env genes have been maintained in a fully coding state throughout evolution including env W and env FRD.
Altogether, the present study strongly suggests that some but not all envelope encoding sequences are bona fide genes. It also provides new tools to elucidate the possible role of endogenous envelope proteins as susceptibility factors in a number of pathologies where HERVs have been suspected to be involved.
A large fraction (8%) of the human genome contains elements of retroviral origin, with thousands of sequences closely related to the integrated proviral form of infectious retroviruses with the canonical gag, prt, pol and env genes . These elements, named human endogenous retroviruses (HERV), are most probably the proviral remnants of ancestral germline infections by active retroviruses, which have thereafter been transmitted in a Mendelian manner. HERVs have been grouped according to sequence homologies into more than 80 distinct families, each containing a few to several hundreds elements (reviewed in [2–4], see  for their classification). Most HERV genes are non-coding, due to either terminating mutations or deletions, but we have characterized 16 human endogenous env genes that have retained a coding capacity among the 30,000 endogenous proviral elements of the human genome . The analysis of their transcriptome in healthy human tissues has revealed that three of them are highly expressed in the placenta, namely the erv3/HERV-R, the HERV-W and the HERV-FRD env genes .
Phylogenetic as well as functional analyses have revealed strong similarities between HERVs and the present-day infectious retroviruses, suggesting a common history and shared ancestors. Accordingly, it has been proposed that HERVs may still possess some of the functions of infectious retroviruses and as such have pathogenic effects, provided that they are transcriptionally active. Conversely, it is also plausible that HERV proteins may have been co-opted by the host for its benefit. Along this line, it has been proposed that the HERV envelope proteins could play a role in several processes including i) protection against infection by present-day retroviruses through receptor interference , ii) protection of the fetus against the maternal immune system via an immunosuppressive domain located in the envelope transmembrane (TM) subunit [8, 9], and iii) placenta morphogenesis through fusogenic effects allowing differentiation of cytotrophoblastic cells into the syncytiotrophoblast [10–12]. In accordance with a symbiotic role for HERVs, it has recently been shown that the HERV-W and HERV-FRD envelope gene products are highly fusogenic glycoproteins that are specifically expressed in the placenta and can mediate cell-cell fusion ex vivo [12, 13]. Involvement of HERV proteins in physiological processes, however, remains a debated issue, and definite evidence is still lacking. Because selection pressure on a functional gene should result in a limited mutation rate, the survey of single nucleotide polymorphisms (SNPs) among the human population is a way to evaluate functional constraints on these genes. Using this approach, we had previously demonstrated that one postulated candidate for a role in placentation, namely the highly-expressed erv3/HERV-R envelope gene carries a homozygous stop mutation resulting in a severe protein truncation in 1% of individuals of caucasian origin, which strongly suggests that it is not necessary for any fundamental placental function . The unexpectedly low number of still coding envelope genes present in the human genome  now allows a comprehensive analysis of such genes to be performed, in order to assess their possible physiological and/or physiopathological role. Here, we analysed the SNP level of the 12 coding env genes present in the human genome that could be characterized by this approach, together with their conservation among the orthologous genes that can be identified in primates. The two series of data are consistent with a role beneficial to the host for some of the genes, whereas others are likely to be subjected to progressive inactivation. In both cases, the identified SNPs should be useful tools to evaluate the possible role of these genes as "susceptibility genes" in several human pathologies where HERVs have been suspected to be involved.
Structure and PCR-amplification of the fully coding HERV envelope genes
The coding envelope genes of the human genome studied for their SNPs.
Bibliographic gene name
Family name (repbase identifier)
Approximate number of elements
Amplification primers sequence (5'-3')
Chr12: 57008431–57010527 (-)
Chr7: 4367317–4369416 (-)
Chr6: 78423172–78425268 (-)
Chr:X: 95874118–95875872 (+)
Chr2: 166767244–166768998 (-)
Chr2: 155931277–155932944 (+)
Chr19: 20341241–20343121 (+)
Chr7: 91710108–91711724 (-)
Chr7: 63863079–63865094 (-)
Chr3: 16786814–16788358 (+)
Chr:6: 11211913–11213529 (-)
SNP of the human coding envelope genes and haplotype analysis
Based on the number of nonsynonymous and indel mutations, a hierarchy among the endogenous coding env genes can be established, with the env W, env FRD and env F(c)1 genes being the most conserved (see also ), and the env K1, env K2, env T, env R, env R(b), and env F(c)2 genes being affected by numerous mutations, including mutations resulting in truncation of the protein due to frameshifting or generation of stop codons. There is no correlation between the number of SNPs and the "age" of the corresponding gene in the primate lineage. This is clearly illustrated in Figure 3, where the env genes have been ordered according to the date of entry of each corresponding provirus into the host genome as previously determined via an analysis of the orthologous loci throughout evolution (reviewed in ). For instance, the env genes of the HERV-K(HML-2) family are human-specific, i.e. are present in the genome of primate since less than five million years (Myrs), whereas env R(b) and env FRD have entered the genome of the common ancestor of Old World and New World monkeys more than 40 Myrs ago. For the env FRD gene, which is among the "oldest" env genes, only 4 SNPs are found, whereas for the "recent" env genes of the HERV-K(HML-2) family, the SNP number can be as high as 16. Although the lack of correlation between the "age" of the genes in the primate lineage and the numbers of SNPs is not unexpected taking into account the occurrence of "bottlenecks" giving rise to founder effects during the evolution of the human population, what remains surprising is the important variability of the number and "severity" of SNPs among the env genes. This should be a strong indication for a differential selection pressure exerted on these genes (see below).
A further characterization of the SNPs, including genotype distribution, haplotype frequency, and linkage disequilibrium was performed (additional data files 1, 2 and 3). The genotype distributions are compatible with a Hardy-Weinberg equilibrium, except for some positions on the HERV-K1 (HERV-K1_44, HERV-K1_133, HERV-K1_403, HERV-K1_482, HERV-K1_651, HERV-K1_673, HERV-K1_2144), HERV-K4 (HERV-K4_292, HERV-K4_369, HERV-K4_382, HERV-K4_586) and HERV-F(c)1 (HERV-F(c)1_122, HERV-F(c)1_226, HERV-F(c)1_235) env genes, consistent with recent integration of these elements in the primate lineage (see Figure 3). Haplotype frequencies were estimated for each gene based on the Expectation-Maximization (EM) algorithm  for haplotypes with frequency estimates >1%. The results are summarized in additional data file 2. The three most frequent haplotypes for each env gene represent >80% of all the haplotypes, suggesting that these regions have a low recombination rate. A linkage disequilibrium (LD) plot was generated (additional data file 3) with pairwise LDs measured between each pair of polymorphisms using the D and D' methods (see materials and methods). As expected, the majority of high LD values occur within the env genes, and the LDs calculated between SNPs of different HERV env genes is low. Low LDs were obtained as well for env genes located on the same chromosome, i.e. for HERV-F(c)2, HERV-K2, HERV-R, HERV-W (on chromosome 7) and for HERV-H1 and HERV-H3 (on chromosome 2; 10 Mb apart). These observations suggest an independent evolution for each env gene.
Interspecific sequence conservation
The present investigation of the fully coding human env genes, including the human SNP search and the analysis of the coding status of the identified primate orthologs, pinpoints two of these genes – namely env FRD and env W – that disclose the characteristic features of a gene subjected to a functional constraint, i.e. low polymorphism and maintenance of an open reading frame during evolution. Interestingly, these two genes are highly – and specifically – expressed in the placenta, and possess a well-characterized fusogenic function which led to the proposal that they are bona fide genes that have been co-opted by the host for a physiological function related to placenta physiology [10, 12, 13, 19]. Among the other genes, env R is of interest since as with the two former genes it is highly expressed in the placenta and has maintained its fully coding status in all species since its entry into the primate lineage. Yet, the SNP analysis discloses a severe polymorphism – including a premature stop codon – indicating that the preservation of an open reading frame during evolution should not be considered as a sufficient criterion to assign a biological function. The other genes are not conserved in a fully coding state in all the primate branches where they are present and/or disclose a severe polymorphism (with in several cases occurrence of a premature stop codon for some allelic forms).
Another hallmark of the presence of a functional constraint on a gene is a low nonsynonymous/synonymous substitution ratio (Ka/Ks) for orthologous genes found in other species. A bona fide gene with a cellular function should be under purifying selection, which prevents deleterious nonsynonymous substitutions from being fixed and usually does not affect synonymous substitutions, leading to Ka/Ks ratios <1, whereas a gene under genetic drift (e.g. a pseudogene) has a Ka/Ks ratio close to unity (reviewed in ). Such an analysis has already been performed for the env FRD and the env W genes. The mean ratio for all pairwise comparisons was 0.37 for env FRD , demonstrating the existence of a selective pressure. For env W, the corresponding ratio was 0.8 , precluding definite conclusions (yet, a subsequent study on env W identified a region of the gene with a lower ratio compatible with a functional constraint specifically exerted on that domain ). We have calculated the Ka/Ks ratios for the other env genes, when nucleotide sequences of their orthologs were available (data not shown). These ratios were found to be heterogeneous for a given env gene, with values ranging from 0.23 to 1.17, again precluding any definite conclusion to be drawn.
According to the present analysis of intra- and interspecific variability of the env genes, one is led to conclude that most probably only two among the twelve studied env genes are likely to be involved in an essential human physiological function, whereas the others would be on their way to conversion into pseudogenes. The presence of a reading frame still open in human for the latter genes may appear intriguing, but one has to keep in mind that they belong to multicopy HERV families, and as such one of the element could have remained open just by chance, without any purifying selection, since even under completely neutral drift it takes time for a sufficient number of mutations to transform a gene into a pseudogene. Along this line, it is of interest to mention the study by Zhang and Webb on the primate V1R pheromone receptor genes, for which there were approximately 140 copies in the genome of the common ancestor of Old World monkeys and hominoids, whereas the human genome has only five V1R genes that retained an ORF. Examination of the orthologous genes in primates showed that none of the five genes kept an intact ORF in all of the apes. Furthermore, for the orthologous sequences with an intact ORF, Ka/Ks ratios were close to unity. The intraspecific variation of these five human genes was also assessed, and for two of them an allelic form generating a premature stop codon was found. Altogether, the authors concluded that there were no functional constraints on these genes since before the separation of hominoids and Old World Monkeys (approximately 23 Myrs ago) and that they were in the process of pseudogeneization in those primate species . Another possible explanation for the "neutral" conservation of an open reading frame for an HERV env gene without any selection pressure from the host could be related to the relatively autonomous status of these parasitic elements, and be associated with the persistence of active retroviral elements responsible for the maintenance – by a reiterated infection process – of some of the HERV families (e.g. [23, 28]).
In any case, it appears clearly that conservation of an env gene with a coding status cannot be taken as the sole criterion for a possible function to the benefit of the host, with only the env W and env FRD genes emerging from the present study as possible bona fide genes. Yet, this does not mean that the other env genes cannot have any effect in humans. Indeed, the present analysis only indicates that they are not under stringent purifying selection, in terms of evolution, but they still could be involved in pathologies – such as tumors or auto-immune diseases – not deleterious to the species because they occur late in the life of the individuals. One should keep in mind that endogenous retroviruses originate from bona fide retroviruses, and as such might have conserved some of the pathological potency of their progenitors. In this respect, the identified SNPs should be essential tools to determine if this is indeed the case, via an analysis of their distribution among selected groups of individuals with a definite pathology. Along this line, the present data on the env T gene are of special interest. Indeed, this gene is the only non-placental env gene found to be highly expressed in a human tissue – the thyroid – of healthy individuals , and the high level of polymorphism of the gene shown in this report together with its lack of conservation in primates suggest that it is not involved in any essential physiological process and thus not subjected to purifying selection. Thanks to the identified SNPs, it can now be tested whether this expressed gene is involved in a pathological process in humans, among which thyroid tumors could be select candidates for a systematic search.
The systematic SNP search on fully coding human endogenous envelope genes, combined with an analysis of the sequence conservation among the orthologs that can be identified among primates revealed that two genes (env W and env FRD) can be considered as bona fide genes, and identified polymorphisms – to a variable extent – in the other genes. The data are consistent with a physiological role for the former (also called syncytin-1 and syncytin-2 and likely to be involved in human placentation) and provide tools for the latter, to determine their potential role in physiological processes and/or their association with pathological processes in humans – which would be the consequence of their original retroviral status.
DNA samples and genotyping
Ninety-one human samples of French Caucasians were collected from the EGEA (Epidemiological study on the Genetics and Environment of Asthma) study, among the controls ascertained without disease. PCR was performed in mixture containing 25 ng of DNA, 0.3 pmol of each primer, 6 nmol of each dNTP, 0.75 units of ExTaq and 1× reaction buffer (Takara). Sequencing reactions were performed according to the Dye Terminator method using an ABI PRISM® 3700 DNA Analyzer (Applied Biosystems, Foster City, CA, USA). Alignment of sequences, SNP discovery and genotyping were performed with Genalys software . The genomic sequences used for the alignment are env FRD (GenBank accession no. AL136139), env R (AC073210), env T (AC078899), env W (AC000064), env Fc(1) (AL354685), env Fc(2) (AC01222), env H1 (AJ289709), env H2 (AJ289710), env H3 (AJ289711), env R(b) (AC093488), env K1 (AC074261), env K2 (AC072054), env K4 (AF164615). The sources of the simian genomic DNAs are given in ref .
The haplotypes frequencies using all polymorphisms for each gene were estimated with the EM Algorithm . The linkage disequilibrium (LD) estimates between pairs of polymorphisms were obtained by estimating the two polymorphisms haplotypes frequencies using this algorithm. Computation of D and D' (standard disequilibrium measure and standardized disequilibrium measure) in additional data file 3 was performed as in ref .
Cloning of allelic forms of the W and FRD human env genes in expression vectors
The FRD and W env genes were PCR-amplified from human genomic DNAs. PCR was carried out for 25 cycles (10 sec at 93°C, 30 sec at 56°C, 4 min at 68°C), in 50 μl, using 100 ng of genomic DNA, 48 pmol of each primer, 350 μM of each dNTP, 0.75 μl Expand long template enzyme mix and 1× reaction buffer (Roche Applied Science). For the FRD env gene, Xho I-containing primers were ATCACCTCGAGCACCATGGGCCTGCTCCTGCTGGTTCTCATTC as forward primer and ATCACCTCGAGGCTTCAGTACAGGTGGATA as reverse primer. For the W env gene, Xho I-containing primers were ATCACCTCGAGAACAACCAGGAGGAAAGTAA as forward primer and ATCACCTCGAGCTGATCAAGTCGCAAAGC as reverse primer. Each PCR product was then Xho I-cleaved and cloned into the phCMV-G vector (described in ) opened with Xho I. Allelic forms of the cloned env genes were assessed by enzymatic restriction. For env FRD, the 1075 G->A transition (aa 359) predicts the loss of a RsaI site and the 1100 C->T transition (aa 367) predicts the loss of a BstUI site, and for env W the 413 G->A transition (aa 138) predicts the gain of a BstXI site and the 920 G->A transition (aa 307) predicts the gain of a Tsp509I site. As two allelic forms (FRD359T/367M and W138Q/307S) were not available among the cloned envelope genes, we constructed them by exchange of restriction fragments (BsmI-NotI for FRD359T/367M and KpnI-XhoI for W138Q/307S).
Cell-cell fusion assay
The human TE671 rhabdomyosarcoma cells (ATCC CRL8805) and 293T embryonal kidney cells (ATCC CRL11268) were grown in Dulbeco's modified Eagle's medium (DMEM) supplemented with 10% fetal calf serum. All cell culture media were supplemented with streptomycin (100 μg/mL) and penicillin (100 U/mL). Cells were transfected using calcium phosphate precipitation (Invitrogen, 5 μg of DNA for 5 × 105 cells). Fusion activity of envelope glycoproteins was measured 12 to 36 h after transfection of the cells with the corresponding expression vectors. To visualize syncytia, cells were fixed in methanol and stained by adding May-Grünwald and Giemsa solutions (Sigma) according to the manufacturer's instructions. The fusion index, which represents the percentage of fusion events in a cell population is defined as [(N-S)/T] × 100, where N is the number of nuclei in the syncytia, S is the number of syncytia, and T is the total number of nuclei counted.
Characterization of the orthologous env T, envF(c)2 and env R(b) env gene ORFs from simians
The size of the env gene open reading frame in the primate loci was evaluated using a direct coupled in vitro transcription/translation assay based on T7 promoter-containing PCR products as described in , which allows to determine the status of both alleles in the same assay.
For the amplification of env T from gorilla and orangutan, the forward T7 promoter-containing primers were GCTAATACGACTCACTATAGGAACAGACCACCATGTCCTGCTTGGATTCATCAC and GCTAATACGACTCACTATAGGAACAGACCACCATGTTGGATTCATCACTCCCA, respectively, and the common reverse flanking primer was CTGAAGGGAGTTCCTCCTAGG. For the amplification of env R(b) from chimpanzee, gorilla, orangutan, gibbon, Rhesus macaque (Old World Monkey) and Callithrix jacchus (New World Monkey) the forward T7 promoter-containing primer was GCTAATACGACTCACTATAGGAACAGACCACCATGGATCCACTACACACGATTGA and the reverse flanking primer was TGTTTTGGGACACCACGAAT. For the amplification of env F(c)2 from chimpanzee and gorilla, the forward T7 promoter-containing primer was GCTAATACGACTCACTATAGGAACAGACCACCATGAATTCTCCATGTGAC and the reverse flanking primer was GACACTTAATAGTTGCGACA.
The simian env gene sequences were deposited in Genbank with accession numbers AJ862646-AJ862655.
List of abbreviations used
human endogenous retrovirus
single nucleotide polymorphism
Long Terminal Repeat
This work was supported by the CNRS and by grants from the Ligue Nationale contre le Cancer (Equipe Labellisée for T.H.). We thank Evelyne Heyer for helpful discussions and acknowledge Christian Lavialle for critical reading of the manuscript.
- Consortium IHGS: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.View ArticleGoogle Scholar
- Boeke JD, Stoye JP: Retrotransposons, endogenous retroviruses, and the evolution of retroelements. Retroviruses. Edited by: Coffin JM, Hughes SH and Varmus HE. 1997, New York, Cold Spring Harbor Laboratory Press, 343-436.Google Scholar
- Bannert N, Kurth R: Retroelements and the human genome: New perspectives on an old relation. Proceedings of the National Academy of Sciences of the United States of America. 2004, 13 Suppl 2: 14572-14579. 10.1073/pnas.0404838101.View ArticleGoogle Scholar
- de Parseval N, Heidmann T: Human endogenous retroviruses: from infectious elements to human genes. Cytogenet Genome Res. 2005, 110: 318-332. 10.1159/000084964.PubMedView ArticleGoogle Scholar
- Repbase update, a database of transposable elements. [http://www.girinst.org]
- de Parseval N, Lazar V, Casella JF, Benit L, Heidmann T: Survey of human genes of retroviral origin: identification and transcriptome of the genes with coding capacity for complete envelope proteins. Journal of Virology. 2003, 77: 10414-10422. 10.1128/JVI.77.19.10414-10422.2003.PubMedPubMed CentralView ArticleGoogle Scholar
- Best S, Le Tissier PR, Stoye JP: Endogenous retroviruses and the evolution of resistance to retroviral infection. Trends in Microbiology. 1997, 5: 313-318. 10.1016/S0966-842X(97)01086-X.PubMedView ArticleGoogle Scholar
- Cianciolo GJ, Copeland T, Orozlan S, Snyderman R: Inhibition of lymphocyte proliferation by a synthetic peptide homologous to retroviral envelope protein. Science. 1985, 230: 453-455.PubMedView ArticleGoogle Scholar
- Mangeney M, Heidmann T: Tumor cells expressing a retroviral envelope escape immune rejection in vivo. Proc Natl Acad Sci USA. 1998, 95: 14920-14925. 10.1073/pnas.95.25.14920.PubMedPubMed CentralView ArticleGoogle Scholar
- Blond JL, Lavillette D, Cheynet V, Bouton O, Oriol G, Chapel-Fernandes S, Mandrand B, Mallet F, Cosset FL: An envelope glycoprotein of the human endogenous retrovirus HERV-W is expressed in the human placenta and fuses cells expressing the type D mammalian retrovirus receptor. Journal of Virology. 2000, 74: 3321-3329. 10.1128/JVI.74.7.3321-3329.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Mi S, Lee X, Li X, Veldman GM, Finnerty H, Racie L, LaVallie E, Tang XY, Edouard P, Howes S, Keith JCJ, McCoy JM: Syncytin is a captive retroviral envelope protein involved in human placental morphogenesis. Nature. 2000, 403: 785-788. 10.1038/35001608.PubMedView ArticleGoogle Scholar
- Blaise S, de Parseval N, Bénit L, Heidmann T: Genomewide screening for fusogenic human endogenous retrovirus envelopes identifies syncytin 2, a gene conserved on primate evolution. Proceedings of the National Academy of Sciences of the United States of America. 2003, 100: 13013-13018. 10.1073/pnas.2132646100.PubMedPubMed CentralView ArticleGoogle Scholar
- Frendo JL, Olivier D, Cheynet V, Blond JL, Bouton O, Vidaud M, Rabreau M, Evain-Brion D, Mallet F: Direct involvement of HERV-W Env glycoprotein in human trophoblast cell fusion and differentiation. Mol Cell Biol. 2003, 23: 3566-3574. 10.1128/MCB.23.10.3566-3574.2003.PubMedPubMed CentralView ArticleGoogle Scholar
- de Parseval N, Heidmann T: Physiological knock-out of the envelope gene of the single copy ERV-3 human endogenous retrovirus in a fraction of the caucasian population. Journal of Virology. 1998, 72: 3442-3445.PubMedPubMed CentralGoogle Scholar
- Turner G, Barbulescu M, Su M, Jensen-Seaman MI, Kidd KK, Lenz J: Insertional polymorphisms of full-length endogenous retroviruses in humans. Current Biology. 2001, 11: 1531-1535. 10.1016/S0960-9822(01)00455-9.PubMedView ArticleGoogle Scholar
- Hughes JF, Coffin JM: Human endogenous retrovirus K solo-LTR formation and insertional polymorphisms: implications for human and viral evolution. Proc Natl Acad Sci U S A. 2004, 101: 1668-1672. 10.1073/pnas.0307885100.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhao Z, Fu YX, Hewett-Emmett D, Boerwinkle E: Investigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution. Gene. 2003, 312: 207-213. 10.1016/S0378-1119(03)00670-X.PubMedView ArticleGoogle Scholar
- Stephens JC, Schneider JA, Tanguay DA, Choi J, Acharya T, Stanley SE, Jiang R, Messer CJ, Chew A, Han JH, Duan J, Carr JL, Lee MS, Koshy B, Kumar AM, Zhang G, Newell WR, Windemuth A, Xu C, Kalbfleisch TS, Shaner SL, Arnold K, Schulz V, Drysdale CM, Nandabalan K, Judson RS, Ruano G, Vovis GF: Haplotype variation and linkage disequilibrium in 313 human genes. Science. 2001, 293: 489-93. Epub 2001 Jul 12.. 10.1126/science.1059431.PubMedView ArticleGoogle Scholar
- Mallet F, Bouton O, Prudhomme S, Cheynet V, Oriol G, Bonnaud B, Lucotte G, Duret L, Mandrand B: The endogenous retroviral locus ERVWE1 is a bona fide gene involved in hominoid placental physiology. Proc Natl Acad Sci U S A. 2004, 101: 1731-1736. 10.1073/pnas.0305763101.PubMedPubMed CentralView ArticleGoogle Scholar
- Laird N: Computational Statistic: the EM algorithm. Handbook of Statistics. Edited by: Rao CR. 1993, Amsterdam, Elsevier Science Publishers, 9: 509–520-10.1016/S0169-7161(05)80138-5.Google Scholar
- Blaise S, Ruggieri A, Dewannieux M, Cosset FL, Heidmann T: Identification of an envelope protein from the FRD family of Human Endogenous Retroviruses (HERV-FRD) conferring infectivity on retroviral particles and functional conservation among simians. Journal of Virology. 2004, 78: 1050-1054. 10.1128/JVI.78.2.1050-1054.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Herve CA, Forrest G, Lower R, Griffiths DJ, Venables PJ: Conservation and loss of the ERV3 open reading frame in primates. Genomics. 2004, 83: 940-943. 10.1016/j.ygeno.2003.10.003.PubMedView ArticleGoogle Scholar
- Benit L, Calteau A, Heidmann T: Characterization of the low-copy HERV-Fc family: evidence for recent integrations in primates of elements with coding envelope genes. Virology. 2003, 312: 159-168. 10.1016/S0042-6822(03)00163-6.PubMedView ArticleGoogle Scholar
- de Parseval N, Casella JF, Gressin L, Heidmann T: Characterization of the three HERV-H proviruses with an open envelope reading frame encompassing the immunosuppressive domain and evolutionary history in primates. Virology. 2001, 279: 558-569. 10.1006/viro.2000.0737.PubMedView ArticleGoogle Scholar
- Fay JC, Wu CI: Sequence divergence, functional constraint, and selection in protein evolution. Annu Rev Genomics Hum Genet. 2003, 4: 213-235. 10.1146/annurev.genom.4.020303.162528.PubMedView ArticleGoogle Scholar
- Bonnaud B, Bouton O, Oriol G, Cheynet V, Duret L, Mallet F: Evidence of selection on the domesticated ERVWE1 env retroviral element involved in placentation. Mol Biol Evol. 2004, 21: 1895-1901. 10.1093/molbev/msh206.PubMedView ArticleGoogle Scholar
- Zhang J, Webb DM: Evolutionary deterioration of the vomeronasal pheromone transduction pathway in catarrhine primates. Proc Natl Acad Sci U S A. 2003, 100: 8337-8341. 10.1073/pnas.1331721100.PubMedPubMed CentralView ArticleGoogle Scholar
- Belshaw R, Pereira V, Katzourakis A, Talbot G, Paces J, Burt A, Tristem M: Long-term reinfection of the human genome by endogenous retroviruses. Proc Natl Acad Sci U S A. 2004, 101: 4894-4899. 10.1073/pnas.0307800101.PubMedPubMed CentralView ArticleGoogle Scholar
- Takahashi M, Matsuda F, Margetic N, Lathrop M: Automated identification of single nucleotide polymorphisms from sequencing data. J Bioinform Comput Biol. 2003, 1: 253-265. 10.1142/S021972000300006X.PubMedView ArticleGoogle Scholar
- Devlin B, Risch N: A comparison of linkage disequilibrium measures for fine-scale mapping. Genomics. 1995, 29: 311-322. 10.1006/geno.1995.9003.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.