ESTs and EST-linked polymorphisms for genetic mapping and phylogenetic reconstruction in the guppy, Poecilia reticulata
© Dreyer et al; licensee BioMed Central Ltd. 2007
Received: 16 February 2007
Accepted: 08 August 2007
Published: 08 August 2007
The guppy, Poecilia reticulata, is a well-known model organism for studying inheritance and variation of male ornamental traits as well as adaptation to different river habitats. However, genomic resources for studying this important model were not previously widely available.
With the aim of generating molecular markers for genetic mapping of the guppy, cDNA libraries were constructed from embryos and different adult organs to generate expressed sequence tags (ESTs). About 18,000 ESTs were annotated according to BLASTN and BLASTX results and the sequence information from the 3' UTRs was exploited to generate PCR primers for re-sequencing of genomic DNA from different wild type strains. By comparison of EST-linked genomic sequences from at least four different ecotypes, about 1,700 polymorphisms were identified, representing about 400 distinct genes. Two interconnected MySQL databases were built to organize the ESTs and markers, respectively. A robust phylogeny of the guppy was reconstructed, based on 10 different nuclear genes.
Our EST and marker databases provide useful tools for genetic mapping and phylogenetic studies of the guppy.
The Trinidadian guppy, Poecilia reticulata Peters, is well known for the highly polymorphic male color patterns, which have been the subject of genetic analysis for almost a century. The vast literature on the ecology and evolution of the guppy and the extensive phenotypic variation in wild populations make the guppy a particularly attractive choice for understanding the molecular basis of adaptation to varying natural conditions. Despite the wealth of field studies, molecular genetic information about the guppy is still scarce. A genetic map would therefore be a first step towards identifying quantitative adaptive traits of the guppy, including male ornamentation and the predator-driven adaptations that underlie heritable differences in life history traits in different river habitats in Trinidad [2, 3].
The genome size of the guppy is estimated to be around 740 Mbp, with a diploid set of 46 chromosomes, including genetically defined X and Y sex chromosomes. A first map of the sex chromosomes, based on classical genetic analysis of male colour patterns, has been sketched out by Winge and co-workers. More recently, Phang and co-workers used ornamental guppies from Singapore to generate a genetic map based on 300 RAPD markers, and a cross between two laboratory strains of different body shape and colour was mapped using a combination of 186 AFLP and microsatellite markers [6–8]. Conservation of microsatellites between closely related species and synteny with respect to 61 microsatellite markers were suggested by an intergeneric cross between Xiphophorus maculatus and P. reticulata. In all of these studies, the number of linkage groups fell short of the chromosome number, indicating that a higher marker density is required for complete coverage of the genetic map. Unfortunately, RAPD and AFLP markers cannot be easily reused for studying crosses between outbred strains of wild guppies.
We have identified hundreds of expressed sequence tag (EST)-linked single nucleotide polymorphism (SNP) markers, suitable for genetic mapping of wild guppies. The fact that these markers are linked to expressed genes will help to exploit syntenic information from fully sequenced genomes of other fish species. This will in turn also facilitate future identification of candidate genes when mapping qualitative morphological as well as quantitative life history traits of the guppy.
Results and discussion
A P. reticulata EST database
Several guppy cDNA libraries were constructed using SMART technology, as detailed in Materials and Methods. As sources of mRNA we used whole embryos, newborn fish, adult liver, testis, brain, retina, and skin, in order to obtain a broad spectrum of different expressed sequences. Several feral and laboratory strains were used, including the Quare6 strain from East Trinidad and the Tranquille strain from West Trinidad. This allows for direct sequence comparison of abundant transcripts between different strains. Between 100 and 5,700 clones were picked at random from each library, depending on its complexity. The inserts were first sequenced from the 5' end and the sequences were compared to EMBL vertebrate databases (see Methods) using the NCBI BLASTN and BLASTX algorithms to assign a possible function. BLAST results were parsed and automatically entered into a MySQL EST database. Sequences without sufficiently good BLAST support, that is, those whose hits had an e-value higher than 10⁻⁵, were not entered into the database, but were set aside for periodically repeated BLAST searches.
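The e-value triage described above can be sketched as follows. This is only an illustration, not the authors' parser: the `BlastHit` record, its field names, and the function names are hypothetical, and the only value taken from the text is the 10⁻⁵ cutoff.

```python
from typing import NamedTuple, Optional

E_VALUE_CUTOFF = 1e-5  # threshold stated in the text


class BlastHit(NamedTuple):
    """Minimal hit record; the field names are hypothetical."""
    est_id: str      # clone name of the query EST
    subject: str     # accession of the database hit
    e_value: float


def best_hit(hits: list[BlastHit]) -> Optional[BlastHit]:
    """Return the hit with the lowest e-value, or None if there are no hits."""
    return min(hits, key=lambda h: h.e_value) if hits else None


def triage(hits_by_est: dict[str, list[BlastHit]]):
    """Split ESTs into annotated entries and a list set aside for later re-BLAST."""
    annotated: dict[str, BlastHit] = {}
    set_aside: list[str] = []
    for est_id, hits in hits_by_est.items():
        top = best_hit(hits)
        if top is not None and top.e_value <= E_VALUE_CUTOFF:
            annotated[est_id] = top      # good enough support: enter in database
        else:
            set_aside.append(est_id)     # e-value above cutoff, or no hit at all
    return annotated, set_aside
```

ESTs that land in `set_aside` correspond to the sequences that are kept out of the database and searched again later.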
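[Table: Origin of guppy ESTs. Columns include the source of cDNA (surviving entry: Istanbul wild, skin); the remaining table contents were not recoverable.]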
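[Table: Composition of guppy EST database by annotation, giving the percentage of ESTs in each category. Surviving category entries: DEAD box DDX5; Cold shock domain fruYP1; Receptor for activated prot K; No significant homology. The percentages were not recoverable.]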
The EST database can be searched by clone name, by accession number of the best BLAST search hit, and by possible biological function of the deduced protein product of each EST. Furthermore, the BLAST hits are saved in a table, in which a full text search for annotations as well as accession numbers can be performed. A subset of EST clones was also sequenced from the 3' end, and the longest ORF was extracted from the assembly with its 5' sequence. Information on the coding regions was obtained by database searches using BLASTX. Contigs representing multiple hits of the same gene product were aligned in order to analyse these for polymorphisms.
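As an illustration of the ORF extraction step, the sketch below finds the longest open reading frame on the forward strand of an assembled sequence. It is a simplification under stated assumptions: only ATG-initiated ORFs ending in a stop codon are considered, the reverse strand is ignored, and partial ORFs at the sequence ends (common in ESTs) are not handled. The function name is hypothetical.

```python
import re

# ATG, then whole codons, then the first in-frame stop codon.
# The lookahead allows ORFs in all three forward frames to be found,
# even when they overlap.
ORF_RE = re.compile(r"(?=(ATG(?:[ACGT]{3})*?(?:TAA|TAG|TGA)))")


def longest_orf(seq: str) -> str:
    """Return the longest forward-strand ORF (start to stop inclusive), or ''."""
    orfs = [m.group(1) for m in ORF_RE.finditer(seq.upper())]
    return max(orfs, key=len, default="")
```

For example, `longest_orf("ccatgaaatga")` returns `"ATGAAATGA"`, and a sequence without any complete ORF returns an empty string.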
We provide a download function for the extraction of the 5' and 3' end cDNA sequences, as well as assemblies in the FASTA format. It is also possible to perform BLAST searches against all ESTs.
Development of single nucleotide polymorphism (SNP) markers in guppies
For identification of polymorphisms, we used ESTs to design PCR primers for amplification from genomic DNA of different strains. We primarily amplified 3' UTR sequences, because they are less conserved than coding sequences and not interrupted by introns. The 3' UTRs are typically shorter than in mammals, as has been reported for other lower vertebrates. This somewhat reduces the usefulness of 3' UTRs for identification of SNPs by resequencing of genomic DNA from different strains. Several EST libraries were constructed (Table 1) and some consisted of size-selected subfractions (data not shown). In the different libraries, between 10 and 30% of the cDNA sequences had 3' UTRs longer than 400 bp and those were given priority. The 3' end of the coding sequence was included where required to produce fragments about 400 to 500 bp in length. In many instances, PCR amplification and sequencing revealed that the 3' ends of coding regions contained short (< 100 bp) introns. A comparison of ESTs from different strains reveals that, as expected, coding sequences are less polymorphic than the 3' UTRs.
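The target selection rule described above (prefer long 3' UTRs, and extend into the 3' end of the coding sequence only when the UTR alone is too short) can be sketched as a small helper. The function name, the 0-based half-open coordinates, and the default target length of 450 bp are assumptions made for illustration; the text only states a range of about 400 to 500 bp.

```python
def amplicon_window(cds_end: int, seq_len: int, target: int = 450) -> tuple[int, int]:
    """Choose the cDNA region to amplify, as a 0-based half-open (start, end).

    cds_end : index of the first base after the stop codon (start of the 3' UTR)
    seq_len : total length of the cDNA sequence
    target  : desired fragment length (assumed value, within the 400-500 bp range)
    """
    utr_len = seq_len - cds_end
    if utr_len >= target:
        # The 3' UTR alone is long enough: stay entirely within it.
        return cds_end, cds_end + target
    # Otherwise include the 3' end of the coding sequence to reach the target.
    start = max(0, seq_len - target)
    return start, seq_len
```

A window that reaches into the coding sequence may span one of the short introns mentioned above, so the genomic product can be longer than the cDNA coordinates suggest.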
Some primers that were designed to flank an intron were also used, yet their efficient design required prediction of the most likely exon-intron boundaries by alignment of the ESTs to genomic sequences from other fish species, and the length of the resulting PCR products was unpredictable. Aside from SNPs, 3' UTRs and introns also contained short insertions and deletions (indels) at lower frequency (about 20% of all polymorphisms). Of these indels, about 60% were found as parts of either short tandem repeat polymorphisms or of homopolymer stretches (data not shown).
A second MySQL database was established for the management of the strain-specific markers and was linked to the EST database. From the marker database, information can be retrieved on the reference clone, the primer pairs used, and the type (SNP, indel) and position of polymorphisms between genomic sequences of the different guppy strains. The marker database can also accommodate information on available assays for these polymorphisms. So far, these are mainly MALDI-TOF assays developed for high-throughput detection of SNP markers. All multiple alignments of genomic and cDNA sequences that had been generated for polymorphism detection were loaded into the multiple SNP query tool (MSQT), a database used for SNP assay development, especially for design of strain-specific assays (Warthmann, Fitz and Weigel, submitted).
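To make the link between the two databases concrete, the following sketch models an EST table and a marker table that refers back to its reference clone. It is not the authors' schema: the table and column names are hypothetical, the demo rows and primer strings are invented placeholders, and Python's built-in sqlite3 module stands in for MySQL so that the example runs without a server.

```python
import sqlite3

# Hypothetical schema: one row per EST clone, one row per polymorphism.
SCHEMA = """
CREATE TABLE est (
    clone_name TEXT PRIMARY KEY,
    strain     TEXT,
    tissue     TEXT
);
CREATE TABLE marker (
    marker_id       INTEGER PRIMARY KEY,
    reference_clone TEXT NOT NULL REFERENCES est(clone_name),
    forward_primer  TEXT,
    reverse_primer  TEXT,
    kind            TEXT CHECK (kind IN ('SNP', 'indel')),
    position        INTEGER
);
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with invented placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO est VALUES ('clone_001', 'Quare6', 'embryo')")
    conn.executemany(
        "INSERT INTO marker (reference_clone, forward_primer, reverse_primer,"
        " kind, position) VALUES (?, ?, ?, ?, ?)",
        [
            ("clone_001", "FWD_PLACEHOLDER", "REV_PLACEHOLDER", "indel", 305),
            ("clone_001", "FWD_PLACEHOLDER", "REV_PLACEHOLDER", "SNP", 120),
        ],
    )
    return conn


def markers_for_clone(conn: sqlite3.Connection, clone_name: str):
    """Return (kind, position) for every marker linked to one EST clone."""
    cur = conn.execute(
        "SELECT m.kind, m.position FROM marker AS m "
        "JOIN est AS e ON e.clone_name = m.reference_clone "
        "WHERE e.clone_name = ? ORDER BY m.position",
        (clone_name,),
    )
    return cur.fetchall()
```

Keeping the reference clone as a foreign key is one simple way to let a marker record lead back to the annotation stored for its EST.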
[Table: Polymorphisms in nuclear genes that distinguish strains. Table contents were not recoverable.]
Molecular phylogeny of the guppy
Previous molecular phylogenetic studies on guppies were primarily based on mitochondrial sequences. The molecular resources we developed enable studies on molecular evolution of nuclear genes, e.g. the identification of genes that have rapidly diversified and are potentially under positive selection, as exemplified by a study comparing genes encoding the long-wavelength-sensitive opsins to other opsins.
Additional applications include studies on molecular phylogeny of the guppy. Coding sequences of orthologous expressed nuclear genes from nine different fish species, including the guppy, were identified by reciprocal BLAST. For phylogenetic analysis, these sequences were concatenated and a BIONJ tree was reconstructed using SplitsTree4 software, version 4.6 [see Additional file 1]. The selected genes include highly conserved ribosomal proteins as well as metabolic enzymes and transcription factors [see Additional file 2]. The number of teleost species in this analysis was limited by the sequence data available from so-called non-model species, since the set of orthologous genes shared by all species shrank as more non-model species were included.
The resulting topology was essentially the same when only four coding sequences were concatenated. By convergence tests using increasing numbers of nuclear genes we showed that the pairwise distance statistics and the bootstrap values further improved upon addition of more sequences. This showed that a stable tree topology requires seven or more genes (data not shown). Furthermore, the topology of the phylogenetic tree was the same when maximum likelihood or maximum parsimony estimation was used instead of BIONJ (data not shown). This is in agreement with previously published comparisons of reconstruction methods, showing that BIONJ is not inferior to maximum likelihood. This tree confirms previous phylogenetic studies on Poeciliid fishes, based on morphological criteria [30, 31] and on genomic sequences [25, 32, 33]. In the future, intraspecific polymorphisms found in nuclear genes of feral guppy populations of different origins will be compared to previous phylogenies of different guppy strains (Willing et al. in preparation). This will enable investigation of population structure and mapping of quantitative adaptive traits.
We established a non-normalized EST library of the guppy from embryos and six different adult tissues, containing 18,000 entries with three-fold redundancy on average. We show that re-sequencing of 3' UTRs from genomic DNA of different guppy strains is a powerful strategy to identify polymorphisms existing between feral guppies of geographically different origins.
Sequence information of 10 nuclear coding genes is sufficient to reconstruct a robust teleost phylogeny including the guppy Poecilia reticulata.
We used the laboratory strains Istanbul wild and Blue as well as offspring from wild-caught guppies that had been kept in the laboratory for multiple generations. The wild guppies were from the following locations: Lower Oropuche and Quare Rivers strains (Oropuche Drainage, Trinidad); Tranquille and APUFI strains (Caroni Drainage, Trinidad); Cumaná guppies (central Cumaná, Venezuela; EnCCFR), and PV6 (Rio San Miquel, Estado Sucre, Venezuela).
Whole late-eyed and very late-eyed embryos were isolated from euthanised pregnant females. Embryos as well as newborn fish and tissues from adult guppies were frozen in liquid nitrogen. Frozen embryos, newborn fish, or small pieces of tissue (Table 1) were homogenized for 90 seconds using a rotor-stator homogenizer (Polytron PT 1200, Kinematica) at full speed and total RNA was isolated using an RNeasy Miniprep or Midiprep kit (Qiagen). RNA from male skin was prepared using Trizol (Invitrogen). Starting with total RNA extracted from whole embryos of the Quare6 and Tranquille strains, newborn fish (Tranquille), adult liver (Tranquille), testis (Blue), or skin (Oropuche2), polyA+ RNA was enriched using Qiagen Oligotex on a spin column.
Total RNA was reverse transcribed and cloned into a pDNRlib vector using a Creator SMART™ cDNA construction kit (Clontech). The cDNA was amplified by 18 cycles of long distance PCR using the BD Advantage 2 PCR™ enzyme mixture, following the manufacturer's protocols.
Alternatively, cDNA was transcribed from polyA+ RNA using the manufacturer's protocol for primer extension, which included only 6 to 10 cycles of PCR. Size fractionation of the double-stranded cDNA after digestion with SfiI was performed on a 1.2% low melting agarose gel stained for 2 min with 0.1% methylene blue. After ligation, the products were dialysed against H2O for 20 minutes on a 0.025 μm Millipore filter before transformation into electrocompetent ElectroTen-Blue cells (Stratagene). Clones were picked at random and plasmid DNA was isolated with a BioRobot 8000 using MagAttract 96 minipreps (Qiagen). cDNAs were sequenced from both ends using the pDNRlib-specific vector primers pDNRfw: AGTCAGTGAGCGAGGAAGC or pDNRrev: CCAAACGAATGGTCTAGAAAG on an Applied Biosystems 3730xl DNA Analyzer. The resulting sequence trace files in ABI format were processed with pregap4 (Staden package), using the phred base calling algorithm. Vector sequence and low quality sequence were trimmed.
The cDNA was preliminarily annotated based on the best hit resulting from BLAST (version 2.5.15) or BLASTX against the current version of the vertebrate databases available in the GCG Wisconsin Package (em_vrt, embl_new, tags_new, nr).
A Perl script ran nightly NCBI BLASTN and BLASTX jobs for new ESTs as well as for ESTs with BLAST results older than a given time period. The BLAST information, together with clone name, strain and tissue of origin, was stored in the MySQL EST database (MySQL version 5.0.27-max; binaries and documentation can be downloaded from the MySQL website). A PHP (version 5.1.2) web interface allows user-friendly access to this information and download of all sequences in FASTA format.
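The selection rule used by the nightly job (new ESTs, plus ESTs whose BLAST results have aged beyond some period) can be illustrated as follows. The original script was written in Perl and is not reproduced here; this Python sketch uses hypothetical names, and the maximum age is a parameter because the text does not state the period.

```python
from datetime import datetime, timedelta
from typing import Optional


def needs_blast(last_blast: Optional[datetime], now: datetime,
                max_age: timedelta) -> bool:
    """True for ESTs never searched, or whose results are older than max_age."""
    return last_blast is None or (now - last_blast) > max_age


def select_for_rerun(last_blast_by_est: dict[str, Optional[datetime]],
                     now: datetime, max_age: timedelta) -> list[str]:
    """Return clone names that should be queued for tonight's BLAST run."""
    return [est for est, stamp in last_blast_by_est.items()
            if needs_blast(stamp, now, max_age)]
```

Passing `now` explicitly rather than calling `datetime.now()` inside the function keeps the selection reproducible and easy to test.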
Additional links allow direct access to alignments of guppy ESTs with genomic sequences of Oryzias latipes, Tetraodon nigroviridis, and Takifugu rubripes. A total of 16,200 ESTs with a minimum length of 200 bp of available sequence have been submitted to GenBank (see Table 1 for accession numbers).
The internal features of the database include a PHP web interface for administrative tasks like uploading new ESTs, updating of annotation, adding and changing position of introns (predicted and found in genomic sequences) as well as entering links to markers listed in the MySQL marker database that also has a PHP web interface.
Analysis of genomic sequences
Gene-specific PCR primers (18 to 23 mers, Tm 59 to 62°C) were designed from P. reticulata cDNAs using the Primer3 program (release 1.0). Genomic DNA was extracted using a DNeasy Tissue kit (Qiagen), and was amplified by PCR using a mixture of Taq polymerase and Pfu (Fermentas; 1000:1 dilution), following this protocol: 94°C for 5 min; 5 cycles of 94°C, 30 sec; 60°C, 30 sec; 68°C, 90 sec; followed by 31 cycles of 94°C, 30 sec; touchdown to annealing temperature from 68 to 56°C, 30 sec; 68°C, 90 sec; and a final elongation step of 68°C for 6 min. PCR products were purified using Bioline Quick Clean and sequenced using the same forward primers as in the original PCR. Alternatively, M13 forward and reverse tags were added to the gene-specific primers; in this case the PCR protocol was 94°C for 5 min; 35 cycles of 94°C, 30 sec; 59°C, 30 sec; 68°C, 90 sec; then 68°C for 6 min. Sequencing was performed using M13 forward (5'-TGTAAAACGACGGCCAGT-3') or reverse (5'-CAGGAAACAGCTATGACC-3') primers.
Genes were included in the dataset if they met the following criteria: (1) orthologous sequences in each of ten species, and (2) alignable over most of the coding region. Genes were searched by reciprocal BLAST between the guppy and each of the nine other species, and considered to be homologues if they had significant TBLASTX scores (E-value < 10⁻⁵⁰) and reliable sequence identity values (minimum 70% identity with D. rerio and more than 95% identity with X. maculatus). The open reading frame of each gene was manually examined after translating the nucleotide sequences with the getorf module from the EMBOSS package, version 3.0.
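The inclusion criteria above combine a reciprocal best-hit check with score and identity thresholds. A minimal sketch of that logic is given below; the function names and the dictionary-based inputs are hypothetical, and only the numerical thresholds come from the text. Parsing of the actual TBLASTX output is deliberately left out.

```python
from typing import Optional

E_VALUE_MAX = 1e-50                        # "E-value < 10^-50"
MIN_IDENTITY = {"D. rerio": 70.0}          # "minimum 70% identity"
ABOVE_IDENTITY = {"X. maculatus": 95.0}    # "more than 95% identity"


def reciprocal_best_hit(gene: str, forward_best: dict[str, str],
                        reverse_best: dict[str, str]) -> Optional[str]:
    """Return the partner gene if guppy->other and other->guppy best hits agree."""
    partner = forward_best.get(gene)
    if partner is not None and reverse_best.get(partner) == gene:
        return partner
    return None


def passes_thresholds(species: str, e_value: float, identity: float) -> bool:
    """Apply the E-value cutoff and the species-specific identity limits."""
    if e_value >= E_VALUE_MAX:
        return False
    if species in MIN_IDENTITY and identity < MIN_IDENTITY[species]:
        return False
    if species in ABOVE_IDENTITY and identity <= ABOVE_IDENTITY[species]:
        return False
    return True
```

A gene would enter the dataset only if both checks succeed for every species compared; species without a stated identity limit are filtered by E-value alone in this sketch.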
The coding regions were aligned in a codon-based manner, by translating the nucleotide sequences into peptide sequences with transeq, included in the EMBOSS package, and then aligning them with MUSCLE, version 3.6. Afterwards the nucleotide alignment was generated from the peptide alignment using the tranalign module included in the EMBOSS package. Flanking gap regions were deleted.
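The idea behind the codon-based alignment step is that every gap in the peptide alignment becomes a three-base gap in the nucleotide alignment, so codons stay intact. The sketch below shows that mapping for a single sequence. It is an illustration of the principle, not a replacement for tranalign: the function name is hypothetical, and it assumes the coding sequence contains exactly one codon per aligned residue (no trailing stop codon, no frameshifts).

```python
def codon_align(aligned_peptide: str, cds: str) -> str:
    """Thread an unaligned coding sequence onto its gapped peptide alignment."""
    n_residues = sum(aa != "-" for aa in aligned_peptide)
    if len(cds) != 3 * n_residues:
        raise ValueError("coding sequence length must be 3 x number of residues")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    # A gap in the peptide becomes a three-base gap; a residue takes the next codon.
    return "".join("---" if aa == "-" else next(codons) for aa in aligned_peptide)
```

For example, `codon_align("M-K", "ATGAAA")` returns `"ATG---AAA"`.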
For tree estimation, the ten nucleotide alignments were concatenated for each species to obtain a single alignment. The tree shown in Additional file 1 was reconstructed using BIONJ as implemented in SplitsTree4, version 4.6.
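Concatenation joins the per-gene alignments end to end so that each species contributes one long row. A minimal sketch, assuming each alignment is held as a dictionary from species name to aligned sequence (a hypothetical representation chosen for brevity):

```python
def concatenate(alignments: list[dict[str, str]]) -> dict[str, str]:
    """Join per-gene alignments into one alignment with one row per species."""
    if not alignments:
        return {}
    species = set(alignments[0])
    for aln in alignments:
        if set(aln) != species:
            raise ValueError("every alignment must contain the same species")
        if len({len(row) for row in aln.values()}) != 1:
            raise ValueError("rows within one alignment must have equal length")
    # Genes are joined in the order given, so columns stay comparable across species.
    return {sp: "".join(aln[sp] for aln in alignments) for sp in sorted(species)}
```

The checks reflect the requirement stated in the inclusion criteria that every gene be present in every species; a missing species would otherwise shift the columns of the combined alignment.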
Note added in proof
The paper cited above as "Warthmann, Fitz and Weigel, submitted" has now been accepted for publication: Warthmann, N., Fitz, J., and Weigel, D. (2007) MSQT for choosing SNP assays from multiple DNA alignments. Bioinformatics, accepted for publication.
We wish to thank Kay Nieselt for helpful suggestions; Heike Keller for expert technical assistance; Joffrey Fitz for help with MSQT design; Cheng Soon Ong and Gunnar Rätsch for help with intron site prediction; David Reznick, Felix Breden, Anne Magurran, and Axel Meyer for guppy strains; and Rainer Bohrer for regular updates of the BLAST database files used. This work was funded by the Max Planck Society, of which DW is a director.
- Winge Ö: The location of eighteen genes in Lebistes reticulatus. J Genetics. 1927, 18: 1-43.
- Endler JA: Variation in the appearance of guppy color patterns to guppies and their predators under different visual conditions. Vision Res. 1991, 31: 587-608. 10.1016/0042-6989(91)90109-I.
- Endler JA: Multiple-trait coevolution and environmental gradients in guppies. Trends Ecol Evol. 1995, 10: 22-29. 10.1016/S0169-5347(00)88956-9.
- Traut W, Winking H: Meiotic chromosomes and stages of sex chromosome evolution in fish: zebrafish, platyfish and guppy. Chromosome Res. 2001, 9: 659-672. 10.1023/A:1012956324417.
- Winge Ö, Ditlevsen E: Colour inheritance and sex determination in Lebistes. Heredity. 1947, 1: 65-83.
- Khoo G, Lim MH, Suresh H, Gan DK, Lim KF, Chen F, Chan WK, Lim TM, Phang VP: Genetic linkage maps of the guppy (Poecilia reticulata): assignment of RAPD markers to multipoint linkage groups. Mar Biotechnol (NY). 2003, 5: 279-293. 10.1007/s10126-002-0072-3.
- Watanabe T, Nakajima M, Yoshida M, Taniguchi N: Construction of six linkage groups in the guppy (Poecilia reticulata). Anim Genet. 2004, 35: 147-148. 10.1111/j.1365-2052.2004.01098.x.
- Watanabe T, Yoshida M, Nakajima M, Taniguchi N: Linkage mapping of AFLP and microsatellite DNA markers with the body color- and sex-determining loci in the guppy (Poecilia reticulata). Zoolog Sci. 2005, 22: 883-889. 10.2108/zsj.22.883.
- Kazianis S, Nairn RS, Walter RB, Johnston DA, Kumar J, Trono D: The genetic map of Xiphophorus fish represented by 24 multipoint linkage groups. Zebrafish. 2004, 1: 287-304. 10.1089/zeb.2004.1.287.
- Brummell M, Kazianis S, Davidson WS, Breden F: Conservation of Synteny Between Guppy and Xiphophorus Genomes. Zebrafish. 2006, 3: 347-358. 10.1089/zeb.2006.3.347.
- Shimizu N, Sasaki T, Asakawa S, Shimizu A, Ishikawa SK, Imai S, Murayama Y, Himmelbauer H, Mitani H, Furutani-Seiki M: Comparative genomics of medaka and fugu. Comparative Biochemistry and Physiology, Part D. 2006, 1: 6-12.
- Magurran AE: Evolutionary Ecology: The Trinidadian Guppy. 2005, Oxford University Press, Oxford.
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.
- MySQL. [http://www.mysql.org]
- Dzwillo M: Genetische Untersuchungen an domestizierten Stämmen von Lebistes reticulatus (Peters). Mitteilungen des Hamburgischen Zool Museum Inst. 1959, 57: 575-584.
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038/75556.
- Paschall JE, Oleksiak MF, VanWye JD, Roach JL, Whitehead JA, Wyckoff GJ, Kolell KJ, Crawford DL: FunnyBase: a systems level functional annotation of Fundulus ESTs for the analysis of gene expression. BMC Genomics. 2004, 5: 96-108. 10.1186/1471-2164-5-96.
- Pearson WR, Lipman DJ: Improved tools for biological sequence comparison. Proc Natl Acad Sci USA. 1988, 85: 2444-2448. 10.1073/pnas.85.8.2444.
- Mazumder B, Seshadri V, Fox PL: Translational control by the 3'-UTR: the ends specify the means. Trends Biochem Sci. 2003, 28: 91-98. 10.1016/S0968-0004(03)00002-1.
- Jurinke C, van den Boom D, Cantor CR, Koster H: Automated genotyping using the DNA MassArray technology. Methods Mol Biol. 2001, 170: 103-116.
- Staden R, Beal KF, Bonfield JK: The Staden package, 1998. Methods Mol Biol. 2000, 132: 115-130.
- Accelrys GCG 11.0 GenHelp Online Documentation. [http://www.accelrys.com/support/bio/genhelp/]
- Becher SA, Russell ST, Magurran AE: Isolation and characterization of polymorphic microsatellites in the Trinidadian guppy (Poecilia reticulata). Molecular Ecology Notes. 2002, 2: 456-458. 10.1046/j.1471-8286.2002.00276.x.
- Xu P, Wang S, Liu L, Peatman E, Somridhivej B, Thimmapuram J, Gong G, Liu Z: Channel catfish BAC-end sequences for marker development and assessment of syntenic conservation with other fish species. Animal Genetics. 2006, 37: 321-326. 10.1111/j.1365-2052.2006.01453.x.
- Breden F, Ptacek MB, Rashed M, Taphorn D, Figueiredo CA: Molecular phylogeny of the Live-Bearing Fish Genus Poecilia (Cyprinodontiformes: Poeciliidae). Molecular Phylogenetics and Evolution. 1999, 12: 95-104. 10.1006/mpev.1998.0600.
- Hoffmann M, Tripathi N, Henz SR, Lindholm AK, Weigel D, Breden F, Dreyer C: Opsin gene duplication and diversification in the guppy, a model for sexual selection. Proc Biol Sci. 2007, 274: 33-42. 10.1098/rspb.2006.3707.
- Gascuel O: BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. Mol Biol Evol. 1997, 14: 685-695.
- Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006, 23: 254-267. 10.1093/molbev/msj030.
- Guindon S, Gascuel O: Efficient biased estimation of evolutionary distances when substitution rates vary across sites. Mol Biol Evol. 2002, 19: 534-543.
- Rosen DE, Bailey RM: The poeciliid fishes (Cyprinodontiformes), their structure, zoogeography and systematics. Bull Am Mus Nat Hist. 1963, 126: 1-176.
- Parenti LR, Rauchenberger M: Systematic overview of the Poeciliines. Ecology and evolution of livebearing fishes (Poeciliidae). Edited by: Meffe GK, Snelson FF. 1989, Englewood Cliffs, New Jersey: Prentice Hall, 3-12.
- Volff JN, Korting C, Meyer A, Schartl M: Evolution and discontinuous distribution of Rex3 retrotransposons in fish. Mol Biol Evol. 2001, 18: 427-431.
- Hrbek T, Seckinger J, Meyer A: A phylogenetic and biogeographic perspective on the evolution of poeciliid fishes. Mol Phylogenet Evol. 2007, 43: 986-998. 10.1016/j.ympev.2006.06.009.
- Fajen A, Breden F: Mitochondrial DNA sequence variation among natural populations of the Trinidad Guppy. Evolution. 1992, 46: 1457-1465. 10.2307/2409949.
- Alexander HJ, Breden F: Sexual isolation and extreme morphological divergence in the Cumana guppy: a possible case of incipient speciation. J Evol Biol. 2004, 17: 1238-1254. 10.1111/j.1420-9101.2004.00788.x.
- Ewing B, Hillier L, Wendl M, Green P: Basecalling of automated sequencer traces using phred. I. Accuracy assessment. Genome Research. 1998, 8: 175-185.
- Guppy Databases. [http://guppy.weigelworld.org]
- Rozen S, Skaletsky HJ: Primer3 on the WWW for general users and for biologist programmers. Bioinformatics Methods and Protocols: Methods in Molecular Biology. Edited by: Krawetz S, Misener S. 2000, Totowa, NJ: Humana Press, 365-386.
- EMBOSS. [http://emboss.sourceforge.net/apps]
- Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.