Skip to main content

Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies



Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454-expresed sequence tag (EST) libraries, carried out comparative studies between L. luteus and model legume species, developed a comprehensive set of EST-simple sequence repeat (SSR) markers, and validated their utility on diversity studies and transferability to related species.


Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 (young leaves, buds and flowers) and L2 (immature seeds) EST- libraries. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession’s origin.


L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection.


L. luteus is a member of the genistoid clade of the Fabaceae family (2n = 52), which is the third largest flowering plant family with over 700 genera and 20,000 species [1]. The genus Lupinus comprises more than 200 annual and perennial herbaceous species of which several are cultivated and used as human food or animal feed [2]. Some of them show high levels of tolerance to biotic and abiotic stresses. For instance, L. hispanicus, a wild relative of L. luteus, has high tolerance to diseases and good adaptation to poor soils, but high levels of bitter alkaloids and low agronomic yields [3]. Lupins are considered to be of polyploid origin which probably played a crucial role in the evolution of their ancestral genomes [4, 5]. The major cultivated species are the old world lupin L. albus (white lupin), L. angustifolius (narrow-leafed lupin), L. luteus (yellow lupin), and the new world species L. mutabilis (pearl lupin or tarwii) [6].

L. luteus is widely distributed across the Mediterranean region, has shallow soil requirements, and cultivated accessions have variable seed yields in Mediterranean environments [7]. In addition, yellow lupin seeds have the highest protein content and twice the cysteine and methionine content of most lupins [8, 9]. However, despite its highly nutritional qualities, there is a lack of genetic and molecular tools to aid the genetic breeding of this species.

EST sequencing has accelerated gene discovery when genome sequences are not available, facilitating gene family identification and development of molecular markers. Next-generation sequencing has generated enormous amount of expressed sequence data for a wide number of plant species, specially minor or orphan crops [10]. For example, EST and genome sequencing of lentil and chickpea would not have been feasible without next-generation sequencing [11, 12]. The lower cost and greater sequence yield has allowed the identification of candidate genes, even when they are expressed at low levels [13, 14].

Research on plants, animals and fungi has shown that sequences of expressed genes are often widely transferable among species, and even genera, allowing wide genome comparative mapping studies [15, 16]. For instance, the combination of orphan crop EST sequences with model plant genetic and genomic resources, such as Lotus japonicus (Japanese trefoil) and Medicago truncatula (barrel medic), has identified macro- and micro-scale synteny, discovered new genes and alleles, and provided insights into genome evolution and duplication [17, 18]. Comparisons between ESTs and gene sequences among several legume species have allowed comparative genome studies between L. albus and M. truncatula[19], and L. angustifolius and Lotus japonicus[20].

Several molecular markers have been developed for Lupinus species, including RFLPs, ITAPs (Intron targeted amplified polymorphic sequences), and AFLPs, which have been used to build genetic linkage maps in L. albus[19] and L. angustifolius[20, 21]. So far, a limited number of SSRs have been developed for Lupinus species, and very few of these are EST-SSRs i.e. SSRs that are found in expressed sequences [2123]. Genomic and EST-SSRs have been widely used for the improvement of major crop plants, but their initial development with traditional methods requires significant research investment. Now, an almost unlimited number of genomic and EST-SSRs can be readily developed from next-generation sequencing approaches within most crop species, including orphan crops such as lupin [2428]. The expressed nature of EST-SSRs allows the annotation of these markers with putative functions by sequence homology and potentially reduces the genetic distance between marker and causal gene to 0 cM. [29, 30]. For instance, the length of a dinucleotide SSR at the 5’ UTR of a waxy gene has been associated with amylase content in rice [31, 32]. EST-SSRs have also been associated with several disease resistant genes in wheat and rice [33, 34] and a number of agronomically important traits in cotton, maize and narrow-leafed lupin [3537].

In this study, we constructed 454-EST libraries, carried out comparative studies between L. luteus and model legume species, and mapped L. luteus expressed sequences on the M. truncatula chromosomes. Alignments between our putative L. luteus genes and their homologs in M. truncatula, coupled with amplifications of intergenic regions provided evidence of microscale synteny between both species. In addition, we developed EST-SSR markers and illustrated their utility within diverse accessions of yellow lupin. Finally, because these EST-SSR markers are gene-based, they are also likely conserved among different species of lupin. We evaluated EST-SSR utility in the other Lupinus species, L. mutabilis and L. hispanicus.


Library construction and 454 sequencing

cDNA libraries were constructed from mRNA isolated from two tissue pools. Pool 1 (L1) included young leaves, buds and flowers, and pool 2 (L2), seeds in different developmental stages. RNA from pool 1 and 2 was isolated separately according to the guanidine hydrochloride method [38]. Both RNAs were assessed for quality by inspecting rRNA bands on an Agilent Bioanalyzer (Agilent Technologies, CA, USA).

cDNAs libraries were normalized and prepared using procedures for Roche 454 Titanium sequencing (Roche, Branford, CT, USA). cDNAs from L1 and L2 were synthesized using the stratagene AccuScript High Fidelity RT-PCR System (Agilent Technologies, CA, USA) and 5’ specific adaptors from Clontech. A cDNA normalization was used to improve coding sequence coverage, avoid AT homopolymer artifacts, and reduce excessive 3’ end transcript sequence [39]. cDNAs from both libraries were amplified using the Clontech Advantage HF system (Clontech Laboratories, Inc) and normalized utilizing the Evrogen Trimmer cDNA Normalization kit (Axxora, LLC). These un-cloned, normalized cDNA libraries were prepared for pyrosequencing according to the manufacturers specifications. One 454 run of sequencing was performed for each EST library (454 Life Sciences, Roche).

Separate transcriptome assemblies of L1 and L2 libraries were created using Newbler (de novo sequence assembly software of Roche 454 Life Sciences) and the cDNA option. A third assembly (L1L2) was completed using the reads from both libraries to avoid sequence redundancy when developing SSR markers. Reads were initially assembled into contigs and contigs into isotigs, which are equivalent to splice transcriptional variants. Sequence read EST data for L1 and L2 are available through the Sequence Read Archive (SRA055806).

EST annotation, function and comparative genomics to other species

Comparing isotigs from the combined assembly (L1L2) to the curated non-redundant protein database (nr,; blastx, e value ≤ 1e-10) provided a functional annotation for each isotig. Alignments of translated-isotigs and proteins with an e-value ≤ 1e-40 were considered to have significant homology. Annotations of the aligned proteins were extrapolated to annotate our putative isotig sequence using Blast2GO ( To directly compare the lupin isotigs to the genes of other crops, blast searches were also used to compare isotig translations to Arabidopsis thaliana, Glycine max, Medicago truncatula and Lotus japonicus Gene Indices (tblastx, e-value ≤ 1e-10). Isotigs were also annotated using Gene Ontology (GO) annotations from InterProScan (

In silico lupin EST mapping and microsynteny

Blast was used to compare lupin EST isotigs to the Medicago genome 3.0 release (≤ 1e-20, HSP identity 60% and HSP length > 50 bp.) The Blast results were visualized using GBrowse where positive matches were displayed as featured tracks on GBrowse 2.13 [40]. The presence of microsynteny was evaluated by PCR amplification of putatively conserved chromosome blocks between L. luteus and M. truncatula. Where alignments between yellow lupin and M. truncatula were identified, specific primer pairs were designed to amplify intergenic regions (Additional file 1). These targeted, intergenic regions were PCR amplified from two L. luteus and one L. hispanicus accessions using 100 ng of genomic DNA in 20 ul reactions containing 100 ng of genomic DNA, 0.2 mM dNTPs, 2 mM MgCl2, 1X PCR buffer, 2.5% DMSO, 1 U taq polymerase (Agilent Technologies, Santa Clara, CA) and 5 pmoles of each forward-reverse primer pair. PCR reactions were carried out following a touchdown protocol on a peltier thermalcycler (MJ Research, Inc.) 94°C for 5 min; 5 cycles of 1 min at 94°C, 1 min at 55-65°C decreasing 1°C per cycle, 2 min at 72°C followed by 35 cycles of 1 min at 94°C, 1 min at 50-60°C and 2 min at 72°C. Amplicons were purified from agarose gels and sequenced. These amplified, intergenic sequences were mapped onto the M. truncatula genome and visualized within a local implementation of GBrowse (Additional file 1). Positive PCR microsynteny set of primers were additionally tested against a screening panel consisting of six diverse accessions of L. luteus to search for polymorphisms among yellow lupin genotypes (Additional file 2).

Identification of EST-SSRs

SSR containing lupin isotigs were identified using the software MISA (MIcroSAtellite, SSR search criteria changed according to repeat types. Di-, and tri-repeats were selected with a minimum length of 12 and 15 nucleotides, respectively. For tetra-, penta- and hexa-repeats, the minimum length was 20 nucleotides. Mononucleotide repeats were not considered due to the possibility of 454 homopolymer sequencing errors associated with this technology. To estimate the amount of SSRs included in coding regions, L1L2 sequences were analyzed using ESTScan ( ORFs discovery was carried out using default parameters and putative cd sequences scanned for SSR motifs using MISA.

From all selected-SSR containing isotigs, only sequences with a motif of at least 7 repeat units were considered for primer design. Flanking primer pairs were designed using the Primer3 software available at NCBI v.3.12 with expected amplicon lengths between 150 - 500 bp. Oligonucleotides were synthesized by IDT (Integrated DNA Technologies, Inc.).

Evaluation and utility of EST-SSRs

EST-SSR polymorphisms and transferability were evaluated on the germplasm screening panel previously mentioned, and one accession each of L. hispanicus and L. mutabilis.

DNAs were extracted following standard procedures [41], quantified using a synergy HT Multimode Microplate Reader (Biotek Instruments, Winooski, VT), and diluted to 50 ng/ul in TE buffer (10 Mm TRIS, 1 mM EDTA pH 7.5). DNA amplification was carried out in 20ul PCR reactions as described above.

PCR products were separated on 6% denaturing polyacrylamide gels, run in TBE buffer at 60 watts for 3–4 hours and visualized using silver stain procedures. DNA amplicons of six EST-SSR primer-pairs used in the polymorphism screening were purified from agarose gels and sequenced in an Applied Biosystems 3730xl DNA Analyzer sequencer (Applied Biosystems, Carlsbad, CA). Amplicon sequences from each EST-SSR primer-pairs were aligned using Geneious version (Biomatters Ltd., using default parameters).

Genetic diversity

The polymorphic EST-SSRs were evaluated in sixty-four L. luteus accessions from several origins (Poland, Ukraine, the former Soviet Union, Spain, Germany, Morocco, Belarus, Portugal, Netherlands, Israel, Hungary, and Chile; Additional file 2). Polish accessions were kindly provided by W.K. Swiecicki, Institute of Plant Genetics, Polish Academy of Sciences, Poznan. Our collection of Chilean accessions is composed of improved breeding lines that are adapted to the Chilean environment. This Chilean germplasm originated from breeding and selection of old European varieties for Southern Chilean environmental conditions. The rest were obtained from the western Regional PI Station, USDA, ARS, WRPIS, Washington State University, Regional Plant Introduction Station, Pullman, Washington, USA. A sample of 50 polymorphic EST-SSRs was used to genotype the sixty-four L. luteus accessions (Table 1). Eighteen EST-SSRs were identified from isotigs specific to L2, 25 isotigs specific to L1, and seven were common to both L1 and L2 libraries. EST-SSR fragments with different sizes were scored as different alleles and coded with alphabetical letters for each primer set. Genetic relationships among L. luteus accessions were evaluated using the neighbor-joining algorithm implemented in PAUP* (v4.01b10). A distance tree was built and branch support estimated by 10,000 bootstraps.

Table 1 Characteristics of 50 EST-SSR primers developed in L. luteus. Shown for each primer pair are the library specificity, repeat motif, forward and reverse sequence, allele range size (bp), number of alleles, amplification in other Lupin species, and annotation


Seed and leaf-flower EST libraries

Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 and L2 EST libraries, respectively (Table 2). L1 produced 604,869 usable reads that assembled into 26,975 contigs with an average length of 468 nucleotides. L2 generated 1,345,892 usable reads that assembled into 43,674 contigs with an average length of 800 nucleotides. Careful inspection of the L1 contigs found lower percentages of coding regions, higher A/T content, and 2x more A/T homopolymers than L2 contigs. A combined assembly (L1L2) was created to identify the genes that were common in both tissues. 1,964,517 reads were used in the L1L2 assembly and they formed 71,655 contigs with an average contig length of 632 nucleotides. To reduce sequence redundancy due to transcript and alternative splice variants, L1L2 contigs were clustered into 55,309 isotigs, of which 38,200 isotigs translated into proteins and 8,741 of them were full length.

Table 2 cDNA 454 assembly statistics of L1, L2 and L1L2 L. luteus libraries

Functional classification and in silico comparative genomics

The assembled 454 isotigs represented putative transcriptional products i.e. functional genes. Blastx was used to annotate the L1L2 putative genes (i.e. isotigs). A total of 32,862 (59.5%) putative genes showed matches with other species (≤1e-10). Of these sequences, 20,169 (36.5%) showed high similarity to other plant species genes (≤1e-40). GO annotations were grouped under three categories: molecular function, biological processes, and cellular components (Figure 1). At least 31,142 isotigs were annotated with one molecular function, 11,894 with a cellular component and 22,842 with biological process.

Figure 1

GO term annotations for L1L2. Isotigs were grouped under three categories: (a) molecular function, (b) biological processes, and (c) cellular components. Numbers between parentheses indicate the number of positive matches for each function.

Blast was used to compare L1L2 to several model species (tblastx; ≤ 1e-10; Figure 2). Around 57% (31,520) of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species.

Figure 2

Venn diagram summarizing the distribution of tBlastX matches between L. luteus and four model species ( A. thaliana, M. truncatula, L. japonicus and G. max ). Numbers following the model species correspond to the size of the respective data base. Numbers within the Venn diagram indicate the number of sequences sharing similarity using tBLASTx. Numbers within parenthesis indicate the percentage of matches in terms of the total number of L. luteus sequences.

In silico mapping of lupin ESTs on M. Truncatula chromosomes

Alignment of L. luteus isotig sequences to the M. truncatula genome (Blastn; ≤1e-20; MT3) was used to identify local genomic variability between our ESTs and a related, well-annotated reference genome sequence. The alignments were visualized using GBrowse (v. 2.13) with the Blast matches displayed as feature tracks. A total of 25,400 sequences (46%) from L1L2 had a positive match with MT3 and were distributed heterogeneously on the M. truncatula chromosomes. Chromosomes 3 and 1 had the highest (34,636) and lowest (16,055) number of matches, respectively. Each L. luteus sequence was mapped to an average of 3.7 positions on the Medicago genome.

Occasionally, independent alignments of lupin genes with the M. truncatula genome were found relatively close to each other that primers could be designed to hybridize conserved exons, allowing the amplification of intergenic sequences in between lupin and M. truncatula coding sequences (Figure 3). Positive PCR amplification of intergenic regions using L. luteus genomic DNA and primers anchored on conserved exonic regions of adjacent M. truncatula genes suggested the occurrence of microsynteny (i.e. conserved gene order) between yellow lupin and Medicago. Thirty-three out of 79 (42%) primer pairs amplified clear PCR products. 16 pairs showed expected sizes based on Medicago genomic regions. The remainder primer pairs amplified shorter or longer lupin fragments than the fragments amplified in M. truncatula. Amplicon sequence data for L. luteus containing intergenic DNA sequence were mapped onto the Medicago genome using blast (Figure 3). The alignments between L. luteus and Medicago showed high levels of conservation in the coding regions, but little sequence similarity in the intergenic regions. When L. hispanicus DNA was included as PCR template, only 23 primer pairs amplified. Variable amplification was likely due to localized sequence polymorphism within the primer binding site (i.e. small indels) and not the lack of microsynteny. This ratio (23/33) is similar to the number of EST-SSRs that were found to amplify fragments in both species. Alignments among L. luteus and L. hispanicus were possible at intergenic regions but sequences were clearly less similar than coding regions.

Figure 3

Microsyntenic L. luteus DNA fragments mapped on the Medicago genome using a GBrowse platform. (a) L. luteus microsyntenic region 13 on M. truncatula chromosome 1; (b) L. luteus microsyntenic region 5 on M. truncatula chromosome 1; (c) L. luteus microsyntenic region 11 on M. truncatula chromosome 2.

When these markers were evaluated on the screening panel of diverse germplasm accessions, 10 had length polymorphism for these intergenic regions (Additional file 1). In addition to EST-SSRs, this new Conserved Microsynteny (CMS) marker could be valuable resource for crop improvement with molecular markers.

Identification of EST-SSRs

A total of 2,572 isotig sequences contained at least one EST-SSR, with a frequency of one SSR per 17.75 kilobases (Table 3). The observed frequencies for di-, tri-, tetra-, penta-, and hexa-repeats were 30.4%, 52.7%, 2.4%, 7.5% and 6.2%, respectively (Table 4). Among the di-nucleotide repeats, the AT/TA motif was the most frequently observed (49%) followed by GA/CT (45%). The AC/GT motif was found in low frequency (6%) and there were no CG/GC motifs in the Lupinus sequences. Tri-nucleotide repeats, predominantly A/T-rich motifs (74.5%), were the most frequent tri-nucleotide repeat found in the Lupinus transcriptome. These tri-nucleotide repeats were often found within the coding sequence of putative genes (77.2%). GAA/CTT motif was the most frequent tri-nucleotide repeat (31%).

Table 3 Features of EST-SSRs identified in assembled L1L2 L. luteus library
Table 4 Distribution of repeat types and number of repeats within the L1L2 L. luteus library

Evaluation of EST-SSRs within yellow lupin and other lupin species

Studies involving repeat sizes and level of polymorphism have suggested a positive correlation between repeat number and rates of polymorphisms, especially in dimeric microsatellites [28, 42]. Thus, only EST-SSRs containing at least 7 repeat units were selected for validation to increase the likelihood of finding markers polymorphic between lupin accessions. A total of 783 EST-SSR candidate loci had sufficient repeat units, but only 375 had enough repeat flanking sequence to be suitable for primer design. PCR amplification of these markers resulted in 222 EST-SSRs (59%) that were polymorphic among the six diverse L. luteus included in screening panel. 130 EST-SSRs were monomorphic and 23 primer-pairs failed to amplify. A small number (6) of EST-SSRs were validated by Sanger sequencing. The amplicon sequences from four different L. luteus genotypes and from L. hispanicus and L. mutabilis confirmed the existence of SSR motifs and their length variability between lupin accessions (Figure 4). EST-SSR amplicons showed high conservation at the flanking SSR regions of both Lupinus species when compared with L. luteus. However, several indels were observed in adjacent regions and within the SSR motif, especially in L. mutabilis.

Figure 4

Alignment of L. luteus , L. hispanicus and L. mutabilis containing several repeat motifs. (a) isotig03739 with GA and AGA motifs; (b) isotig16318 with a TAA motif; and (c) isotig21236 with a GAA motif.

Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions (Table 1 and Additional file 2). Twenty-four of these selected markers were specific to L1 (leaf-flower EST library), 20 EST-SSRs were specific to L2 (seed EST library), and 6 were present in both libraries. Neighbor-joining distance analysis detected several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions (Figure 5). However, no clear geographical patterns (country of origin) were observed among lupin accessions. Interestingly, Chilean accessions were distributed in most clusters, probably reflecting the breeding history of these genotypes. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively.

Figure 5

Neighbour Joining tree relating the 64L. luteus accessions included in the diversity study. Numbers above branches correspond to bootstrap values. Accessions are identified by a letter L followed by numbers. Letters around accessions identify country of origin based on seed bank or breeding histories (RUS: Russia, ISRL: Israel, HUNG: Hungary, CHIL: Chile, GER: Germany, SPN: Spain, PORT: Portugal, MORO: Morocco, POL: Poland, BYS: Belarus, UKR: Ukraine). The scale is in distance units.


Next-generation sequencing has reduced the existing gap between major crop genomic platforms and the limited resources that are currently available for orphan crops [10]. Complete transcriptome sequencing has generated species specific molecular markers, in silico expression analyses, gene discovery, and phylogenetic relationships [43, 44].

In this research, we used 454 cDNA sequences to assemble transcriptomes of two tissues (L1 and L2) of yellow lupin. We recovered a large number of previously unknown and uncharacterized yellow lupin gene sequences (Table 2). The total number of sequences for the combined library was mostly additive from L1 and L2. The L1 library favored the inclusion of longer 3’UTR regions, and thus, reducing the amount of coding sequences needed to assemble longer combined contigs (L1L2). As a consequence, two or more sequences belonging to the same transcript may not be assembled together, causing an overestimation of expressed sequences. The larger amount of 3’UTR regions for L1 is also in agreement with the lower GC content, condition typically associated with untranslated regions [45, 46]. Undoubtedly, a number of expressed sequences are tissue specific and will not assemble into combined contigs. For instance, several genes related to seed dormancy and germination are not expressed in vegetative and floral tissues [47, 48]. The same specificity was observed in a number of tissues and plant species [4951]. The assembly of L1L2 generated 55,309 isotigs of which 30,811 had similarity to putative proteins found in other plant species. Comparative studies carried out against L. japonicus, M. truncatula and G. max showed a total of 31,520 lupin sequences similar to at least one of the model legume databases and 22,219 were similar to all of them. Lotus and Medicago belong to the Galegoid subclade, which includes mostly temperate legume species [52]. Glycine is a member of the Phaseoloid subclade which comprises mostly tropical species [52]. Lupins belong to the Genistoid subclade, which is sister (and distant) to most of the described Papilionoid subclades; especially those containing most domesticated species [53].

Although micro-repeat motifs are frequent in plant genomes and their respective transcriptomes, the frequency of SSR discovery depends on the search criteria [42, 5456]. We analyzed 55,309 lupin isotig sequences using MISA and identified 2,796 SSR motifs with an average frequency of one SSR per 17.75 kbp. Tri-nucleotide repeats were the motifs most frequently found in L. luteus expressed sequences. Similar results have been reported in numerous plant species [26, 28, 54, 55, 57]. The abundance of trimeric EST-SSRs has been attributed to the absence of frameshift mutations when there is length variation in these SSRs [58]. Indeed, 1,435 EST-SSRs were discovered within coding regions of the gene. Among tri-nucleotide repeats, AT-rich motifs were the most predominant ones (74.5%), which have also been observed in soybean, Citrus and Arabidopsis [54, 57]. For di-nucleotide repeats, AT was the most frequently observed motif, contrasting with results from Arabidopsis, soybean, maize, rice, wheat and barley where AC/GT were the most frequent repeats [26, 28, 54, 55, 57]. The high proportion of untranslated sequences (specifically 3’UTR), mainly contributed from the L1, could explain the bias toward A/T-rich repeat sequences observed in yellow lupin. There were no CG repeats in the lupin sequences, similar to results obtained in barrel medic [24], rice, corn, soybean [57], wheat [27], Sorghum [25], Arabidopsis, apricot and peach [59].

We used GBrowse to visualize lupin ESTs aligned to the M. truncatula chromosomes (Figure 3). This approach potentially identifies paralogs sequences and allows color-coded alignment by BLAST significance [60]. A total of 25,400 L. luteus contigs were localized and found to be distributed across the entire Medicago genome with chromosomes Mt1 and Mt3 having the highest number of gene matches. Each yellow lupin sequence was mapped to an average of 3.7 locations, which may correspond in part to rounds of genome duplications previously described for the Medicago genome [61]. Understanding syntenic relationships among species is essential to exploit the available tools developed for comparative genomic analysis. Using this approach, we created a new method of developing molecular markers, markers that are based on conserved microsynteny (CMS) between orphan and model species. Genome comparisons among M. truncatula, G. max and L. japonicus have shown that, in general, most genes in Papilionoid legume species are likely to be found within a relatively long syntenic region of any other Papilioniod species [62]. Positive amplification and sequencing of L. luteus intergenic regions, based on PCR primers located on M. truncatula adjacent genes, suggested the existence of microscale synteny between these legume species. Roughly 40% of the targeted intergenic L. luteus regions amplified, points out the usefulness of conserved legume chromosome blocks for genomic studies of orphan crops. Although some primer pairs failed to amplify, poor amplification could be a consequence of non-synteny, but also other technical limitations could also explain negative PCR results. For instance it is known that non-coding DNA regions are highly variable among species [63, 64], and negative PCR amplifications could easily due to excessively long L. luteus intergenic regions.

Few studies have reported the use of EST-SSRs in Lupinus species [19, 21, 22]. Most efforts have focused on genetic linkage mapping and in diversity studies in L. angustifolius[20], L. albus[21] and L. luteus[22]. To validate our L. luteus polymorphic markers we tested 50 EST-SSRs on a population of 64 genotypes of L. luteus. An analysis of genotypic diversity illustrated the existence of several clusters within L. luteus germplasm. The lack of a clear pattern following the geographical accession origin (country) could be explained by three reasons. 1) The number of accessions may not have been large enough to allow a clear pattern to emerge. 2) L. luteus is widely distributed across the Mediterranean region, mainly due to human introductions [6]. This situation could have homogenized natural genetic distinctiveness, leaving mostly population subdivisions based on breeding histories. 3) Finally, it is possible some accessions could have been misclassified; and thus, obscuring an existing geographical clustering pattern.

We observed that a number of high yellow lupin EST-SSR amplified fragments in two other lupin species, L. hispanicus and L. mutabilis (Table 1). The high number of transferable markers between L. luteus and L. hispanicus confirmed their closer genetic relationship [5, 65] than L. luteus and L. mutabilis. The two closely related species have the same chromosome number (2n = 52) and are still interfertile, generating a natural hybrid called hispanicoluteus[66]. Phylogenetic studies have placed new and old world lupins into two different clades [5, 65, 67]. Thus, most EST-SSRs amplified in L. mutabilis (2n = 48), the only cultivated new world lupin [65], should have high transferability rates to other lupin species, such as L. albus and L. angustifolius. The understanding of the genetic diversity among other close relative lupin species will facilitate the transfer of favorable variation into cultivated species. For instance, L. hispanicus has been suggested as a reservoir of favorable variation for a number of biotic and abiotic stresses currently affecting L. luteus[68, 69].


L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discoveries, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparison studies and putative gene candidates for QTL detection. We are also exploiting the microsyntenic regions observed among L. luteus and legume model species to saturate yellow lupin linkage maps by amplifying conserved regions across legume species. The utilization of these tools will allow transforming L. luteus into a valid temperate legume crop alternative.


  1. 1.

    Mace ES, Varshney RK, Mahalakshmi V, Seetha K, Gafoor A, Leeladevi Y, Crouch JH: In silico development of simple sequence repeat markers within the aeschynomenoid/dalbergoid and genistoid clades of the Leguminosae family and their transferability to Arachis hypogaea, groundnut. Plant Sci. 2008, 174: 51-60. 10.1016/j.plantsci.2007.09.014.

    Article  CAS  Google Scholar 

  2. 2.

    Gladstones JS: Distribution, origin, taxonomy, history and importance. Lupins as crop plants: biology, production and utilization. Edited by: Gladstones JS, Atkins CA, Hamblin J. 1998, CAB International, Cambridge, United Kingdom, 1-37.

    Google Scholar 

  3. 3.

    Múzquiz M, Burbano C, Gorospe MJ, Ródenas I: A chemical study of Lupinus hispanicus seed—toxic and antinutritional components. J Scie Food Agr. 1989, 47: 205-214. 10.1002/jsfa.2740470208.

    Article  Google Scholar 

  4. 4.

    Wolko B, Weeden NF: Estimation of Lupinus genome polyploidy on the basis of isozymic loci number. Genet Pol. 1989, 30: 165-170.

    Google Scholar 

  5. 5.

    Naganowska B, Wolko B, Sliwinska E, Kaczmarek Z: Nuclear DNA content variation and species relationships in the genus Lupinus (Fabaceae). Ann Bot. 2003, 92: 349-355. 10.1093/aob/mcg145.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  6. 6.

    Petterson DS: Composition and food uses of lupin. Lupins as a crop plants: biology, production and utilization. Edited by: Gladstones JS, Atkins CA, Hamblin J. 1998, CAB International, Wallingford, UK, 353-384.

    Google Scholar 

  7. 7.

    Gladstones JS: Lupins of the Mediterranean Region and Africa. Technical Bulletin No. 26. 1974, Western Australian Department of Agriculture, Western Australia, 43-48.

    Google Scholar 

  8. 8.

    Glencross BD, Carter CG, Duijster N, Evans DR, Dods K, McCafferty P, Hawkins WE, Maas R, Sipsas S: A comparison of the digestibility of a range of lupin and soybean protein products when fed to either Atlantic salmon (Salmo salar) or rainbow trout (Oncorhynchus mykiss). Aquaculture. 2004, 237: 333-346. 10.1016/j.aquaculture.2004.03.023.

    Article  CAS  Google Scholar 

  9. 9.

    Berville ABS, Heinanen J, Kartuzova LT, Bernatskaya ML, Chmeleva ZV: Diversity of lupin (Lupinus L.) based on biochemical composition. Plant Genet Res Newsletter. 2003, 134: 42-57.

    Google Scholar 

  10. 10.

    Varshney RK, Close TJ, Singh NK, Hoisington DA, Cook DR: Orphan legume crops enter the genomics era!. Curr Opin Plant Biol. 2009, 12: 202-210. 10.1016/j.pbi.2008.12.004.

    Article  PubMed  Google Scholar 

  11. 11.

    Kaur S, Cogan N, Pembleton LW, Shinozuka M, Savin KW, Materne M, Forster JW: Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery. BMC Genomics. 2011, 12: 265-10.1186/1471-2164-12-265.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  12. 12.

    Hiremath PJ, Farmer A, Cannon SB, Woodward J, Kudapa H, Tuteja R, Kumar A, Bhanuprakash A, Mulaosmanovic B, Gujaria N, Krishnamurthy L, Gaur PM, Kavikishor PB, Shah T, Srinivasan R, Lohse M, Xiao Y, Town CD, Cook DR, May GD, Varshney RK: Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa. Plant Biotechnol J. 2011, 9: 922-931. 10.1111/j.1467-7652.2011.00625.x.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  13. 13.

    Cheung F, Haas BJ, Goldberg SMD, May GD, Xiao Y, Town CD: Sequencing Medicago truncatula expressed tags using 454 Life Science technology. BMC Genomics. 2006, 7: 272-10.1186/1471-2164-7-272.

    PubMed Central  Article  PubMed  Google Scholar 

  14. 14.

    Weber APM, Weber KL, Carr K, Wilkerson C, Ohlrogge JB: Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing. Plant Physiol. 2007, 144: 32-42. 10.1104/pp.107.096677.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  15. 15.

    Yu JK, La Rota M, Kantety RV, Sorrells ME: EST derived SSR markers for comparative mapping in wheat and rice. Mol Genet Genomics. 2004, 271: 742-751.

    Article  CAS  PubMed  Google Scholar 

  16. 16.

    Rexroad CE, Rodriguez MF, Coulibaly I, Gharbi K, Danzmann RG, DeKoning J, Phillips R, Palti Y: Comparative mapping of expressed sequence tags containing microsatellites in rainbow trout (Oncorhynchus mykiss). BMC Genomics. 2005, 6: 54-10.1186/1471-2164-6-54.

    PubMed Central  Article  PubMed  Google Scholar 

  17. 17.

    Young ND, Udvardi M: Translating Medicago truncatula genomics to crop legumes. Curr Opin Plant Biol. 2009, 12: 193-201. 10.1016/j.pbi.2008.11.005.

    Article  CAS  PubMed  Google Scholar 

  18. 18.

    Aubert G, Morin J, Jacquin F, Loridon K, Quillet MC, Petit A, Rameau C, Lejeune-Henaut I, Huguet T, Bursting J: Functional mapping in pea, as an aid to the candidate gene selection and for investigating synteny with the model legume Medicago truncatula. Theor Appl Genet. 2006, 112: 1024-1041. 10.1007/s00122-005-0205-y.

    Article  CAS  PubMed  Google Scholar 

  19. 19.

    Phan HT, Ellwood SR, Adhikari K, Nelson MN, Oliver RP, Phan HT, Ellwood SR, Adhikari K, Nelson MN, Oliver RP: The first genetic and comparative map of white lupin (Lupinus albus L.): Identification of QTLs for anthracnose resistance and flowering time, and a locus for alkaloid content. DNA Res. 2007, 14: 59-70. 10.1093/dnares/dsm009.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  20. 20.

    Nelson MN, Moolhuijzen PM, Boersma JG, Chudy M, Lesniewska K, Bellgard M, Oliver RP, Swiecicki W, Wolko B, Cowling WA, Ellwood SR: Aligning a New Reference Genetic Map of Lupinus angustifolius with the Genome Sequence of the Model Legume, Lotus japonicus. DNA Res. 2010, 17: 73-83. 10.1093/dnares/dsq001.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  21. 21.

    Nelson M, Huyen P, Ellwood S, Moolhuijzen P, Hane J, Williams A, O’Lone C, Fosu-Nyarko J, Scobie M, Cakir M, Jones M, Bellgard M, Ksiazkiewicz M, Wolko B, Barker S, Oliver R, Cowling W: The first gene-based map of Lupinus angustifolius L.-location of domestication genes and conserved synteny with Medicago truncatula. Theor Appl Genet. 2006, 113: 225-238. 10.1007/s00122-006-0288-0.

    Article  CAS  PubMed  Google Scholar 

  22. 22.

    Parra Gonzalez L, Straub S, Doyle J, Mora Ortega P, Salvo Garrido H, Maureira Butler I: Development of microsatellites inLupinus luteus(Fabaceae) and cross-species amplification in other lupine species. Am J Bot. 2010, 97: e72-e74. 10.3732/ajb.1000170.

    Article  Google Scholar 

  23. 23.

    Gao LL, Hane JK, Kamphuis LG, Foley R, Shi BJ, Atkins CA, Singh KB: Development of genomic resources for the narrow-leafed lupin (Lupinus angustifolius): construction of a bacterial artificial chromosome (BAC) library and BAC-end sequencing. BMC Genomics. 2011, 12: 521-10.1186/1471-2164-12-521.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  24. 24.

    Gupta S, Prasad M: Development and characterization of genic SSR markers in Medicago truncatula and their transferability in leguminous and non-leguminous species. Genome. 2009, 52: 761-771. 10.1139/G09-051.

    Article  CAS  PubMed  Google Scholar 

  25. 25.

    Kantety RM, Mathews DE, Sorrels ME: Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol Biol. 2002, 148: 501-510.

    Article  Google Scholar 

  26. 26.

    La Rota RM, Kantety RV, Yu JK, Sorrells ME: Nonrandom distribution and frequencies of genomic and EST-derived microsatellite markers in rice, wheat, and barley. BMC Genomics. 2005, 6: 23-35. 10.1186/1471-2164-6-23.

    PubMed Central  Article  PubMed  Google Scholar 

  27. 27.

    Nicot N, Chiquet VB, Amilhat L, Legeai F, Leroy P, Bernard M, Sourdille P: Study of simple sequence repeat (SSR) markers from wheat expressed sequence tags (ESTs). Theor Appl Genet. 2004, 109: 800-805. 10.1007/s00122-004-1685-x.

    Article  CAS  PubMed  Google Scholar 

  28. 28.

    Thiel T, Michalek W, Varshney RK, Graner A: Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet. 2003, 106: 411-422.

    CAS  PubMed  Google Scholar 

  29. 29.

    Domon E, Fuijita M, Ishikawa N: The insertion/deletion polymorphisms in the waxy gene of barley genetic resources from East Asia. Theor Appl Genet. 2002, 104: 132-138. 10.1007/s001220200016.

    Article  CAS  PubMed  Google Scholar 

  30. 30.

    Blair M, McCouch SR: Microsatellite and sequence-tagged site markers diagnostic for the rice bacterial leaf blight resistance gene xa-5. Theor Appl Genet. 1997, 95: 174-184. 10.1007/s001220050545.

    Article  CAS  Google Scholar 

  31. 31.

    Ayres NM, McClung AM, Larkin PD, Bligh HFJ, Jones CA, Park WD: Microsatellites and a single nucleotide polymorphism differenciate apparent amylase classes in an extended pedigree of US rice germplasm. Theor Appl Genet. 1997, 94: 773-781. 10.1007/s001220050477.

    Article  CAS  Google Scholar 

  32. 32.

    Prathepha P: Characterization of Waxy microsatellite classes that are closely linked to the rice Waxy gene and amylose content in Thai rice germplasm Songklanakarin. J Sci Technol. 2003, 25: 1-8.

    CAS  Google Scholar 

  33. 33.

    Selvaraj I, Nagarajan P, Thiyagarajan K, Bharathi M, Rabindran R: Identification of Microsatellite (SSR) and RAPD Markers Linked to Rice Blast Disease Resistance gene in Rice (Oryza sativa L.). African J Biotechnol. 2011, 10: 3301-3321.

    Google Scholar 

  34. 34.

    Zhou WC, Kolb FL, Bai GH, Domier LL, Boze LK, Smith NJ: Validation of a major QTL for scab resistance with SSR markers and use of marker-assisted selection in wheat. Plant Breeding. 2003, 122: 40-46. 10.1046/j.1439-0523.2003.00802.x.

    Article  CAS  Google Scholar 

  35. 35.

    Abdurakhmonov IY, Abdullaev AA, Saha S, Buriev ZT, Arslanov D, Kuryazov Z, Mavlonov GT, Rizaeva SM, Reddy UK, Jenkins JN, Abdullaev A, Abdukarimov A: Simple Sequence Repeat Marker Associated with a Natural Leaf Defoliation Trait in Tetraploid Cotton. J Hered. 2005, 96: 644-653. 10.1093/jhered/esi097.

    Article  CAS  PubMed  Google Scholar 

  36. 36.

    Zeng L, Meredith WR, Gutiérrez OA, Boykin DL: Identification of associations between SSR markers and fiber traits in an exotic germplasm derived from multiple crosses among Gossypium tetraploid species. Theor Appl Genet. 2009, 119: 93-103. 10.1007/s00122-009-1020-7.

    Article  CAS  PubMed  Google Scholar 

  37. 37.

    Li X, Yang H, Buirchell B, Yan G: Development of a DNA marker tightly linked to low-alkaloid gene iucundus in narrow-leafed lupin (Lupinus angustifolius L.) for marker-assisted selection. Crop and Pasture Sci. 2011, 62: 218-224. 10.1071/CP10352.

    Article  CAS  Google Scholar 

  38. 38.

    Logeman J, Schell J, Willmitzer L: Improved method for the isolation of RNA from plant tissues. Anal Biochem. 1987, 163: 16-20. 10.1016/0003-2697(87)90086-8.

    Article  Google Scholar 

  39. 39.

    Meyer E, Aglyamova GV, Wang S, Buchanan-Carter J, Abrego D, Colbourne JK, Willis BL, Matz MV: Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx. BMC Genomics. 2009, 10: 219-10.1186/1471-2164-10-219.

    PubMed Central  Article  PubMed  Google Scholar 

  40. 40.

    Podicheti R, Gollapudi R, Dong Q: WebGBrowse: a web server for GBrowse. Bioinformatics. 2009, 25: 1550-1551. 10.1093/bioinformatics/btp239.

    Article  CAS  PubMed  Google Scholar 

  41. 41.

    Doyle JJ, Doyle JL: A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 1978, 19: 11-15.

    Google Scholar 

  42. 42.

    Temnykh S, DeClerck G, Lukashova A, Lipovich L, Cartinhour S, McCouch S: Computational and experimental analysis of microsatellites in rice (Oryza sativaL.): Frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 2001, 11: 1441-1452. 10.1101/gr.184001.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  43. 43.

    Edwards D, Batley J: Plant genome sequencing: applications for crop improvement. Plant Biotech J. 2010, 8: 2-9. 10.1111/j.1467-7652.2009.00459.x.

    Article  CAS  Google Scholar 

  44. 44.

    Lister R, Gregory BD, Ecker JR: Next is now: new technologies for sequencing of genomes, transcriptomes, and beyond. Curr Opin Plant Biol. 2009, 12: 107-118. 10.1016/j.pbi.2008.11.004.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  45. 45.

    Zhang L, Kasif S, Cantor CR, Broude NE: GC_AT-content spikes as genomic punctuation marks. Proc Natl Acad Sci. 2004, 101: 16855-16860. 10.1073/pnas.0407821101.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  46. 46.

    Pesole G, Liuni S, Grillo G, Licciulli F, Mignone F, Gissi C, Saccone C: UTRdb and UTRsite: specialized database of sequences and functional elements of 5’ and 3’ unstranslated regions of eukaryotic mRNAs. Update 2002. Nucleic Acids Res. 2002, 30: 335-340. 10.1093/nar/30.1.335.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  47. 47.

    Diaz I, Vicente-Carbajosa J, Abraham Z, Martinez M, Isabel-La Moneda I, Carbonero P: The GAMYB protein from barley interacts with the DOF transcription factor BPBF and activates endosperm-specific genes during seed development. Plant J. 2002, 29: 453-464. 10.1046/j.0960-7412.2001.01230.x.

    Article  CAS  PubMed  Google Scholar 

  48. 48.

    Koornneff M, Bentsink L, Hilhorst H: Seed dormancy and germination. Curr Opin Plant Biol. 2002, 5: 33-36. 10.1016/S1369-5266(01)00219-9.

    Article  Google Scholar 

  49. 49.

    Tillett RL, Ergül A, Albion RL, Schlauch KA, Cramer GR, Cushman JC: Identification of tissue-specific, abiotic stress responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets. BMC Plant Biology. 2011, 11: 86-10.1186/1471-2229-11-86.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  50. 50.

    Covitz PA, Smith LS, Long SR: Expressed Sequence Tags from a Root-Hair-Enriched Medicago truncatula cDNA Library. Plant Physiol. 1998, 117: 1325-1332. 10.1104/pp.117.4.1325.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  51. 51.

    Yamada S, Katsuhara M, Kelly WB, Michalowski CB, Bohnert HJ: A family of transcripts encoding water channel proteins: Tissue-specific expression in the common ice plant. Plant Cell. 1995, 7: 1129-1142.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  52. 52.

    Bertioli DJ, Moretzsohn MC, Madsen LH, Sandal N, Leal-Bertioli S, Guimarães PM, Hougaard BK, Fredslund J, Schauser L, Nielsen AM, Sato S, Tabata S, Cannon SB, Stougaard J: An analysis of synteny of Arachis with Lotus and Medicago sheds new light on the structure, stability and evolution of legume genomes. BMC Genomics. 2009, 10: 45-10.1186/1471-2164-10-45.

    PubMed Central  Article  PubMed  Google Scholar 

  53. 53.

    Wojciechowski MF, Lavin M, Sanderson MJ: A phylogeny of legumes (leguminosae) based on analysis of the plastid MATK gene resolves many well-supported subclades within the family. Am J Bot. 2004, 91: 1846-1862. 10.3732/ajb.91.11.1846.

    Article  CAS  PubMed  Google Scholar 

  54. 54.

    Cardle I, Ramsay L, Milbourne D, Macaulay M, Marshall D, Waugh R: Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics. 2000, 156: 847-854.

    PubMed Central  CAS  PubMed  Google Scholar 

  55. 55.

    Varshney RK, Thiel T, Stein N, Langridge P, Graner A: In silico analysis on frequency and distribution of microsatellites in ESTs of some cereals species. Cell Mol Biol Lett. 2002, 7: 537-546.

    CAS  PubMed  Google Scholar 

  56. 56.

    Varshney RK, Graner A, Sorrels ME: Genic microsatellite markers in plants: features an applications. Trends Biotechnol. 2005, 23: 48-55. 10.1016/j.tibtech.2004.11.005.

    Article  CAS  PubMed  Google Scholar 

  57. 57.

    Gao L, Tang J, Li H, Jia J: Analysis of microsatellites in major crops assessed by computational and experimental approaches. Mol Breeding. 2003, 12: 245-261. 10.1023/A:1026346121217.

    Article  CAS  Google Scholar 

  58. 58.

    Metzgar D, Bytof J, Wills C: Selection against frameshift mutations limits expansion in coding DNA. Genome Res. 2000, 10: 72-80.

    PubMed Central  CAS  PubMed  Google Scholar 

  59. 59.

    Jung S, Abbott A, Jesudurai C, Tomkins J, Main D: Frequency, type, distribution and annotation of simple sequence repeats in Rosaceae ESTs. Funct Integr Genomics. 2005, 5: 136-143. 10.1007/s10142-005-0139-0.

    Article  CAS  PubMed  Google Scholar 

  60. 60.

    Beckett P, Bancroft I, Trick M: Computational tools for Brassica-Arabidopsis comparative genomics. Comp Funct Genom. 2005, 6: 147-152. 10.1002/cfg.463.

    Article  CAS  Google Scholar 

  61. 61.

    Shoemaker RC, Schlueter J, Doyle JJ: Paleopolyploidy and gene duplication in soybean and other legumes. Curr Opin Plant Biol. 2006, 9: 104-109. 10.1016/j.pbi.2006.01.007.

    Article  CAS  PubMed  Google Scholar 

  62. 62.

    Mudge J, Cannon SB, Kalo P, Oldroyd GED, Roe BA, Town CD, Young ND: Highly syntenic regions in the genomes of soybean,Medicago truncatula, andArabidopsis thaliana. BMC Plant Biology. 2005, 5: 15-10.1186/1471-2229-5-15.

    PubMed Central  Article  PubMed  Google Scholar 

  63. 63.

    Grover CE, Wendel JF: Recent insights into mechanisms of genome size change in plants. J Botany 2010. 2010, Article ID 382732: 1-8.

    Google Scholar 

  64. 64.

    Bruggmann R, Bharti AK, Gundlach H, Lai J, Young S, Pontaroli AC, Wei F, Haberer G, Fuks G, Du C, Raymond C, Estep MC, Liu R, Bennetzen JL, Chan AP, Rabinowicz PD, Quackenbush J, Barbazuk WB, Wing RA, Birren B, Nusbaum C, Rounsley S, Mayer FX, Messing J: Uneven chromosome contraction and expansion in the maize genome. Genome Res. 2006, 16: 1241-1251. 10.1101/gr.5338906.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  65. 65.

    Ainouche AK, Bayer R: Phylogenetic relationships in Lupinus (Fabaceae: Papilionoideae) based on internal transcribed spacer sequences (ITS) of nuclear ribosomal DNA. Am J Bot. 1999, 86: 590-607. 10.2307/2656820.

    Article  CAS  PubMed  Google Scholar 

  66. 66.

    Swiecicki W, Swiecicki WK, Nijaki T: Lupinus x hispanicoluteus – An interspecific hybrid of Old world lupins. Acta Soc Bot Pol. 1999, 68: 217-220.

    Article  Google Scholar 

  67. 67.

    Drummond CS: Diversification of Lupinus (Leguminosae) in the western New World: Derived evolution of perennial life history and colonization of montane habitats. Mol Phylogenet Evol. 2008, 48: 408-421. 10.1016/j.ympev.2008.03.009.

    Article  PubMed  Google Scholar 

  68. 68.

    Callow JA, Ford-Lloyd BV, Newbury HJ: Biotechnology and plant genetic resources: conservation and use. 1997, CAB International, Wallingford, UK, 49-76.

    Google Scholar 

  69. 69.

    Swanson T: Global values of Biological diversity: the public interest in the conservation of plant genetic resources for agriculture. Plant Genet Res Newsletter. 1996, 105: 1-3.

    Google Scholar 

Download references


This research was funded by the National Commission for Scientific & Technological Research (FONDECYT Project No.1090759) and CONICYT Regional/GORE Araucanía/CGNA/R10C1001, Chile. We thank Héctor Urbina for his assistance on L. luteus sequence assemblies.

Author information



Corresponding author

Correspondence to Iván J Maureira-Butler.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

LBP collected the tissues and extracted the RNAs. JU, LBP and JM constructed the EST libraries. JU and JM supervised the 454 sequencing of the libraries. LBP and JU conducted the SSR search and primer design. LBP conducted the SSR polymorphism tests and transferability studies. LMP sequenced the amplicons of SSR and intergenic blocks. GAA grew the plants for the diversity study, extracted the DNAs, PCR amplified and conducted the genotyping of the population. IMB drafted the experimental design of all the studies carried out in this work and conducted the genetic analysis for the diversity study. CSN conducted annotations and in silico mapping of the sequences. LBP conducted the microscale synteny studies. HS and IMB conceived the study. LBP drafted the manuscript with the support of IMB. All the authors read and approved the final manuscript.

Electronic supplementary material

Table S1.

Additional file 1: Characteristics of 33 Conserved Microsynteny (CMS) markers developed in L. luteus. Shown for each primer pair are the Medicago chromosome library specificity, l1l2 isotigs where CMS forward and reverse primers were anchored, forward and reverse sequence, expected Medicago amplicon size (bp), L. luteus CMS amplicon size (bp), amplification in other Lupin species (L. hispanicus), and the level of polymorphism on the L. luteus screening panel. (PDF 164 KB)

Table S2.

Additional file 2: Lupinus luteus, L. hispanicus and L. mutabilis accessions included in the study. (PDF 80 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Parra-González, L.B., Aravena-Abarzúa, G.A., Navarro-Navarro, C.S. et al. Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies. BMC Genomics 13, 425 (2012).

Download citation


  • Lupinus luteus
  • Orphan crop
  • Microsynteny