Skip to main content

Genome-wide analysis of NBS-encoding disease resistance genes in Cucumis sativusand phylogenetic study of NBS-encoding genes in Cucurbitaceae crops



Plant nucleotide-binding site (NBS)-leucine-rich repeat (LRR) proteins encoded by resistance genes play an important role in the responses of plants to various pathogens, including viruses, bacteria, fungi, and nematodes. In this study, a comprehensive analysis of NBS-encoding genes within the whole cucumber genome was performed, and the phylogenetic relationships of NBS-encoding resistance gene homologues (RGHs) belonging to six species in five genera of Cucurbitaceae crops were compared.


Cucumber has relatively few NBS-encoding genes. Nevertheless, cucumber maintains genes belonging to both Toll/interleukine-1 receptor (TIR) and CC (coiled-coil) families. Eight commonly conserved motifs have been established in these two families which support the grouping into TIR and CC families. Moreover, three additional conserved motifs, namely, CNBS-1, CNBS-2 and TNBS-1, have been identified in sequences from CC and TIR families. Analyses of exon/intron configurations revealed that some intron loss or gain events occurred during the structural evolution between the two families. Phylogenetic analyses revealed that gene duplication, sequence divergence, and gene loss were proposed as the major modes of evolution of NBS-encoding genes in Cucurbitaceae species. Compared with NBS-encoding sequences from the Arabidopsis thaliana genome, the remaining seven TIR familes of NBS proteins and RGHs from Cucurbitaceae species have been shown to be phylogenetically distinct from the TIR family of NBS-encoding genes in Arabidopsis, except for two subfamilies (TIR4 and TIR9). On the other hand, in the CC-NBS family, they grouped closely with the CC family of NBS-encoding genes in Arabidopsis. Thus, the NBS-encoding genes in Cucurbitaceae crops are shown to be ancient, and NBS-encoding gene expansions (especially the TIR family) may have occurred before the divergence of Cucurbitaceae and Arabidopsis.


The results of this paper will provide a genomic framework for the further isolation of candidate disease resistance NBS-encoding genes in cucumber, and contribute to the understanding of the evolutionary mode of NBS-encoding genes in Cucurbitaceae crops.


Cucumbers (Cucumis sativus L.) of the Cucurbitaceae plant family are among the most important vegetable crops in the world. However, susceptibility to multiple pathogens hinders their production increase and quality improvement [15] The NBS-LRR resistance (R) genes, which encode proteins containing nucleotide binding sites (NBS) and leucine-rich repeat (LRR) domains, form the largest R-gene family among plant genomes [6]. Therefore, a systematic evaluation of NBS-encoding genes is required in order to better understand cucumber resistance and susceptibility. Previously, NBS and Pto analogues had been isolated and characterized using degenerate primers in cucumbers [79]. However, this experimental approach failed to detect all the members of the gene families in the cucumbers. Fortunately, the cucumber genome was sequenced by researchers who worked on the ‘Chinese Long’ inbred line 9930 and the gynoecious inbred line ‘Gy14’, which provided an opportunity to conduct a comprehensive overview of the NBS-encoding gene superfamily at the genome level [10]. Recently, Kang et al. localized the cucumber scab R gene Ccu into an R-gene cluster located in a 670 kb region of cucumber chromosome 2 [11]. Four resistance gene homologues (RGHs) were located in the region delimited by the molecular markers Indel 01 and Indel 02, and thus were possible Ccu candidates [11]. Genome-wide analysis of cucumber NBS-encoding genes played an important role in R-gene mapping and cloning.

Aside from cucumber, the Cucurbitaceae plant family also includes many important vegetable crops such as the bottle gourd Lagenaria siceraria (Mol.) Standl.], luffa Luffa cylindrica (L.) Roem.], squash (Cucurbita moschata Duch.), melon (Cucumis melo. L), and watermelon Citrullus lanatus (Thunb) Mansfeld]. For these species in the Cucurbitaceae family, several studies concerning their phylogenetic relationships have been reported [1216]. These studies have shown that a wide genetic distance exists between the Citrullus and Cucumis groups. Phylogenetic relationships among Citrullus species and subspecies are closer in comparison with those among most Cucumis species [14]. Moreover, Lagenaria et al. [16] reported the history of Cucurbitaceae using a multigene phylogeny for 114 of the 115 genera and 25% of the 960 species worldwide, and found that Cucumis and Cucurbita are more closely related to each other than any of them are to Luffa. However, few R-related genetic and genomic resources are available for the improvement of these crops. Therefore, the analysis of R-genes or RGHs will contribute to their timely application in disease resistance breeding in Cucurbitaceae crops.

Currently, numerous disease R-genes which confer resistance to a wide range of pathogens, including viruses, bacteria, fungi, nematodes and aphids, have been cloned through map-based cloning and transposon-tagging from many dicotyledonous and monocotyledonous plants [1719]. Although the mechanisms of infection of these organisms differ significantly, R-gene products are remarkably similar to one another. Most R-genes seem to encode putative NBS-LRR domains. It is well known that NBS-LRR R-proteins in plants recognize the presence of the pathogen through two different types of perception mechanisms [17, 20, 21]. One is direct recognition between R-proteins in plants and avirulence proteins in the pathogen, and the other is the indirect perception mechanism postulated by the Guard Model [17, 22]. The Guard Model proposes that NBS-LRR R-proteins act by monitoring the plant effector target against pathogen effector proteins, and explains that a single R-protein is able to perceive multiple effectors. Therefore, it is theorized that few R-genes are capable of targeting the broad diversity of pathogens in plants [17]. Genome-wide analysis of a complete set of NBS-LRR R-proteins in the plant genome will provide new insights into the genetic diversity of the R-genes available in this species.

To date, NBS-LRR R-genes may be divided into two families, distinguished by the presence or absence of a TIR domain at the N-terminal [23]. The first is known as the TIR NBS-LRR family, which is homologous to the Toll protein and interleukin-1 receptor at the N-terminal domain, and the other is known as the non-TIR NBS-LRR family. Generally, non-TIR NBS-LRR R-proteins include a putative coiled-coil domain at the N-terminal domain. Thus, they are also referred to as CC NBS-LRR R-proteins. Eight conserved motifs have been identified in the NBS domains of these two R-gene families [23], some of which are specific to the non-TIR and the TIR NBS-LRR families [24]. Degenerate primers have been designed based on these conserved motifs, and a large number of NBS-encoding RGHs have been isolated from different plant species via polymerase chain reaction (PCR) [2530]. These RGHs have high sequence similarity with R-proteins cloned from different plant species.

In this paper, an in silico search of cucumber genome databases was conducted to identify members of the cucumber NBS-encoding gene family. A total of 57 members were identified from the phytozome database ( A phylogenetic tree was constructed and the NBS-encoding genes were separated into two distinct groups, namely the TIR and CC families. Conserved motifs were analyzed in these two families to support the partition. In addition, 158 NBS-encoding RGHs from the other five Cucurbitaceae crop genomes were also identified via degenerate PCR amplification and database mining. These genes, together with the RGHs, were used for the study of their phylogenetic relationship in Cucurbitaceae crops. Finally, a comparative analysis between the NBS-encoding genes from Arabidopsis thaliana and those from the Cucurbitaceae crops was performed in order to determine their evolutionary origin. The findings will provide a strong groundwork for the isolation of candidate R-genes in cucumber and contribute to understanding the evolution of NBS-encoding genes in Cucurbitaceae crops.


Sequence and database search for NBS-encoding genes in Cucumis sativus

The availability of the complete cucumber genome sequences facilitated the search for NBS-encoding genes. At present, two cucumber inbred lines, 9930 (the ‘Chinese long’, commonly used in modern cucumber breeding) and Gy14 (the gynoecious inbred line), have been sequenced. The former was sequenced using a combination of traditional Sanger and next-generation Illumina GA sequencing technologies [10] and a database has been established for the sequence formation ( The latter was sequenced de novo with an appropriate mixture of random shotgun and paired-end shotgun reads using a 454-XLR technology; these sequences have been uploaded to the JGI Genome database ( In this study, the two databases were used to search for NBS-encoding genes.

NBS-encoding genes were identified for the first time in three steps using the JGI Cucumis sativus Genome database. The first step involved a BLASTN search using A. thaliana and rice NBS-encoding sequences [3133] as the query. The second step aimed at a complete search of the candidate NBS-encoding genes in the cucumber gynoecious inbred line Gy14, and was performed using the amino acid sequence of the nucleotide-binding adaptor shared by APAF-1, R-proteins, and the CED-4 (NB-ARC) domain (Pfam: PF00931) as a query to find possible encoded homologues [22]. In the third step, based on the above results, the search of candidate NBS-encoding genes in the cucumber genome was repeated using BLASTN searches. The overall analysis reveals that the NBS-encoding gene family is composed of 57 members in the ‘Gy14’ cucumber gynoecious inbred line (Table 1). The predicted nucleotide and protein sequences of all 57 NBS-encoding genes from ‘Gy14’ are provided in Additional file 1.

Table 1 NBS-encoding or RGH genes in cucurbitaceous crops

To classify these NBS-encoding proteins, each was identified based on N terminal CC motifs and TIR domains, as well as LRRs. A total of seven categories (TNL, CNL, TN, CN, N, NL and RPW8-NL) were identified (Table 2). Twenty-three proteins were shown to possess only NBS-LRRs. Two proteins, Cucsa.102240 and Cucsa.123410, were predicted to have a domain with a sequence similar to the RPW8 Arabidopsis powdery mildew resistance gene family [34]. Thirteen genes were identified with TIR domains at the N-terminal, and two of them, Cucsa.338660 and Cucsa.091460, were shown to lack LRRs. Eighteen genes were predicted to encode the CC motifs, and one of these genes lacked LRRs. The gene, Cucsa.237530, possesses only NBS domains (Table 2; Additional file 2).

Table 2 Numbers of each family of NBS-encoding genes in Cucumis sativus

Previously, the number of NBS-encoding genes from the cucumber inbred line ‘Chinese Long’ 9930 had already been reported [10]. In this study, the methods mentioned above were also implemented to identify NBS-encoding genes from the ‘Chinese Long’ cucumber 9930 database ( These NBS-encoding genes are shown in Additional file 1. A comparison of Gy14 and 9930 cucumber NBS-encoding genes showed that most of the genes were highly similar to one another (Additional file 3). Therefore, in order to perform a comprehensive analysis of the NBS-encoding genes within the cucumber genome for comparison purposes, only NBS-encoding genes from the cucumber ‘Gy14’ genome were selected for further analyses.

Phylogenetic analysis of cucumber NBS-encoding genes and exon-intron configurations

Generally, 5’ region preceding the NBS and 3' region following the NBS have high variability and are not included for construction. The NBS region, however, is highly conserved and is often used to generate multiple sequence alignments and phylogenetic tree constructions [30]. In order to elucidate the relationships among the cucumber NBS-encoding genes, sequences of the NBS domain of these genes were used to construct a neighbor-joining (NJ) phylogenetic tree. These genes were divided into two families, the CC-NBS and TIR-NBS, which are supported by the high bootstrap values (Figure 1A). This was consistent with the results reported by Pan et al., which showed that both the TIR-NBS and CC-NBS families of genes occur in dicotyledonous plant species [24]. Moreover, each of these two families was separated into two subfamilies (CC-I/CC-II and TIR-I/TIR-II) with high bootstrap values (Figure 1A). In addition, it was observed that the members in CC-NBS were more numerous than those in the TIR-NBS family, which are composed of 34 and 23 members, respectively. This distribution is incongruent with that found in Arabidopsis. In the Arabidopsis genome, the member of the CC-NBS family was less than that of the TIR-NBS family [31]. Recently, the melon genome sequence was available [35], this distribution was also found in the NBS-LRR family (data not shown).

Figure 1

Phylogenetic analysis and intron/exon configurations of NBS-encoding genes in cucumbers. A phylogenetic tree of NBS-encoding genes was constructed using MEGA 5.0. Introns and exons are drawn to scale with the full encoding regions of their respective genes. Boxes indicate the exon, and lines indicate the intron. 0 = intron phrase 0; 1 = intron phrase 1; 2 = intron phrase 2.

The exon/intron positions and phases of the 57 NBS-encoding genes in cucumber were further analyzed. The exon/intron structures were obtained using the online Gene Structure Display Server (GSDS: with both coding sequences and genomic sequences [36]. Figure 1B provides a detailed illustration of the relative lengths of the introns and conservation of the corresponding exon sequences within each NBS-encoding gene in the cucumbers. The number of introns in all of these genes ranged from zero to seven. No intron was found in the Cucsa.338110, Cucsa.338190 or Cucsa.237530, whereas seven were found in Cucsa.132370. More than half of the genes had one to three introns (Additional file 4). Moreover, in the CC-I subfamily, all of the genes contained the lowest amount of introns (zero to three), except for Cucsa.132370, which contained seven introns. The genes in the TIR-II had the greatest number of introns (three to six), except for Cucsa.338660, which had only one intron. In the remaining two subfamilies, CC-II and TIR-I, the NBS-encoding genes contained zero to six introns (Figure 1B). These findings, together with the phylogenetic tree, indicate that some intron loss and intron gain events may have occurred during the structural evolution between the two families of cucumber NBS-encoding genes.

Architectural diversity of NBS-encoding genes in Cucumis sativus

In order to understand the fine structure of the NBS-encoding genes in the cucumbers, CC- and TIR-NBS families were analyzed separately using the MEME and Clustal X programs. Moreover, these proteins usually have an N-terminal region, NBS domain, and LRR and C-terminal regions. Therefore, each of the two families of proteins were separated into three parts and subjected to protein domain and motif analyses.

The CC-NBS family

Three, ten and seven conserved motifs were identified in the N-terminal region (CC domain), NBS domain, and LRR and C-terminal regions, respectively (Table 3). Three motifs (C1, C2, and C3) were identified in the N-terminal region of most of the CC-NBS families of the cucumber NBS-encoding genes (Additional file 5). Most of the sites in these motifs were poorly conserved (Table 3, Additional file 6A). The NBS domain showed lower sequence diversity than the N-terminal domain. The eight motifs (P-loop, RNBS-A, Kinase-2, RNBS-B, RNBS-C, GLPL, RNBS-D, and MHDV) which were previously identified to occur widely in the NBS region of the known NBS-encoding genes in plant species [31], were also conserved in cucumber CC-NBS R-proteins with the exceptions of Cucsa.318890, Cucsa.091880, Cucsa.326910, Cucsa.132370, Cucsa.237070, Cucsa.094560 and Cucsa.094580, all of which have lost some conserved motifs (Additional file 5). In addition, two other conserved motifs (CNBS-1 and CNBS-2) in the NBS domain of the CC-NBS R-proteins were identified in the cucumbers. However, most of the sites in these two motifs were weakly conserved. Seven LRR-related motifs were found in CC-NBS R-proteins (Additional file 5). In contrast to the N-terminal and NBS domain, the LRR motif patterns were variable. Almost all proteins showed different LRR motif patterns. Among them, the conserved motifs L1 and L2 existed widely in all CC-NBS R proteins (Additional file 5).

Table 3 Major MEME motifs in predicted cucumber CC-NBS family of proteins

The TIR-NBS family

MEME was used to find motifs in the TIR region, NBS domain, and LRR region of the cucumber TIR-NBS family of NBS-encoding genes. A total of four, nine, and seven motifs were found, respectively. In the TIR region, four motifs (T1, T2, T3, and T4) existed in almost half of these proteins (Table 4). The conservation of the NBS domain in the TIR-NBS family was higher than that in the CC-NBS family. In this family, two proteins, Cucsa.237530 and Cucsa.338660, lacked the RNBS-D-TIR and MHDL domains, whereas Cucsa.249360 lacked the RNBS-C domain (Additional file 5). Compared with the CC-NBS family, only one additional motif (TNBS-1) was identified in the NBS domain of the TIR-NBS family (Additional file 5). In addition, although the same number of LRR-related motifs was identified in the TIR-NBS family, the CC-NBS and TIR-NBS families do not have identical motifs. Moreover, most of the sites of these motifs were poorly conserved (Table 4, Additional file 6B).

Table 4 Major MEME motifs in predicted cucumber TIR-NBS family of proteins

Chromosomal distribution of cucumber NBS-encoding genes

The GY14 cucumber genome encoded 57 members. Of them, 55 NBS-encoding genes could be located on all seven chromosomes (Figure 2). The remaining two genes (Cucsa.326910 and Cucsa.155730) could not be located on any chromosomes and were assigned to Scaffold03139 and Scaffold03138, respectively. As in Arabidopsis and rice, the distribution of cucumber NBS-encoding genes among chromosomes is nonrandom [3133]. There were 10 and 20 genes which were localized on chromosomes 5 and 2, respectively, whereas only five genes were located on chromosomes 1 and 6. The remaining 20 NBS-encoding genes were located on chromosomes 3, 4 and 7.

Figure 2

Position of NBS-encoding genes on the cucumber chromosomes. Chromosome numbers are indicated at the bottom end of the chromosome. Those located on sequence scaffolds are not shown. The straight lines connecting the NBS-encoding genes present on duplicated chromosomal segments, whereas the tandem duplicated gene clusters are indicated by red lines.

In addition, it was determined that the NBS-encoding gene clusters by a combination of three approaches described in previous studies [31, 37, 38]. A gene cluster is defined as a region in which two neighboring homologous genes are less than 200 kb and fewer than eight non-NBS-encoding genes between NBS-encoding genes. It was also found that most of the NBS-encoding genes were clustered on the chromosomes. A total of 33 genes were localized within nine clusters. The largest cluster contained 10 NBS-encoding genes, while the smallest included only two genes, both of which were located on chromosome 2. Shared phylogenetic clades confirmed the NBS-encoding gene chromosomal duplication, with four and seven NBS-encoding gene segment and tandem duplication events in cucumber (Figure 2), and these genes concern chromosomes 2, 3, 6 and 7.

Identification of NBS-encoding RGHs in other Cucurbitaceae crops

To gain an insight into the phylogenetic relationship of NBS-encoding genes in Cucurbitaceae crops, database mining and PCR amplification were employed to identify the NBS-encoding genes in melon, bottle gourd, luffa, watermelon, and squash (Table 1, Additional file 7). A total of 165 NBS sequences were obtained. Among these sequences, 43 belonged to bottle gourd, 20 to luffa, 24 to melon, 64 to watermelon, and 14 to squash. For bottle gourd and luffa, the NBS sequences were obtained by PCR amplification; for squash, the NBS sequences were derived from database mining; for melon and watermelon, 10 and 31 sequences were respectively drawn from PCR amplification, and the remaining 14 and 33 sequences were obtained by database mining. The NBS sequences from watermelon originated from two species. One was Citrullus lanatus, consisting of 58 NBS sequences. Among them, one NBS sequence, GU124560, is a pseudogene (gene with stop codon), and two NBS sequences, GU124553 and GU124554, lacked the complete conserved motifs from P-loop to GLPL. The remaining six NBS sequences were found in Citrullus colocynthis, in which GU124552 and GU124555 were pseudogenes. Among the 24 NBS sequences from melon, two are pseudogenes (AF354508 and AF354512), which were found in the C. melo species. The AY583855 sequence has been identified as a Fusarium R gene [39]. In addition, the NBS sequences were obtained from bottle gourd (L. siceraria), luffa (L. cylindrica), and squash (C. moschata). In the current study, the pseudogenes and sequences without conserved domains from P-loop to GLPL were excluded from further analysis. Among the 165 NBS sequences from the Cucurbitaceae crops, 103 NBS sequences (JN230598 to JN230701) were identified via degenerate PCR amplification. The remaining 62 were derived from the GeneBank database (Table 1).

Phylogenetic analysis of NBS-encoding genes and RGHs in Cucurbitaceae crops

Phylogenetic analysis via the NJ method was conducted in order to determine the relationships among NBS-encoding genes and RGHs in the Cucurbitaceae crops. The consensus phylogenetic tree (Figures 3, Additional file 8) indicated that there were two distinct families, namely TIR-NBS and CC-NBS, which were consistent with the pattern previously described [23, 40]. TIR-NBS and CC-NBS were further subdivided into nine and four distinct subfamilies (TIR1 to TIR9 and CC1 to CC4), respectively.

Figure 3

Phylogenetic comparison of Cucurbitaceae NBS-encoding genes and RGHs. TIR-NBS and CC-NBS families are distinguished. The scale represents the average number of substitutions per site. The numbers on the branches indicate the percentage of 1000 bootstrap replicates that support the node with only values > 50% reported. The detailed phylogenetic tree is shown in Additional file 8.

TIR1, TIR3 and TIR6 were the largest among the subfamilies, which were composed of 43, 35 and 40 sequences, respectively. TIR1 contained 29 sequences from bottle gourd, 5 from watermelon, 7 from luffa, and 2 from cucumber. The TIR3 subfamily included 13 sequences from cucumber, 14 from melon, and 8 from watermelon. The TIR6 subfamily contained 27 sequences from watermelon, 10 from squash, 2 from cucumber, and 1 from luffa. The TIR2, TIR5, TIR8 and TIR9 subfamilies are small, composed of 2, 2, 4, and 1 sequence, respectively. The remaining two subfamilies, TIR4 and TIR7, consisted of 10 and 15 sequences, respectively. Within the CC-NBS family, the largest subfamilies, CC1 and CC4, consist of 20 and 28 sequences, respectively, while CC2 and CC3 were relatively small subfamilies, composed of 3 and 12 sequences, respectively (Figures 3 and S4).

Interestingly, all NBS sequences obtained via PCR amplification in luffa belonged to the TIR (TIR1, TIR6 and TIR7) families, whereas the NBS sequences from the other five species were distributed widely among TIR- and CC-NBS families (Figures 3, Additional file 8).

Comparison of NBS-encoding genes and RGHs from Cucurbitaceae and A. thaliana

In the following analysis, A. thaliana, a member of Brassicaceae and the model system for genomic comparisons, was selected for phylogeny. The phylogenies were constructed using the P-loop to GLPL motifs. Previous phylogenetic analyses showed that all NBS sequences in A. thaliana are separated into two distinct families, CC and TIR [23]. A great divergence was observed between the two families. Therefore, separate analyses of these families were performed, and the results are shown in this paper. Figure 4 and Additional file 9 show the results of the phylogenetic analysis of the CC-NBS family between the Cucurbitaceae crops and A. thaliana. All members from the CC1, CC2, CC3 and CC4 subfamilies were detected in four clades (N2, N1, N4, and N3, respectively), using reference sequences from Arabidopsis as described by Cannon et al. [41]. Among the four clades, N2 was the largest, being composed of 12 members from cucumber, 3 from melon, 3 from bottle gourd, and 10 from watermelon. The other three clades were relatively small. N1 clade included all CC3 subfamilies from the Cucurbitaceae crops and two members, At3g14460 and At3g14470, from A. thaliana. However, relatively lower bootstrap values were observed in N1 and N2 clades. Low scores are usually an indication that the observed patterns must be analyzed with caution, and are more often observed for large datasets. Such poor scoring does not necessarily imply unreliable branching, but instead indicates that not all members may be assigned to a particular group with high confidence. The NBS-LRR superfamily accounts for the largest number of known disease resistance genes, and is one of the largest gene families among plant genomes [42]. High diversification of these genes was observed in the plants [30, 38, 40]. Therefore, it is believed that the diversification of NBS-encoding genes was responsible for the poor bootstrap score. This phenomenon has occurred frequently in constructing phyologenetic trees of NBS-encoding resistance genes from other plant species [7, 43, 44].

Figure 4

Phylogenetic comparisons of Cucurbitaceae CC-NBS family of NBS-encoding genes and NBS-encoding genes with those in Arabidopsis . The ancient CC family (N1 to N4) as defined by Cannon et al. [41] is indicated. The scale represents the average number of substitutions per site. The numbers on the branches indicate the percentage of 1000 bootstrap replicates which support the node with only values of > 50% reported. The detailed phylogenetic tree is shown in Additional file 9.

Clade N3 included 8 sequences from cucumber, 8 from bottle gourd, 3 from squash, and 1 from watermelon. Clade N4 only contained three sequences from cucumber. Within the TIR family (Figures 5, Additional file 10), the nine subfamilies from the Cucurbitaceae crops remained different when compared with the TIR family of the sequences from A. thaliana. Sequences from TIR4 and TIR9 were detected in the At-TIR-NBS-B and At-TIR-NBS-A subfamilies, respectively. Each of the remaining seven subfamilies from the Cucurbitaceae crops formed a group.

Figure 5

Phylogenetic comparisons of Cucurbitaceae TIR-NBS family of NBS-encoding genes and NBS-encoding genes to those in Arabidopsis . The scale represents the average number of substitutions per site. The numbers on the branches indicate the percentage of 1000 bootstrap replicates which support the node with only values of > 50% reported. The detailed phylogenetic tree is shown in Additional file 10.

The general features of the phylogenetic tree are presented in Figures 4 and 5. In most cases, each clade in the Cucurbitaceae crops was composed of sequences from more than one species. For example, clade N3 was composed of CC1 sequences from four species (cucumber, bottle gourd, squash, and watermelon), whereas clade N2 consisted of CC4 sequences from cucumber, melon, bottle gourd, and watermelon. A similar phenomenon was also observed in the TIR1 to TIR8 subfamilies. However, in TIR9 and CC2, squash and cucumber were species-specific subfamilies, that is, they were composed of sequences from only one species.


As the first sequenced vegetable crop, the cucumber genome will provide a valuable new resource for the biological research and breeding of cucurbits ( The Cucurbitaceae family, commonly known as cucurbits and gourds, includes many economically important cultivated plants, such as cucumber (C. sativus L.), melon (C. melo L.), bottle gourd (Lagenaria siceraria var. hispida), luffa L. cylindrica (L.) Roem.], watermelon Citrullus lanatus (Thunb.) Matsum. & Nakai], squash (C. moschata Duch.), and pumpkin (Cucurbita spp.) [45]. Developing disease resistance is one of the most important objectives of breeding Cucurbitaceae crops. In the current study, the first large scale analysis of NBS-encoding genes from cucumber was reported, as was that of RGHs from melon, watermelon, luffa, bottle gourd and squash. The results contribute to the identification of candidate R-genes and provide insight into NBS-encoding gene evolution in Cucurbitaceae crops.

The cucumber genome encodes a small NBS-encoding gene family

Previous studies regarding NBS-encoding genes from Vitis vinifera, Populus trichocarpa, Arabidopsis thaliana, and Oryza sativa revealed that there are 535, 416, 174, and 519 genes in these species, respectively [46]. However, the GY14 cucumber genome only encodes 57 NBS-encoding genes, which is less than that in the four sequenced plant genomes (Additional file 11). Recently, Huang et al. identified a total of 59 NBS-encoding genes which were located on seven different chromosomes in the ‘Chinese Long’ cucumber genome [10]. These two cucumber lines have similar numbers of NBS-encoding genes (Additional file 3). Moreover, in these two cucumber genomes, the majority of conserved NBS-LRR R genes is single-copy and/or located as a singleton. The complex clustered NBS-LRR R genes contribute greatly to the rich genetic variation (data not shown). Recently, it was reported that there are 81 NBS-encoding genes in the melon genome [35]. It was found that there are significant differences in R gene counts among them. This result indicated that the degree of interspecific variation is greater than that of intraspecific variation in Cucumis.

Recently, Porter et al. reported that only 54 NBS-encoding genes are present in Carica papaya[47], which was similar to the number of these genes in the cucumbers (Additional file 11). Although the difference between the genome size and total number of predicted protein encoded genes among the six sequenced plant species was observed clearly (Additional file 11), the number of NBS-encoding genes does not increase or decrease proportionally. The obtained data indicate that, similar to Carica papaya, NBS-encoding genes in cucumber are a relatively small gene family. Based on the information regarding the cucumber genome [10], there are at least two explanations to this phenomenon. The first is the absence of the recent whole-genome duplication (WGD) in the small cucumber genome [10]. WGD is common in angiosperm plants and produces a tremendous source of raw materials for gene genesis, therefore the absence of WGD may have led to the small number of NBS-encoding genes in cucumber. The second explanation is the inclusion of a small number of tandem gene duplications and a few segmental duplications in the cucumber genome [10]. These duplications also contribute in part to the small number of NBS-encoding genes in cucumbers.

Diversity of NBS-encoding genes in cucumber

Cucumber genome encodes 57 NBS-encoding genes and maintains both the TIR and CC families (Additional file 11 and Figure 1A), which suggests that the cucumber has relatively few, albeit diverse, NBS-encoding genes. The sequences of TIR- and CC-NBS family of NBS-encoding genes were aligned separately. The results revealed the presence of six previously identified motifs (P-loop, Kinase-2, RNBS-B, RNBS-C, GLPL, and MHDV) in most genes (Additional file 12). Motifs RNBS-A-TIR and RNBS-D-TIR occur exclusively in the TIR-NBS R proteins (Additional file 12A), whereas RNBS-A-nonTIR and RNBS-D-nonTIR are specific to the CC-NBS R proteins (Additional file 12B). However, some motifs present in the sequence alignment were not detected using the MEME software, suggesting that these motifs were poorly conserved in some of these proteins, such as the P-loop motif in Cucsa.017490, Cucsa.088220, Cucsa.094560, Cucsa.239860 and Cucsa.318890. This phenomenon was observed only in the CC-NBS family of NBS-encoding genes in cucumbers, suggesting that these genes are more conserved in the TIR-NBS family than those in the CC-NBS family. In previous studies, some researchers found that the CC family is highly diverse and originated prior to the split between gymnosperms and angiosperms. In contrast, the TIR family is more homogeneous and was found only in dicotyledon, suggesting that it arose after the divergence of monocotyledon and dicotyledon [41, 48, 49]. Therefore, the results of this study were consistent with those of previous reports.

The last residue of the kinase-2 motif, D (Aspartate) or W (Tryptophan), in the NBS-encoding genes in plants has also been used to predict (95% accuracy) whether they belong to the TIR- or CC-NBS family of the NBS-encoding genes [24]. In the current paper, the last residue in most of the kinase-2 motifs of TIR-NBS family of genes is “D”, except in Cucsa.237410, Cucsa.237540, Cucsa.237520, and Cucsa.091460, in which it was substituted for “Asparagine”, “Glutamine acid”, “Threonine” and “Asparagine”, respectively. For the CC-NBS family, the last residues in the kinase-2 motifs is “W” except in Cucsa.337190, Cucsa.338110, and Cucsa.338190, in which it was replaced with “Serine”, “Serine” and “Glycine”, respectively (Additional file 12). This class not only supports the results of the above phylogenetic analysis, but also the view that both the TIR- and CC-NBS families of genes occur in dicot species [50].

Comparative evolutionary analysis of NBS-encoding genes from Cucurbitaceae crops and Arabidopsis

Arabidopsis is the model system for genomic comparisons among dicots, due to the fact that a complete draft of its genome is available [51]. In this study, both the TIR- and CC-NBS families were identified in all genes from the Cucurbitaceae crops (Figure 3). Separate phylogenies for the 2 families were subsequently constructed (Figures 4 and 5). The phylogenetic pattern of CC-NBS is shown in Figure 4 and Additional file 9. Subfamilies CC1, CC2, CC3 and CC4 of Cucurbitaceae fall within the subfamilies N3, N4, N1 and N2, respectively, as identified by Cannon et al. [41] No species-specific expansion in the CC-NBS family after the divergence of the Cucurbitaceae species and Arabidopsis was observed. In addition, the N1 subfamily contained only two members from Arabidopsis, At3g14460 and At3g14470, whereas subfamily CC3 of Cucurbitaceae was grouped into this subfamily. Thus, the analysis has identified a region of chromosome 3 of Arabidopsis which is potentially orthologous to the CC3 subfamily of Cucurbitaceae.

The phylogenetic comparison of TIR-NBS sequences from Cucurbitaceae and Arabidopsis revealed a degree of change as opposed to the phylogenetic pattern of the CC-NBS family (Figure 5). Similar to the results shown in Figure 3, most subfamilies were also shown to be species-specific in the phylogenetic analysis of TIR-NBS sequences from Cucurbitaceae and Arabidopsis, except for TIR9 and TIR4, which were combined into the At-TIR-NBS-A and At-TIR-NBS-B subfamilies, respectively [52]. This observation is similar to those described in Solanaceae and Asteraceae, and may be typical of other plant families as well [24, 51], which suggests recent gene radiation from a common ancestral source of NBS-encoding genes or RGHs.


The results of this study provide a genomic framework for the further isolation of candidate NBS-encoding genes in Cucurbitaceae crops through comparative genomics, and contribute to the understanding of the evolutionary mode of NBS-encoding genes in Cucurbitaceae crops. In 2009, the cucumber genome was sequenced by researchers who worked on the ‘Chinese Long’ inbred line 9930 [10], and recently Gy14 was sequenced de novo. A vast amount of useful information has been collected, and two cucumber genome databases (; have been established. However, information regarding other less studied Cucurbitaceae crops is still scarce, including that of melon, watermelon, luffa, bottle gourd, and squash. Thus, obtaining more NBS sequences from these other Cucurbitaceae crops should be the focus of future studies.


Retrieval and identification of cucumber NBS-encoding R genes

Cucumber (Cucumis sativus L.) assembly and annotation V1.0 were downloaded from A TBLASTN search was used to obtain all NBS-encoding genes in the cucumber (C. sativus L.) genome. First, a TBLASTN was performed using the protein coding sequences of the NBS domain of NBS-encoding sequences from A. thaliana and rice [3133] as the query against the JGI Cucumis sativus genome database ( Second, the amino acid sequence of the NB-ARC domain (Pfam: PF00931) was adopted as a query in TBLASTN searches for possible homologues encoded in the cucumber genome. The conserved NBS domain of these predicted NBS-encoding proteins was determined by Pfam version 22.0 ( Third, based on the results above, the searches of candidate NBS-encoding genes in the cucumber genome were repeated using BLASTN searches. The e-value used was 1e-5. Finally, all BLAST hits in the cucumber genome, together with flank regions of 5,000–10,000 bp in the upstream and downstream of BLAST hits, were annotated using the FGENESH ( and GENSCAN ( programs.

To classify these NBS-encoding genes, all candidate genes were evaluated to further verify whether they encoded TIR, CC, NBS, or LRR motifs using the Pfam database (, SMART protein motif analyses (, and COILS, with a threshold of 0.9, to specifically detect CC domains [53].

Prediction of conserved motif structures and gene duplication

To investigate the diversity and structure of NBS-encoding genes in cucumbers, their predicted amino acid sequences were subjected to domain and motif analyses. According to the methods of previous researchers [31, 54], NBS-encoding genes from cucumbers were divided into three components, namely the N-terminal, NBS domain, and LRR-C-terminal regions. They were then analyzed individually using the Multiple Expectation Maximization for Motif Elicitation (MEME)/Motif Alignment and Search Tool (MAST) system ( Furthermore, MEME motif analyses were performed on members of TIR-NBS and CC-NBS families. Conservation of each motif among the NBS-encoding genes was performed with WebLogo version 2.8.2 ( using the default settings.

Gene duplication events of NBS-encoding genes were defined based on the http://criterion used by previous researchers [55]. NBS-encoding genes in cucumber were aligned using BioEdit ( and calculated by MEGA 5.0 [56] for homology gene calculation.

Identification of NBS-encoding RGHs in other Cucurbitaceae crops

To understand the phylogenetic relationship among the NBS-encoding genes in Cucurbitaceae crops, NBS-encoding RGHs from melon, bottle gourd, luffa, watermelon and squash were also identified via degenerate PCR amplification and database mining. First, PCR was performed using genomic DNA for young leaves from melon, bottle gourd, luffa, and watermelon using 3 pairs of degenerate primers. The young leaves in the second true-leaf stage were harvested, frozen immediately in liquid nitrogen, and stored at −80°C. Genomic DNA was isolated using a plant DNA extraction kit (Tiangen, China). The primers were designed by the previous researchers based on the conserved regions of P-loop and GLPL of amino acid identity among the known NBS-LRR R genes from the other plant species (Additional file 13). The PCR amplifications were performed in 20 μL reaction mixtures with 1 U of LATaq DNA proof reading polymerase (TaKaRa, Kyoto, Japan), 1 × PCR buffer, 1.5 mM MgCl2, 0.5 μM each of forward/reverse primers, 0.4 mM dNTP, and 50 ng of template DNA. PCR was performed in a PTC-100 thermal cycler (MJ Research, Inc., Watertown, MA). The cycling conditions consisted of an initial denaturation performed for 3 min at 94°C, followed by 35 cycles at 94°C for 30 s, 55°C for 45 s, and 72°C for 1 min. These were followed by a 10 min extension step at 72°C and 10°C to terminate the reaction.

The DNA fragments from the PCR were separated using 1.0% agarose gels. Fragments with the expected size (~500 bp) were excised and reclaimed from the gel and purified with a PCR purification kit (Qiagen, Germany). Subsequently, these fragments were combined with vector DNA to generate recombinant DNA molecules, and then transformed into competent Escherichia coli JM109 cells. Plasmid DNA was purified with a PCR purification kit (Qiagen, Germany). The DNA fragments were sequenced using an ABI 3730 sequencer (Applied Biosystems, Foster City, CA, USA). Then, each of the acquired DNA sequences was trimmed of vector sequence contamination using VecScreen at the National Center of Biotechnology Information (NCBI). Identity and similarity searches of nucleotide and amino acid sequences were performed using BLAST at the NCBI GenBank database (

Second, other RGHs in melon, watermelon, and squash were obtained from the GenBank database searches. All sequences from these species were downloaded and searched with the NBS domain of NBS-encoding sequences from A. thaliana and rice [3133] as the query. The RGHs in melon were sourced from a published paper [57]. In addition, Arabidopsis NBS-encoding proteins, which were obtained from, were selected for phylogenetic relationship analysis.

Sequence and phylogenetic analysis

Amino acid sequences of all NBS-encoding genes in the cucumber genome and RGHs from the other five Cucurbitaceae crops were aligned using Clustal X version 1.8 [58], followed by manual adjustment. The conserved domains of P-loop to GLPL of these proteins and RGHs were applied to construct a phylogenetic tree using the NJ method [59] and an NJ algorithm implemented in the Molecular Evolutionary Genetics Analysis software version 5.0 (MEGA 5.0) [56]. Bootstrapping (1000 replicates) was used to evaluate the degree of support for a particular grouping pattern in the phylogenetic tree. Branch lengths were assigned by pairwise calculations of the genetic distances, and missing data were treated by pairwise deletions of the gaps.


  1. 1.

    Wyszogrodzka AJ, Williams PH, Peterson CE: Multiple-pathogen inoculation of cucumber (Cucumis sativus) seedlings. Plant Dis. 1987, 71: 275-280.

    Article  Google Scholar 

  2. 2.

    Abul-Hayja ZM: Multiple disease screening and genetics of resistance cucumber. 1975, Madison: University of Wisconsin, 149-Ph. D. thesis

    Google Scholar 

  3. 3.

    Palmer MJ, Williams PH: A seedling evaluation method for Fusarium wilt of cucumber incited by Fusarium oxysporum f.sp. cucumerinum. (Abstr.). Phytopathol. 1981, 71: 247-

    Google Scholar 

  4. 4.

    Chen JF, Moriarty G, Jahn M: Some disease resistance tests in Cucumis hystrix and its progenies from interspecific hybridization with cucumber. Progress in cucurbit genetics and breeding research. Proceedings of Cucurbitaceae 2004, the 8th EUCARPIA Meeting on Cucurbit Genetics and Breeding. Edited by: Lebeda A, Paris HS. 2004, Olomouc (CZ): Palacky University in Olomouc, 189-196.

    Google Scholar 

  5. 5.

    Chen JF, Lewis S: New source of nematode resistance was identified in Cucumis. Cucurbit Genet Coop Rep. 2000, 23: 32-35.

    CAS  Google Scholar 

  6. 6.

    Ellis J, Jones D: Structure and function of proteins controlling strain-specific pathogen resistance in plant. Curr Opin Plant Biol. 1998, 1: 288-293.

    Article  CAS  PubMed  Google Scholar 

  7. 7.

    Wan HJ, Zhao G, Ahmed AM, Qiao CT, Chen JF: Identification and characterization of potential NBS-encoding resistance genes and induction kinetics of a putative candidate gene associated with downy mildew resistance in Cucumis. BMC Plant Biol. 2010, 10: 186-

    PubMed Central  Article  PubMed  Google Scholar 

  8. 8.

    Wan HJ, Chen JF: Characterization of NBS-LRR resistance gene analogs from a high resistance to downy mildew introgression line from Cucumis hystrix x C. sativus. Acta. horticul. 2010, 871: 573-578.

    CAS  Google Scholar 

  9. 9.

    Wan HJ, Qiao CT, Ahmed AM, Zhao ZG, Chen JF: Isolation, phylogeny and evolutionary analysis of Pto-type disease resistance gene analogues from a Cucumis hystrix introgression line of cucumber (C. sativus). Funct Plant Biol. 2010, 37: 513-523.

    Article  CAS  Google Scholar 

  10. 10.

    Huang S, Li R, Zhang Z, Li L, Gu X, Fan W, Lucas WJ, Wang X, Xie B, Ni P, Ren Y, Zhu H, Li J, Lin K, Jin W, Fei Z, Li G, Staub J, van der Kilian A, Vossen EA, Wu Y, Guo J, He J, Jia Z, Tian G, Lu Y, Ruan J, Qian W, Wang M, Huang Q, Li B, Xuan Z, Cao J, Asan Wu Z, Zhang J, Cai Q, Bai Y, Zhao B, Han Y, Li Y, Li X, Wang S, Shi Q, Liu S, Cho WK, Kim JY, Xu Y, Heller-Uszynska K, Miao H, Cheng Z, Zhang S, Wu J, Yang Y, Kang H, Li M, Liang H, Ren X, Shi Z, Wen M, Jian M, Yang H, Zhang G, Yang Z, Chen R, Ma L, Liu H, Zhou Y, Zhao J, Fang X, Fang L, Liu D, Zheng H, Zhang Y, Qin N, Li Z, Yang G, Yang S, Bolund L, Kristiansen K, Li S, Zhang X, Wang J, Sun R, Zhang B, Jiang S, Du Y: The genome of the cucumber, Cucumis sativus L. Nat Genet. 2009, 41: 1275-1281.

    Article  CAS  PubMed  Google Scholar 

  11. 11.

    Kang HX, Weng YQ, Yang YH, Zhang ZH, Zhang SP, Mao ZC, Cheng GH, Gu XF, Huang SW, Xie BY: Fine genetic mapping localizes cucumber scab resistance gene Ccu into an R gene cluster. Theor Appl Genet. 2011, 122: 795-803.

    Article  CAS  PubMed  Google Scholar 

  12. 12.

    Helm MA, Hemleben V: Characterization of a new prominent satellite DNA of Cucumis metuliferus and differential distribution of satellite DNA in cultivated and wild species of Cucumis and in related genera of Cucurbitaceae. Euphytica. 1997, 94: 219-226.

    Article  CAS  Google Scholar 

  13. 13.

    Sanjur OI, Piperno DR, Andres TC, Wessel-Beaver L: Phylogenetic relationships among domesticated and wild species of Cucurbita (Cucurbitaceae) inferred from a mitochondrial gene: Implications for crop plant evolution and areas of origin. Proc Natl Acad Sci USA. 2002, 99: 535-540.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  14. 14.

    Levi A, Thomas CE, Simmons AM, Thies JA: Analysis based on RAPD and ISSR markers reveals closer similarities among Citrullus and Cucumis species than with Praecitrullus fistulosus (Stocks) Pangalo. Genet Resour Crop Evol. 2005, 52: 465-472.

    Article  CAS  Google Scholar 

  15. 15.

    Sikdar B, Bhattacharya M, Mukherjee A, Banerjee A, Ghosh E, Ghosh B, Roy SC: Genetic diversity in important members of Cucurbitaceae using isozyme, RAPD and ISSR markers. Biol Plant. 2010, 54: 135-140.

    Article  CAS  Google Scholar 

  16. 16.

    Schaefer H, Heibl C, Renner SS: Gourds afloat: a dated phylogeny reveals an Asian origin of the gourd family (Cucurbitaceae) and numerous oversea dispersal events. Proc R Soc B. 2009, 276: 843-851.

    PubMed Central  Article  PubMed  Google Scholar 

  17. 17.

    Dangl JL, Jones JDG: Plant pathogens and integrated defence responses to infection. Nature. 2001, 411: 826-833.

    Article  CAS  PubMed  Google Scholar 

  18. 18.

    McDowell JM, Woffenden BJ: Plant disease resistance genes: recent insights and potential applications. Trends Biotech. 2003, 21: 178-183.

    Article  CAS  Google Scholar 

  19. 19.

    Martin GB, Bogdanove AJ, Sessa G: Understanding the function of plant disease resistance proteins. Annu Rev Plant Biol. 2003, 54: 23-61.

    Article  CAS  PubMed  Google Scholar 

  20. 20.

    Keen NT: Gene-for-gene complementarity in plant-pathogen interactions. Annu Rev Genet. 1990, 24: 447-473.

    Article  CAS  PubMed  Google Scholar 

  21. 21.

    Van der Hoorn RAL, Kamoun S: From guard to decoy: a new model for perception of plant pathogen effectors. Plant Cell. 2008, 20: 2009-2017.

    PubMed Central  Article  PubMed  Google Scholar 

  22. 22.

    Van der Biezen EA, Jones IDG: Plant disease resistance proteins and the gene-for-gene concept. Trends Plant Sci. 1998, 23: 454-456.

    CAS  Google Scholar 

  23. 23.

    Meyers BC, Dickerman AW, Michelmore RW, Sivaramakrishnan S, Sobral BW, Young ND: Plant disease resistance genes encode members of an ancient and diverse protein family within the nucleotide-binding superfamily. Plant J. 1999, 20: 317-332.

    Article  CAS  PubMed  Google Scholar 

  24. 24.

    Pan QL, Liu YS, Budai-Hadrian O, Sela M, Carmel-Goren L, Zamir D, Fluhr R: Comparative genetics of nucleotide binding site-leucine rich repeat resistance gene homologues in the genomes of two dicotyledons: tomato and Arabidopsis. Genetics. 2000, 155: 309-322.

    PubMed Central  CAS  PubMed  Google Scholar 

  25. 25.

    Kanazin V, Mareck L, Shoemaker P: Resistance gene analogs are conserved and clustered in soybean. Proc Natl Acad Sci USA. 1996, 93: 11746-11750.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  26. 26.

    Leister D, Ballvora A, Salamini F, Gebhardt C: A PCR-based approach for isolating pathogen resistance genes from potato with potential for wide application in plants. Nat Genet. 1996, 14: 421-429.

    Article  CAS  PubMed  Google Scholar 

  27. 27.

    Shen KA, Meyers BC, Islam-Faridi MN, Chin DB, Stelly DM, Michelmore RW: Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce. Mol Plant Microbe Interact. 1998, 11: 815-823.

    Article  CAS  PubMed  Google Scholar 

  28. 28.

    Noir S, Combes MC, Anthony F, Lashermes P: Origin, diversity and evolution of NBS-type disease-resistance gene homologues in coffee trees (Coffea L.). Mol Genet Genomics. 2001, 265: 654-662.

    Article  CAS  PubMed  Google Scholar 

  29. 29.

    Martínez-Zamora MG, Castagnaro AP, Díaz-Ricci JC: Isolation and diversity analysis of resistance gene analogues (RGHs) from cultivated and wild strawberries. Mol Genet Genomic. 2004, 272: 480-487.

    Article  Google Scholar 

  30. 30.

    Nair RA, Thomas G: Isolation, characterization and expression studies of resistance gene candidates (RGCs) from zingiber spp. Theor Appl Genet. 2007, 116: 123-134.

    Article  CAS  Google Scholar 

  31. 31.

    Meyers BC, Kozik A, Griego A, Kuang H, Michelmore RW: Genome-wide analysis of NBS-LRR genes in Arabidopsis. Plant Cell. 2003, 15: 809-834.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  32. 32.

    Zhou T, Wang Y, Chen JQ, Araki H, Jing Z, Jiang K, Shen J, Tian D: Genome-wide identification of NBS genes in rice reveals significant expansion of divergent non-TIR NBS genes. Mol Genet Genomics. 2004, 271: 402-415.

    Article  CAS  PubMed  Google Scholar 

  33. 33.

    Yang SH, Feng ZM, Zhang XY, Jiang K, Jin XQ, Hang YY, Chen LQ, Tian DC: Genome-wide investigation on the genetic variations of rice disease resistance genes. Plant Mol Biol. 2006, 62: 181-193.

    Article  CAS  PubMed  Google Scholar 

  34. 34.

    Xiao S, Ellwood S, Calis O, Patrick E, Li T, Coleman M, Turner JG: Broad-spectrum mildew resistance in Arabidopsis thaliana mediated by RPW8. Science. 2001, 291: 118-120.

    Article  CAS  PubMed  Google Scholar 

  35. 35.

    Garcia-Mas J, Benjak A, Sanseverino W, Bourgeois M, Mir G, Gonzalez VM, Henaff E, Camara F, Cozzuto L, Lowy E, Alioto T, Capella-Gutierrez S, Blanca J, Canizares J, Ziarsolo P, Gonzalez-Ibeas D, Rodriguez-Moreno L, Droege M, Du L, Alvarez-Tejado M, Lorente-Galdos B, Mele M, Yang L, Weng Y, Navarro A, Marques-Bonet T, Aranda MA, Nuez F, Pico B, Gabaldon T, Roma G, Guigo R, Casacuberta JM, Arus P, Puigdomenech P: The genome of melon (Cucumis melo L.). Proc Natl Acad Sci USA. 2012, 109: 11872-11877.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  36. 36.

    Guo AY, Zhu QH, Chen X, Luo JC: GSDS: a gene structure display server. Yi Chuan. 2007, 29: 1023-1026.

    Article  CAS  PubMed  Google Scholar 

  37. 37.

    Yang S, Zhang X, Yue JX, Tian D, Chen JQ: Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics. 2008, 280: 187-198.

    Article  CAS  PubMed  Google Scholar 

  38. 38.

    Florian J, Leighton P, Graham JE, Katrin M, Peter JAC, Frank W, Sanjeev KS, Dan B, Glenn B, Jonathan DGJ, Ingo H: Identification and localisation of the NB-LRR gene family within the potato genome. BMC Genomics. 2012, 13: 75-

    Article  Google Scholar 

  39. 39.

    Tarek J, Joseph JK, Shelly JN, Claude ET, Ralph AD: The Fusarium wilt resistance locus Fom-2 of melon contains a single resistance gene with complex features. Plant J. 2004, 39: 283-297.

    Article  Google Scholar 

  40. 40.

    Pan Q, Wendel J, Fluhr R: Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes. J Mol Evol. 2000, 50: 203-213.

    CAS  PubMed  Google Scholar 

  41. 41.

    Cannon SB, Zhu HY, Baumgarten AM, Spangler R, May G, Cook DR, Young ND: Diversity, distribution, and ancient taxonomic relationships within the TIR and non-TIR NBS-LRR resistance gene families. J Mol Evol. 2002, 54: 548-562.

    Article  CAS  PubMed  Google Scholar 

  42. 42.

    Ameline-Torregrosa C, Wang BB, OòBleness MS, et al: Identification and characterization of nucleotide-binding site-leucine-rich repeat genes in the model plant Medicago truncatula. Plant Physiol. 2008, 146: 5-21.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  43. 43.

    Xu Q, Wen XP, Deng XX: Isolation of TIR and nonTIR NBS-LRR resistance gene analogues and identification of molecular markers linked to a powdery mildew resistance locus in chestnut rose (Rosa roxburghii Tratt). Theor Appl Genet. 2005, 111: 819-830.

    Article  CAS  PubMed  Google Scholar 

  44. 44.

    Peraza-Echeverria S, Dale JL, Harding RM, Smith MK, Collet C: Characterization of disease resistance gene candidates of the nucleotide binding site (NBS) type from banana and correlation of a transcriptional polymorphism with resistance to Fusarium oxysporum f.sp. cubense race 4. Mol Breeding. 2008, 22: 565-579.

    Article  CAS  Google Scholar 

  45. 45.

    Reddy BU: Cladistic analyses of a few members of Cucurbitaceae using rbcl nucleotide and amino acid sequences. Int J Bioinf Res. 2009, 1: 58-64.

    Article  Google Scholar 

  46. 46.

    Sanseverino W, Roma G, De Simone M, Faino L, Melito S, Stupka E, Frusciante L, Ercolano MR: PRGdb: a bioinformatics platform for plant resistance gene analysis. Nucleic Acids Res. 2010, 38: D814-821.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  47. 47.

    Porter BW, Paidi M, Ming R, Alam M, Nishijima WT, Zhu YJ: Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family. Mol Genet Genomic. 2009, 281: 609-626.

    Article  CAS  Google Scholar 

  48. 48.

    Tarr DEK, Alexander HM: TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocotorders. BMC Res Notes. 2009, 2: 197-

    PubMed Central  Article  PubMed  Google Scholar 

  49. 49.

    Bai JF, Pennill LA, Ning JC, Lee SW, Ramalingam J, Webb CA, Zhao B, Sun Q, Nelson JC, Leach JE, Hulbert SH: Diversity in nucleotide binding site-leucine-rich repeat Genes in cereals. Genome Res. 2002, 12: 1871-1884.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  50. 50.

    Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, Déjardin A, Depamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasjärvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Leplé JC, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouzé P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai CJ, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, Van de Peer Y, Rokhsar D: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313: 1596-1604.

    Article  CAS  PubMed  Google Scholar 

  51. 51.

    Plocik A, Layden J, Kesseli R: Comparative analysis of NBS domain sequences of NBS-LRR disease resistance genes from sunflower, lettuce, and chicory. Mol Phylogenet Evol. 2004, 31: 153-163.

    Article  CAS  PubMed  Google Scholar 

  52. 52.

    Richly E, Kurth J, Lesiter D: Mode of amplification and reorganization of resistance genes during recent Arabidopsis thaliana evolution. Mol Biol Evol. 2002, 19: 76-84.

    Article  CAS  PubMed  Google Scholar 

  53. 53.

    Lupas A, Van Dyke M, Stock J: Predicting coiled coils from protein sequences. Science. 1991, 252: 1162-1164.

    Article  CAS  PubMed  Google Scholar 

  54. 54.

    Kohler A, Rinaldi C, Duplessis S, Baucher M, Geelen D, Duchaussoy F, Meyers BC, Boerjan W, Martin F: Genome-wide identification of NBS resistance genes in Populus trichocarpa. Plant Mol Biol. 2008, 66: 619-636.

    Article  CAS  PubMed  Google Scholar 

  55. 55.

    Cheng Y, Li XY, Jiang HY, Ma W, Miao WY, Yamada T, Zhang M: Systematic analysis and comparison of nucleotide-binding site disease resistance genes in maize. FEBS J. 2012, 279: 2431-2443.

    Article  CAS  PubMed  Google Scholar 

  56. 56.

    Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  57. 57.

    Brotman Y, Silberstein L, Kovalski I, Perin C, Dogimont C: Resistance gene homologues in melon are linked to genetic loci conferring disease and pest resistance. Theor Appl Genet. 2002, 104: 1055-1063.

    Article  CAS  PubMed  Google Scholar 

  58. 58.

    Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25: 4876-4882.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  59. 59.

    Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.

    CAS  PubMed  Google Scholar 

Download references


This study was partially supported by National Basic Research Program of China (The 973 Program:2012CB113900; 2009CB119001-01); the key Program (30830079) of the National Natural Science Foundation of China; and the 863 project (2010AA10A108; 2012AA100202).

Author information



Corresponding author

Correspondence to Jinfeng Chen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

HJW participated in conceiving the paper, primer design, sequence analysis, and drafting the final manuscript. WY participated in DNA extraction and PCR amplification. KLB and SJ participated in bioinformatics, and modified the final manuscript. XP participated in conceiving the study, and modified the final manuscript. JFC critically reviewed the manuscript and gave financial support to the study. All authors read and approved the final manuscript.

Electronic supplementary material

Coding DNA and protein sequences of the NBS-encoding genes from cucumbers (

Additional file 1: Cucumis sativus L).(DOC 451 KB)

Additional file 2: Predicted domains of each NBS-encoding genes and numbers of LRR motifs in cucumber.(XLS 33 KB)

Additional file 3: Sequence identity between 9930 and Gy14 NBS-encoding genes.(XLS 53 KB)

Additional file 4: Distribution of the NBS-encoding genes with different numbers of introns in cucumber.(TIFF 598 KB)

Motif patterns of NBS-encoding genes in cucumbers (

Additional file 5: Cucumis sativus L.). (A) CC-NBS family (B) TIR-NBS family (XLS 56 KB)

(A) Motif sequence logos in the cucumber CC-NBS family of NBS-encoding genes.

Additional file 6: (B) Motif sequence logos in the cucumber TIR-NBS family of NBS-encoding genes. (DOC 1 MB)

Additional file 7: PCR amplification products generated by the three selected resistance gene degenerate primers in the four major Cucurbitaceae crops.(DOC 51 KB)

Additional file 8: Phylogenetic comparison of Cucurbitaceae NBS-encoding genes and RGHs. The TIR- and CC-NBS families are distinct. The former is divided into subfamilies TIR1 to TIR 9 and the latter is separated into subfamilies CC1 to CC4. (DOC 59 KB)

The detailed phylogenetic tree from Figure

Additional file 9: 4. The tree consists of 63 Cucurbitaceae and 52 Arabidopsis CC-NBS protein sequences. Parentheses indicate the ancient CC family (N1 to N4) as defined by Cannon et al. [41]. (DOC 56 KB)

The detailed phylogenetic tree from Figure

Additional file 10: 5. The tree consists of 152 Cucurbitaceae and 106 Arabidopsis TIR-NBS sequences. Arabidopsis subfamilies are indicated by At-TIR-NBS-A to At-TIR-NBS-G corresponding to the seven TIR subfamilies identified by Richly et al. [52]. (DOC 53 KB)

Total number of predicted NBS-encoding genes identified in the six sequenced angiosperm genomes.

Additional file 11: The values for Carica papaya, Arabidopsis thaliana, Vitis vinifera, Oryza sativa, and Populus trichocarpa were previously summarized by Yang et al. [37] and Porter et al. [48]. The total number of predicted protein encoding genes and the genome size of each species is also shown. (DOC 36 KB)

Additional file 12: Amino acid sequence alignment of NBS-encoding genes. The conserved domains are highlighted and indicated by an arrow. The alignment was constructed using Clustal X software. (RTF 3 MB)

Additional file 13: Degenerate primers used for PCR amplification from the four major Cucurbitaceae crops (melon, bottle gourd, luffa and watermelon).(DOC 34 KB)

Authors’ original submitted files for images

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Wan, H., Yuan, W., Bo, K. et al. Genome-wide analysis of NBS-encoding disease resistance genes in Cucumis sativusand phylogenetic study of NBS-encoding genes in Cucurbitaceae crops. BMC Genomics 14, 109 (2013).

Download citation


  • Cucumber
  • Cucurbitaceae
  • Phylogenetic relationship