Saturated linkage map construction in Rubus idaeususing genotyping by sequencing and genome-independent imputation
© Ward et al.; licensee BioMed Central Ltd. 2013
Received: 9 July 2012
Accepted: 4 December 2012
Published: 16 January 2013
Rapid development of highly saturated genetic maps aids molecular breeding, which can accelerate gain per breeding cycle in woody perennial plants such as Rubus idaeus (red raspberry). Recently, robust genotyping methods based on high-throughput sequencing were developed, which provide high marker density, but result in some genotype errors and a large number of missing genotype values. Imputation can reduce the number of missing values and can correct genotyping errors, but current methods of imputation require a reference genome and thus are not an option for most species.
Genotyping by Sequencing (GBS) was used to produce highly saturated maps for a R. idaeus pseudo-testcross progeny. While low coverage and high variance in sequencing resulted in a large number of missing values for some individuals, a novel method of imputation based on maximum likelihood marker ordering from initial marker segregation overcame the challenge of missing values, and made map construction computationally tractable. The two resulting parental maps contained 4521 and 2391 molecular markers spanning 462.7 and 376.6 cM respectively over seven linkage groups. Detection of precise genomic regions with segregation distortion was possible because of map saturation. Microsatellites (SSRs) linked these results to published maps for cross-validation and map comparison.
GBS together with genome-independent imputation provides a rapid method for genetic map construction in any pseudo-testcross progeny. Our method of imputation estimates the correct genotype call of missing values and corrects genotyping errors that lead to inflated map size and reduced precision in marker placement. Comparison of SSRs to published R. idaeus maps showed that the linkage maps constructed with GBS and our method of imputation were robust, and marker positioning reliable. The high marker density allowed identification of genomic regions with segregation distortion in R. idaeus, which may help to identify deleterious alleles that are the basis of inbreeding depression in the species.
KeywordsGenotyping by sequencing GBS RADseq Imputation Raspberry Rubus idaeus Psuedotestcross Linkage map Segregation distortion
Genetic linkage maps permit the elucidation of genome structure and organization and enable the identification of molecular markers linked to traits in an experimental segregating progeny, leading ultimately to the elucidation of the genetic basis of the trait of interest. As a result, maps have been developed for many diverse plant species [1–9]. Traditionally, transferable linkage map development has been achieved through the scoring of restriction fragment length polymorphisms (RFLPs) , microsatellites (SSRs) and gene specific markers  in a segregating progeny. Using such markers, saturated reference linkage maps for many plant species have been developed. Reference maps inform the selection of markers for mapping in other progenies [10–12] and have been used to anchor, order and orientate physical map BAC contigs, and genome sequencing scaffolds for the assignment of pseudo-chromosomes for whole genome sequence initiatives [13–17].
Single nucleotide polymorphisms (SNPs) are the most abundant mutations between related DNA molecules. The advent of affordable second generation sequencing technologies in recent years has led to the release of whole genome reference sequences for many plant species [6, 14, 18–20], and the identification of abundant SNPs throughout the genomes of these organisms [21–23]. Thus, SNPs are becoming increasingly important as markers for both fundamental and applied genetics research in plants. Relatively low throughput methods have been developed for the analysis and mapping of SNPs. These include high resolution melting (HRM) , and the cleaved amplified polymorphic DNA (CAPs) assay . Additionally, medium and high throughput genotyping assays have been developed that permit hundreds of thousands of SNPs to be interrogated simultaneously on a single multiplexed array. Platforms for genotyping in this way include SNPlex, Golden Gate, Infinium and Axiom, which have been employed successfully for genotyping in many plant species including apple, peach, grape and purple false brome [22, 23, 26–29]. Genotyping arrays have many advantages over other techniques for genetic analysis, however an essential prerequisite for array development is a predetermined set of SNPs, preferably located at known positions on a reference genome sequence. Additionally, the transferability of heterozygous SNPs between species has been shown to be low  and as such, in many genera, arrays must be developed specifically for the species under investigation. Thus for minor crops and for genotyping interspecific progenies or species complexes, the development of arrays is currently not a viable experimental solution.
Despite the second generation sequencing ‘revolution’ in the biological sciences, many crops of significant economic importance remain without a reference genome sequence, or an abundance of SNP data in public repositories. High throughput SNP genotyping for these organisms using array-based technologies is not economically viable, yet rapid, high-throughput SNP genotyping would be immensely advantageous for the progression of classical mapping and QTL analyses, for genome-wide association studies and pedigree-based analyses, genomic selection and for the development and implementation of marker-assisted breeding and selection.
Second generation sequencing has offered the possibility to genotype sequence variation in the genome of an organism for use in mapping experiments through whole genome re-sequencing. Whole genome re-sequencing has been employed for mapping in eukaryotic species with a relatively small genome size and on a selective mapping populations such as for the fungus Venturia inequalis. However, for the majority of organisms, even those with relatively small genomes such as the diploid strawberry Fragaria vesca[14, 21] a complexity reduction step must be performed prior to sequencing to enable sufficient depth of coverage of the same regions in all genomes of a segregating progeny to permit segregation to be scored. Genotyping through the sequencing of reduced representation genomic libraries developed through restriction digestion of genomic DNA (restriction-site associated DNA; RAD) was initially proposed by Miller (2007)  and adapted to incorporate barcoding for multiplexing with Illumina sequencing technology by Baird et al. (2008) . The RAD procedure has been used successfully to identify SNPs in a number of plant species including eggplant, barley, and globe artichoke [33–35] and its utility to linkage map development and QTL analysis in a large mapping population was demonstrated recently by Pfender et al. (2011) . Subsequently, Elshire et al. (2011) proposed a method for the construction of highly multiplexed reduced complexity genotyping by sequencing (GBS) libraries. The procedure is based on a similar restriction digestion technique to RAD, however it is substantially less complicated, resulting in time and cost savings in library preparation, but the resultant data contains a larger number of missing genotype calls.
Rubus is a genus in the Rosaceae family containing more than 600 species, some of which, such as R. idaeus subsp. idaeus L. (red raspberry) and Rubus L. subgenus Rubus Watson (blackberry) are of economic importance as cultivated fruit crops. Breeding methods for these species have remained largely unchanged since the first empirical breeding programs were initiated. However, changes in cultural practices, the withdrawl of soil fumigants, and demands for increased fruit quality, shelf-life and for the extension of the traditional cultivation season, have necessitated novel breeding techniques to satisfy the demand for new cultivars. The development and application of molecular tools for Rubus would increase the speed and precision of the breeding process, particularly for traits that are difficult to characterize phenotypically, such as pyramided resistances to pests or pathogens. Looking further forward, Rubus breeding would greatly benefit from genomic selection approaches that have recently become popular in crops such as maize, barley, and wheat  because even modest gains from genomic selection could save years of in-field evaluation. An essential precursor to the development of such tools is the characterization of an abundance of informative molecular markers with which to perform marker-trait association analyses. In Rubus, the majority of molecular markers that have been developed and mapped in the genus to date are SSRs [4, 39–43]. More recently, low throughput methods were employed for mapping SNP markers in an interspecific Rubus mapping progeny [44, 45], but high throughput methods for the identification and mapping of molecular markers have yet to be reported for the genus.
In this investigation, we have exploited the recent advances in low-cost sequencing and multiplexed library preparation  to generate segregation data for SNP markers distributed throughout the R. idaeus genome. We used these markers for linkage map construction in a red raspberry progeny from the cross ‘Heritage’×‘Tulameen’ (H×T). The segregation data was generated using multiplexed sequencing on the Illumina HiSeq sequencing-by-synthesis platform. Shallow genome sampling resulted in a data set containing a large proportion of missing values, and thus we developed a pipeline which includes a novel imputation algorithm (Maskov) to deal with the missing and putatively erroneous data through comparison of genotypes in internal genotype bins following initial co-segregation analysis. The challenges and solutions to generating and handling segregation data from thousands of loci for linkage mapping are discussed.
Genotyping by sequencing
Sequencing resulted in 135,776,036 reads including deeper coverage of parents. There were 19,623,392 reads for ‘Heritage’ and 20,293,782 for ‘Tulameen’. Within the population the mean number of reads was 1,350,125 and the median was 977,402 per individual. Forty-two individuals from the progeny were sequenced in library one and 29 individuals were sequenced in library two although each individual within a library was part of a 96-plex reaction in a single sequencing lane at two different sequencing centers. Sequence quality differed between the two sequencing lanes with a mean phred score at base 64 of 26.7 in library one and a mean phred score at base 64 of 33.6 in library two. Overall, library one had lower per base quality scores and a greater per base interquartile range compared to library two (Additional file 1: Figure S1). However, on a per read basis both libraries had quality scores greater than 37 for most reads (Additional file 2: Figure S2). In library one approximately 19.15 percent of reads contained N’s (uncalled bases) compared to only 7.53 percent of reads in library two. Further differences in the error rate between the lanes is illustrated by the percentage of unique reads in each library because as coverage increases additional unique reads are likely to result from sequencing errors rather than new sequences (Additional file 3: Figure S3). Overrepresented sequences in the libraries had high sequence similarity to the Fragaria vesca chloroplast genome and accounted for approximately 5.5 percent of library one and approximately 6.3 percent of library two (Additional file 4: Table S1) as determined by alignment with bowtie (Langmead et al., 2009). The percent missing data was also a clear function of sequencing depth (Additional file 5: Figure S4).
Number of segregating SNPs identified
A total of 9143 segregating SNPs were identified in the progeny following analysis of raw data using Stacks . Of these, 4744 were present in the parental configuration AB×AA (i.e. heterozygous only in ‘Heritage’), 2672 in the configuration AA×AB (i.e. heterozygous only in ‘Tulameen’), and the remaining 1727 in the configuration AB×AB (i.e. heterozygous in both parents). To simplify the process of imputation, and subsequent analysis using maximum likelihood implemented in JOINMAP 4.0 (Kyasma, NL), only SNPs segregating in a uni-parental configuration, i.e. AB×AA or AA×AB were used for further analysis.
Segregating SSRs identified in the H × T progeny
Total number of SNPs mapped and percentage of missing values
Following initial co-segregation analysis a total of 4521 SNPs displaying the parental configuration AB × AA (i.e. heterozygous in the ‘Heritage’ parental genotype), along with the 33 SSR markers scored in the progeny, coalesced into seven linkage groups associated with the haploid chromosome number for the species at a minimum LOD score of 7.0. A further 2391 SNPs, along with 12 SSR loci displaying the parental configuration AA × AB (i.e. heterozygous in the ‘Tulameen’ parental genotype), coalesced into seven linkage groups at a minimum LOD score of 7.0. The total number of data-points analysed in the initial phase of mapping was 323,334 in 71 seedlings in the ‘Heritage’ data set and 170,613 in 71 seedlings in the ‘Tulameen’ data set, containing a total of 116,728 (36%) and 61,481 (36%) missing values respectively. The average number of perceived recombination events per individual was 22.45 in ‘Heritage’ and 11.22 in ‘Tulameen’, indicating a large number of double recombination events due to erroneous marker genotypes.
The total number of SNP and SSR markers mapped to the ‘Heritage’ linkage map, the number of markers per chromosome and the total length of each LG in centi-Morgans (cM)
LG length (cM)
The total number of SNP and SSR markers mapped to the ‘Tulameen’ linkage map, the number of markers per chromosome and the total length of each LG in centi-Morgans (cM)
LG length (cM)
Using a recently reported method of multiplexed, reduced representation library construction  and massively parallel sequencing using the Illumina HiSeq platform, GBS was successfully employed to produce a high density, saturated linkage map for a red raspberry (R. ideaus) mapping population. Problems of missing data and false negative genotyping calls were overcome by relying on data from SNP genotyping bins to perform imputation of missing and erroneous data points within the segregation data matrix using Maskov. The ‘Heritage’ and ‘Tulameen’ linkage maps produced were of a comparable length to previously-published linkage maps of the species [4, 47] and to the linkage maps of closely-related genera such as diploid Fragaria[3, 11] and diploid Rosa, but shorter than the L×GM Rubus linkage maps published by Graham et al. (2006, 2011) [42, 45]. A comparison of common SSR markers revealed almost complete colinearity between the ‘Heritage’ and L×GM maps, but a reduction in genetic distance on the ‘Heritage’ map. Since the process of imputation employed tended towards conservatively placing markers into genotypic bins and thus eliminating the occurrence of spurious double recombination events within the data, the process would also tend to reduce the overall length of the linkage maps produced. However, positioning of common markers has demonstrated that the imputation process employed results in accurate marker placement, albeit at the expense of precise marker ordering within genotyping bins.
Despite the calculation of relatively low inbreeding coefficients for both ‘Heritage’ and ‘Tulameen’ (0.094 and 0.069 respectively) by Dale et al. (1993) , in this investigation we observed almost twice the level of heterozygous SNPs in the genome of ’Heritage’ than in the genome of ‘Tulameen’. Relatively high levels of genome differentiation and heterozygosity is a feature of red raspberry germplasm, despite the majority of modern varieties being derived from a narrow genetic base . The genome of ‘Heritage’, the more heterozygous of the two parental genotypes, is currently being sequenced by an international consortium , and thus data from the relative positions of SNPs mapped in this investigation within sequence scaffolds of the ‘Heritage’ genome sequence will help to validate the SNP positioning on the H×T linkage maps and will increase the precision of SNP positions within genotype blocks. Additionally, the genetic positions of the SNPs on the H×T linkage maps will permit anchoring of sequencing scaffolds and the development of pseudochromosomes for the Rubus genome sequence, as had been performed for other highly heterozygous genome sequences [6, 15].
The GBS approach used in this investigation enabled the identification and mapping of an unprecedented number of sequence characterized markers in Rubus and to produce the most saturated linkage map for a species within the Rosaceae family to date, at a fraction of the time and cost of developing maps for Rubus using traditional marker technologies such as SSRs  and gene specific and EST-based markers . Indeed, the methods employed here are more cost-effective than the array-based methods of SNP detection and scoring, such as the IRSC Infinium whole genome genotyping array recently developed and used for linkage map construction in Malus[22, 26]. However, GBS as used in this investigation yielded data containing large amounts of missing values. Splitting the library preparations between two lanes of sequencing allowed examination of the effect that varied quality in sequencing has on the outcome. One sequencing center provided data with nearly twice as many uncalled bases and in the current implementation of Stacks reads containing uncalled bases are discarded. Increasing depth of coverage by sequencing each individual in multiple lanes would likely resolve the issue of missing values, but it is also expected that starting with DNA of increased quality and purity would result in a more uniform restriction digestion and adapter ligation. Therefore performing manual DNA extraction or preparing multiple libraries with independent automated DNA extractions may result in more uniform sequencing and fewer missing values when the GBS method of Elshire et al. (2011)  is applied to linkage map construction. The most robust method is likely to be one in which two independent library preparations are conducted and sequenced for each progeny individual in separate lanes. Choosing an enzyme that cuts less frequently could also reduce the number of missing values by increasing coverage per restriction fragment. Using a more rare cutting enzyme could also potentially reduce the amount of sequenced chloroplast DNA. However, the use of rare cutting enzymes in pseudo-testcross progenies that are less heterozygous would also dramatically decrease the number of markers detected in the AA × AB and AB × AA configuration. As sequencing yield and quality continues to increase and costs continue to decrease, the desire to conduct larger and more highly multiplexed experiments may propagate the problem of missing data further. The Maskov imputation program that we present here can be used to overcome the challenges of missing data through map-based imputation.
On previously reported Rubus linkage maps, regions of significant segregation distortion have been observed . Similar regions of segregation distortion were observed in this investigation, however, the depth of marker saturation of the linkage maps presented here allowed us to plot the occurrence of segregation distortion along each linkage group with a high degree of precision. A number of well-defined regions of the ‘Heritage’ and ‘Tulameen’ linkage maps exhibited significant segregation distortion and in many cases these regions were conserved between the two parental linkage maps, indicating the presence of lethal or sub-lethal genes that are conserved in heterozygous form in both parental genotypes. Jennings (1967)  reported on the genetics of two loci, H conferring the presence of cane pubescence, and T conferring the presence of red pigmentation, and observed that they are rarely present in the homozygous forms HH and TT which was postulated to be due to lethal or sub-lethal genes linked in coupling to the dominant allele of each gene. Later, a gene affecting the viability of seeds in raspberry progenies and determining the presence or absence of cotyledonary glands was also described by Jennings (1972) . Graham et al. (2006)  reported a genetic map position for gene H on LG2 of the ‘Latham’ × ‘Glen Moy’ genetic linkage map, which is within the region of one of the defined areas of segregation distortion on the ‘Heritage’ linkage map, as well as on the linkage map of Sargent et al. (2007) , but not on the ‘Tulameen’ map. Both ‘Heritage’ and ‘Tulameen’ present the glabrous recessive phenotypes for gene H (i.e. hh) however, the maps presented here suggest that there are a number of genes distributed throughout the seven R. idaeus chromosomes that exhibit a lethal or highly detrimental sub-lethal effects which are conserved in heterozygous form in the ‘Heritage’ and ‘Tulameen’ genotypes, presumably due to advantages associated with pleiotrpic effects of the recessive lethal or sub-lethal alleles. These genes are most likely a factor in the high degree of heterozygosity that is maintained in Rubus despite the breakdown of the self-incompatibility system in the species . Whilst segregation distortion has previously been observed on genetic linkage maps of Rubus, in this investigation, we have mapped markers in sufficient numbers to permit the identification of a number of conserved genetic regions between linkage maps putatively responsible for biased transmission of alleles. The availability of a genome sequence for Rubus would potentially allow the identification of candidate genes creating the segregation distortion apparent on the H×T linkage maps.
Using GBS followed by imputation of missing data guided by marker membership to genotyping bins using Maskov, we have identified and mapped a total of 6912 SNPs in Rubus and developed a comprehensive SNP reference map for red raspberry. As the flanking sequences of each of the SNPs presented here have been defined and are available in Table S2, marker positions from this investigation can be used to inform studies in other Rubus populations. Fine mapping of regions of interest could be performed either through development of CAPs markers , or HRM assays  from SNPs within regions of interest to saturate existing Rubus linkage maps, or by first identifying heterozygous SNPs from GBS of parental lines of genetically undefined mapping populations. This could be followed by design of assays for selected heterozygous SNPs distributed throughout the seven Rubus linkage groups. The approach described here is suitable for the rapid and reliable development of saturated linkage mapping resources for any organism, whether or not it has been previously genetically characterized, or has an available genome sequence, and provides a wealth of genetic information that can serve as the starting point for downstream genetic investigations such as QTL analyses, positional cloning of genes controlling traits of interest, the anchoring of genome sequence contigs and the development of genomic selection strategies.
Plant material and DNA extraction
To generate a segregating population, a cross was made between the R. idaeus cultivars ‘Heritage’ (National Clonal Germplasm Repository accession # PI 553382) and ‘Tulameen’ (National Clonal Germplasm Repository accession # PI 618441). The resulting seeds were germinated and grown under glasshouse (double walled polycarbonate) conditions and the population denoted H×T for ease of reference. Young fresh leaf material was collected from the progeny, snap frozen and ground to a fine powder under liquid nitrogen. DNA was extracted in 96-well plate format using the Omega-E-Z extraction kit according to the manufacturer’s recommendations. DNA was quantified using PicoGreen (Invitrogen) against a λ standard DNA dilution series with a Synergy 2 fluorimeter (BioTek) then stored at −20°C prior to sequencing.
Genotyping by sequencing
To determine the optimal concentration of sequencing adapter to use per unit of DNA, a titration was performed using the methods, barcodes, adapters, and primers of Elshire et al. (2011) . Briefly, eight titrations were performed with 200 ng of DNA from ‘Heritage.’ DNA was digested with ApeKI (New England Biolabs, Ipswitch MA) for 2 hours at 75°C. Following digestion, various quantities of ApeKI adapter (1.8 ng, 2.4 ng, 3.6 ng, 4.2 ng, 4.8 ng, 5.4 ng, 6.0 ng, and 7.2 ng) were ligated to the resulting restriction fragments using T4 ligase (New England Biolabs, Ipswitch, Massachusetts, USA) with 60 minute incubation at 22°C followed by a 30 minute ligase denaturation step at 65°C. The ligation reaction was purified with a Qiagen PCR cleanup kit (Qiagen, Valencia, California, USA) as per the manufacturer’s instructions.
Next, 10 μl of the purified reaction was used in a 50 μl PCR reaction with 25 μl PCR 2x Taq Master Mix (New England Biolabs, Ipswitch, Massachusetts, USA), and 25 pmol of each primer. Thermal cycling was initiated with 5 minutes at 72°C and 30 seconds at 98°C followed by 18 cycles of 10 seconds at 98°C, 30 seconds at 65°C and 30 seconds at 72°C. A final extension was performed at 72°C for 5 minutes. An additional Qiagen PCR cleanup was performed and the subsequent libraries were analyzed on an Agilent Bioanalyzer (Agilent Technologies, Santa Clara, California, USA) and the electropherograms examined for library and dimer peaks. The adapter concentration of 3.6 ng yielded a satisfactory library without adapter dimer or other highly aberrant peaks. Thus genotyping by sequencing was performed as in the titration with the exception that 100 ng of DNA from progeny and correspondingly 1.8 ng of uniquely bar-coded adapter was used for each sample. All reactions were performed in separate wells for each genotype from the population and were pooled after ligation and before a 25 μl PCR. Digestion, ligation PCR conditions, and thermal cycling were the same as in the titration.
The 71 progeny from the H×T population were split between two library preparations (42 genotypes in the first and 29 genotypes in the second) and sequenced independently at two different sequencing centers, both as part of 96-plex reactions. Libraries were sequenced on the Illumina HiSeq 2000 sequencing platform (Illumina, San Diego, California). Sequencing reads were subsequently processed with custom perl scripts . Furthermore the script trimmed reads to 64 nt and only reads with ApeKI restriction sites were retained. Data was further processed in Stacks  with Stacks de novo, default settings and automated genotype corrections were allowed.
Quantification of over-represented reads and unique reads was determined by counting read frequency with a custom UNIX shell script. Reads with frequency greater than 1000 were initially screened against the NCBI nt database. After the initial determination that many over represented reads were highly similar to other chloroplast sequences, all overrepresented reads were aligned to the Fragaria vesca chloroplast genome (GenBank: JF345175.1) using Bowtie  with default settings for reads in FASTA format.
Microsatellite amplification and scoring of heterozygous markers
The fingerprinting set proposed by Fernández-Fernández et al. 2011  was used to confirm the parentage of the seedlings and to identify those resulting from uncontrolled outcrossing or selfing. Seedlings resulting from outcrossing were removed from further analysis. Additionally, selected primer pairs from published primer sets [39–43] were labelled on the forward primer with either 6-FAM or HEX fluorescent dyes (IDT, Belgium) or NED and PET (Life Technologies Corporation, Carlsbad, California, USA) and tested for heterozygosity in the parental genotypes ‘Heritage’ and ‘Tulameen’ in single PCR reactions. From these, heterozygous markers from each of the seven previously reported Rubus linkage groups were identified for scoring in the full H×T progeny. Primer pairs generating heterozygous amplicons in the parental genotypes were combined by product size and fluorescent dye colour into multiplexes of up to eight primer pairs and PCR was performed using the ‘Type-it’ PCR mastermix (Qiagen, Valencia, California, USA) following the manufacturer’s recommendations, in a final volume of 12.5 μl. Reactions were performed using the following PCR cycles: an initial denaturation step of 95°C for 5 min was followed by 28 cycles of 95°C for 30 s, an annealing temperature of 55°C decreasing by 0.5°C per cycle until 50°C for 90 s and 72°C for 30 s, followed by a 30 min final extension step at 60°C. PCR products were fractionated by capillary electrophoresis through a 3100 genetic analyser (Life Technologies Corporation, Carlsbad, California, USA). Data generated were collected and analysed using the GENESCAN and GENOTYPER (Life Technologies Corporation, Carlsbad, California, USA) software.
Manual imputation of genotyping by sequencing data for linkage map construction
Sequence data obtained for any one individual contained both missing data and a degree of false-negative SNP calls. These missing data and false negatives resulted from library construction, the relatively low depth of sequencing coverage per progeny individual due to multiplexing, and sequencing biases created by the sequencing platform employed in this investigation, following initial marker ordering. Missing data complicated the computation of reliable marker ordering and sequencing artifacts led to a high number of perceived double recombination events per individual following initial map construction. Thus for accurate and reliable linkage map construction using GBS data, we implemented a system of data imputation that increased the accuracy of marker placement, at the expense of precision in localised marker order. Data output from Stacks  was combined with SSR data and formatted for linkage mapping using the standard codes for a ‘cross pollination’ (CP) type progeny of JOINMAP 4.0 (Kyasma, NL).
Automated imputation of genotyping by sequencing data for linkage map construction
The maskov algorithm
where M, the convolution mask, is a vector of size 2E+1, genotype (x) is the original data set with errors (but without missing values), and convolved(x) is a function that describes the location of recombination events.
The coefficients of the mask vector M are weighted to compute the effective first derivative at each point. The location of an edge is defined as any position x i where |convolved(x i )| > E and where E is a user defined parameter that represents the maximum number of expected consecutive errors. The sign of convolved(x i ) decides which phase transition is detected. The third and final pass fills in the “blocks” between the recombination edges using “winner take all” criteria to correct errors. As the length of the mask vector M increases, recombination edges are detected more reliably but the exact location of an edge will become less accurate in the presence of errors. Maskov also has a user-defined parameter that controls the amount of tolerated missing data for a given genotype and a threshold value T (by default T= E) that controls the detection of recombination edges. Additional description of the algorithm, including usage information and screenshots are provided (Additional file 7: Text S1). The first release of the program called Maskov (Version 1.01) is freely available at the Maskov google group.
Automated imputation with Maskov in Rubus idaeus
Maskov 1.01 was used to visualize recombination events and to perform imputation with the initial output from marker ordering with the maximum likelihood algorithm of JOINMAP 4.0 (Kyasma, NL). Imputation parameters in Maskov were set to E = 5 with the default threshold of E, and the maximum amount of missing data set at seventy percent.
Linkage map construction
Imputed GBS data along with data generated for SSR loci were analysed using the maximum likelihood function of JOINMAP 4.0 (Kyasma, NL) to enable linkage map construction. Data were grouped using a minimum LOD score of 7.0 and maps were constructed using the default maximum likelihood parameters. Following initial linkage map construction, the markers were colour-coded according to phase and genotype as previously described, and marker positions were visually inspected and resolved where necessary. Markers with identical genotypes in the H×T progeny were grouped into mapping bins of identical genotypes and a single genotype for each bin was then used for final map construction using regression mapping in JOINMAP 4.0 (Kyasma, NL) applying the Kosambi mapping function. Marker placement was determined for each linkage group of the two parental maps using a minimum LOD score threshold of 7.0, a recombination fraction threshold of 0.35, ripple value of 1.0, jump threshold of 3.0 and a triplet threshold of 5.0. Linkage groups were compared to previously published maps using the positions of common SSRs markers and nomenclature follows the recently revised numbering system of Bushakra et al. (2012). Maps presented were plotted using MapChart 2.0 .
Distribution of segregation distortion across the H × T linkage maps
Segregation distortion was determined by calculating x 2 values for all mapped markers using JOINMAP 4.0 (Kyasma, NL). Relative distortion along each linkage group of H×T maps was determined by plotting x 2 values against marker positions along each linkage group. Significance thresholds for P=0.05, P=0.01 and P=0.001 were plotted as dashed lines on the graphs.
Genotyping by Sequencing
Simple Sequence Repeats
Restriction fragment length polymorphisms
Cleaved amplified polymorphic DNA
Single nucleotide polymorphisms
High resolution melting
Restriction-site associated DNA
Quantitative trait loci
Polymerase chain reaction
Logarithm (base 10) of odds.
Research into Rubus genomics at FEM-IASMA is supported by the research office of the Provincia Autonoma di Trento. The authors thank Driscoll Strawberry Associates for assistance in sequence acquisition and the USDA-ARS National Clonal Germplasm Genetics Laboratory and distribution staff for optimizing DNA extraction in red raspberry and shipping the DNA.
- Akbari M, Wenzl P, Caig V, Carling J, Xia L, Yang S, Uszynski G, Mohler V, Lehmensiek A, Kuchel H, Hayden MJ, Howes N, Sharp P, Vaughan P, Rathmell B, Huttner E, Kilian A: Diversity arrays technology (DArT) for high-throughput profiling of the hexaploid wheat genome. Theor Appl Genet. 2006, 113: 1409-1420. 10.1007/s00122-006-0365-4. 10.1007/s00122-006-0365-4View ArticlePubMed
- Cho RJ, Mindrinos M, Richards DR, Sapolsky RJ, Anderson M, Drenkard E, Dewdney J, Reuber TL, Stammers M, Federspiel N, Theologis A, Yang WH, Hubbell E, Au M, Chung EY, Lashkari D, Lemieux B, Dean C, Lipshutz RJ, Ausubel FM, Davis RW, Oefner PJ: Genome-wide mapping with biallelic markers in Arabidopsis thaliana. Nat Genet. 1999, 23: 203-207. 10.1038/13833. 10.1038/13833View ArticlePubMed
- Sargent DJ, Clarke J, Simpson D, Tobutt K, Arus P, Monfort A, Vilanova S, Denoyes-Rothan B, Rousseau M, Folta K: An enhanced microsatellite map of diploid Fragaria. Theor Appl Genet. 2006, 112: 1349-1359. 10.1007/s00122-006-0237-y.View ArticlePubMed
- Sargent DJ, Fernández-Fernández F, Rys A, Knight VH, Simpson DW, Tobutt KR: Mapping of A1 conferring resistance to the aphid Amphorophora idaei and dw (dwarfing habit) in red raspberry (Rubus idaeus L.) using AFLP and microsatellite markers. BMC Plant Biol. 2007, 7: 15-10.1186/1471-2229-7-15. 10.1186/1471-2229-7-15PubMed CentralView ArticlePubMed
- Tanksley S, Ganal M, Prince J, De-Vicente M, Bonierbale M, Broun P, Fulton T, Giovannoni J, Grandillo S, Martin G: High density molecular linkage maps of the tomato and potato genomes. Genetics. 1992, 132: 1141-1160.PubMed CentralPubMed
- Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D, Salvi S, Pindo M, Baldi P, Castelletti S, Cavaiuolo M, Coppola G, Costa F, Cova V, Dal Ri A, Goremykin V, Komjanc M, Longhi S, Magnago P, Malacarne G, Malnoy M, Micheletti D, Moretto M, Perazzolli M, Si-Ammour A, Vezzulli S, et al: The genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet. 2010, 42: 833-839. 10.1038/ng.654. 10.1038/ng.654View ArticlePubMed
- Vezzulli S, Troggio M, Coppola G, Jermakow A, Cartwright D, Zharkikh A, Stefanini M, Grando MS, Viola R, Adam-Blondon A-F, Thomas M, This P, Velasco R: A reference integrated map for cultivated grapevine (Vitis vinifera L.) from three crosses, based on 283 SSR and 501 SNP-based markers. Theor Appl Genet. 2008, 117: 499-511. 10.1007/s00122-008-0794-3. 10.1007/s00122-008-0794-3View ArticlePubMed
- Fernández-Fernández F, Antanaviciute L, Govan C, Sargent D: Development of a multiplexed microsatellite set for fingerprinting red raspberry (Rubus idaeus) germplasm and its transferability to other Rubus species. J Berry Res. 2011, 1: 177-187. 10.3233/BR-2011-019
- Illa E, Lambert P, Quilot B, Audergon J, Dirlewanger E, Howad W, Dondini L, Tartarini S, Lain O, Testolin R: Linkage map saturation, construction, and comparison in four populations of Prunus. J Horticultural Sci Biotechnol ISAFRUIT Spec Issue. 2009, 84: 168-175.
- Eshed Y, Zamir D: An introgression line population of Lycopersicon pennellii in the cultivated tomato enables the identification and fine mapping of yield-associated QTL. Genetics. 1995, 141: 1147-1162.PubMed CentralPubMed
- Sargent DJ, Passey T, Surbanovski N, Lopez Girona E, Kuchta P, Davik J, Harrison R, Passey A, Whitehouse AB, Simpson DW: A microsatellite linkage map for the cultivated strawberry (Fragaria × ananassa) suggests extensive regions of homozygosity in the genome that may have resulted from breeding and selection. Theor Appl Genet. 2012, 124: 1229-1240. 10.1007/s00122-011-1782-6. 10.1007/s00122-011-1782-6View ArticlePubMed
- Silfverberg-Dilworth E, Matasci C, de Weg W, Van Kaauwen MPW, Walser M, Kodde L, Soglio V, Gianfranceschi L, Durel C, Costa F: Microsatellite markers spanning the apple (Malus x domestica Borkh.) genome. Tree Genet Genomes. 2006, 2: 202-224. 10.1007/s11295-006-0045-1.View Article
- International Rice Genome Sequencing Project: The map-based sequence of the rice genome. Nature. 2005, 436: 793-800. 10.1038/nature03895. 10.1038/nature03895View Article
- Shulaev V, Sargent DJ, Crowhurst RN, Mockler T, Folkerts O, Delcher AL, Jaiswal P, Mockaitis K, Liston A, Mane S, Burns P, Davis TM, Slovin JP, Bassil NV, Hellens RP, Evans C, Harkins T, Kodira C, Desany B, Crasta OR, Jensen RV, Allan AC, Michael TP, Setubal JC, Celton J-M, Rees DJG, Williams KP, Holt SH, Ruiz Rojas JJ, Chatterjee M, et al: The genome of woodland strawberry (Fragaria vesca). Nat Genet. 2011, 43: 109-116. 10.1038/ng.740. 10.1038/ng.740PubMed CentralView ArticlePubMed
- Velasco R, Zharkikh A, Troggio M, Cartwright DA, Cestaro A, Pruss D, Pindo M, Fitzgerald LM, Vezzulli S, Reid J, Malacarne G, Iliev D, Coppola G, Wardell B, Micheletti D, Macalma T, Facci M, Mitchell JT, Perazzolli M, Eldredge G, Gatto P, Oyzerski R, Moretto M, Gutin N, Stefanini M, Chen Y-J, Segala C, Davenport C, Demattè L, Mraz A, et al: A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety. PLoS One. 2007, 2: e1326-10.1371/journal.pone.0001326. 10.1371/journal.pone.0001326PubMed CentralView ArticlePubMed
- Zhebentyayeva TN, Swire-Clark G, Georgi LL, Garay L, Jung S, Forrest S, Blenda AV, Blackmon B, Mook J, Horn R, Howad W, Arus P, Main DS, Tomkins JP, Sosinski B, Baird WV, Reighard GL, Abbott AG: A framework physical map for peach, a model Rosaceae species. Tree Genet Genomes. 2008, 4: 745-756. 10.1007/s11295-008-0147-z. 10.1007/s11295-008-0147-zView Article
- Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D, Salvi S, Pindo M, Baldi P, Castelletti S, Cavaiuolo M, Coppola G, Costa F, Cova V, Ri AD, Goremykin V, Komjanc M, Longhi S, Magnago P, Malacarne G, Malnoy M, Micheletti D, Moretto M, Perazzolli M, Si-Ammour A, Vezzulli S, et al: The genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet. 2010, 42: 833-839. 10.1038/ng.654. 10.1038/ng.654View ArticlePubMed
- Argout X, Salse J, Aury J-M, Guiltinan MJ, Droc G, Gouzy J, Allegre M, Chaparro C, Legavre T, Maximova SN, Abrouk M, Murat F, Fouet O, Poulain J, Ruiz M, Roguet Y, Rodier-Goud M, Barbosa-Neto JF, Sabot F, Kudrna D, Ammiraju JSS, Schuster SC, Carlson JE, Sallet E, Schiex T, Dievart A, Kramer M, Gelley L, Shi Z, Bérard A, et al: The genome of Theobroma cacao. Nat Genet. 2010, 43: 101-108. 10.1038/ng.736View ArticlePubMed
- Huang S, Li R, Zhang Z, Li L, Gu X, Fan W, Lucas WJ, Wang X, Xie B, Ni P, Ren Y, Zhu H, Li J, Lin K, Jin W, Fei Z, Li G, Staub J, Kilian A, van der Vossen E, Wu Y, Guo J, He J, Jia Z, Ren Y, Tian G, Lu Y, Ruan J, Qian W, Wang M, et al: The genome of the cucumber, Cucumis sativus L. Nat Genet. 2009, 41: 1275-1281. 10.1038/ng.475. 10.1038/ng.475View ArticlePubMed
- Varshney RK, Chen W, Li Y, Bharti AK, Saxena RK, Schlueter JA, Donoghue MTA, Azam S, Fan G, Whaley AM, Farmer AD, Sheridan J, Iwata A, Tuteja R, Penmetsa RV, Wu W, Upadhyaya HD, Yang S-P, Shah T, Saxena KB, Michael T, McCombie WR, Yang B, Zhang G, Yang H, Wang J, Spillane C, Cook DR, May GD, Xu X, et al: Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat Biotechnol. 2012, 30: 83-89. 10.1038/nbt.2022View Article
- Celton J-M, Christoffels A, Sargent DJ, Xu X, Rees DJG: Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map. BMC Biol. 2010, 8: 155-10.1186/1741-7007-8-155. 10.1186/1741-7007-8-155PubMed CentralView ArticlePubMed
- Chagné D, Crowhurst RN, Troggio M, Davey MW, Gilmore B, Lawley C, Vanderzande S, Hellens RP, Kumar S, Cestaro A, Velasco R, Main DS, Rees JD, Iezzoni A, Mockler T, Wilhelm L, Van de Weg E, Gardiner SE, Bassil NV, Peace C: Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple. PLoS One. 2012, 7: e31745-10.1371/journal.pone.0031745. 10.1371/journal.pone.0031745.t004PubMed CentralView ArticlePubMed
- Myles S, Chia J-M, Hurwitz B, Simon C, Zhong GY, Buckler E, Ware D: Rapid genomic characterization of the genus vitis. PLoS One. 2010, 5: e8219-10.1371/journal.pone.0008219. 10.1371/journal.pone.0008219PubMed CentralView ArticlePubMed
- Chagné D, Gasic K, Crowhurst RN, Han Y, Bassett HC, Bowatte DR, Lawrence TJ, Rikkerink EHA, Gardiner SE, Korban SS: Development of a set of SNP markers present in expressed genes of the apple. Genomics. 2008, 92: 353-358. 10.1016/j.ygeno.2008.07.008.View ArticlePubMed
- Konieczny A, Ausubel FM: A procedure for mapping Arabidopsis mutations using co-dominant ecotype-specific PCR-based markers. Plant J. 1993, 4: 403-410. 10.1046/j.1365-313X.1993.04020403.x.View ArticlePubMed
- Antanaviciute L, Fernández-Fernández F, Jansen J, Banchi E, Evans KM, Viola R, Velasco R, Dunwell JM, Troggio M, Sargent DJ: Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array. BMC Genomics. 2012, 13: 203-10.1186/1471-2164-13-203. 10.1186/1471-2164-13-203PubMed CentralView ArticlePubMed
- Huo N, Garvin DF, You FM, McMahon S, Luo M-C, Gu YQ, Lazo GR, Vogel JP: Comparison of a high-density genetic linkage map to genome features in the model grass Brachypodium distachyon. Theor Appl Genet. 2011, 123: 455-464. 10.1007/s00122-011-1598-4. 10.1007/s00122-011-1598-4View ArticlePubMed
- Pindo M, Vezzulli S, Coppola G, Cartwright DA, Zharkikh A, Velasco R, Troggio M: SNP high-throughput screening in grapevine using the SNPlex™ genotyping system. BMC Plant Biol. 2008, 8: 12-10.1186/1471-2229-8-12. 10.1186/1471-2229-8-12PubMed CentralView ArticlePubMed
- Verde I, Bassil NV, Scalabrin S, Gilmore B, Lawley CT, Gasic K, Micheletti D, Rosyara UR, Cattonaro F, Vendramin E, Main DS, Aramini V, Blas AL, Mockler T, Bryant DW, Wilhelm L, Troggio M, Sosinski B, Aranzana MJ, Arus P, Iezzoni A, Morgante M, Peace C: Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm. PLoS One. 2012, 7: e35668-10.1371/journal.pone.0035668. 10.1371/journal.pone.0035668.t003PubMed CentralView ArticlePubMed
- Micheletti D, Troggio M, Zharkikh A, Costa F, Malnoy M, Velasco R, Salvi S: Genetic diversity of the genus Malus and implications for linkage mapping with SNPs. Tree Genetics & Genomes. 2011, 7: 857-868. 10.1007/s11295-011-0380-8. 10.1007/s11295-011-0380-8View Article
- Miller MR, Dunham JP, Amores A, Cresko WA, Johnson EA: Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 2007, 17: 240-248. 10.1101/gr.5681207. 10.1101/gr.5681207PubMed CentralView ArticlePubMed
- Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA: Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. PLoS One. 2008, 3: e3376-10.1371/journal.pone.0003376. 10.1371/journal.pone.0003376.g003PubMed CentralView ArticlePubMed
- Barchi L, Lanteri S, Portis E, Acquadro A, Valè G, Toppino L, Rotino GL: Identification of SNP and SSR markers in eggplant using RAD tag sequencing. BMC Genomics. 2011, 12: 304-10.1186/1471-2164-12-304. 10.1186/1471-2164-12-304PubMed CentralView ArticlePubMed
- Chutimanitsakun Y, Nipper RW, Cuesta-Marcos A, Cistué L, Corey A, Filichkina T, Johnson EA, Hayes PM: Construction and application for QTL analysis of a Restriction Site Associated DNA (RAD) linkage map in barley. BMC Genomics. 2011, 12: 4-10.1186/1471-2164-12-4. 10.1186/1471-2164-12-4PubMed CentralView ArticlePubMed
- Scaglione D, Acquadro A, Portis E, Tirone M, Knapp SJ, Lanteri S: RAD tag sequencing as a source of SNP markers in Cynara cardunculus L. BMC Genomics. 2012, 13: 3-10.1186/1471-2164-13-3. 10.1186/1471-2164-13-3PubMed CentralView ArticlePubMed
- Pfender WF, Saha MC, Johnson EA, Slabaugh MB: Mapping with RAD (restriction-site associated DNA) markers to rapidly identify QTL for stem rust resistance in Lolium perenne. Theor Appl Genet. 2011, 122: 1467-1480. 10.1007/s00122-011-1546-3. 10.1007/s00122-011-1546-3View ArticlePubMed
- Elshire RJ, Glaubitz JC, Sun Q, Poland J, Kawamoto K, Buckler ES, Mitchell SE: A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLoS One. 2011, 6: e19379-10.1371/journal.pone.0019379. 10.1371/journal.pone.0019379.g006PubMed CentralView ArticlePubMed
- Heslot N, Yang H-P, Sorrells ME, Jannink J-L: Genomic Selection in Plant Breeding: A Comparison of Models. Crop Sci. 2012, 52: 146-10.2135/cropsci2011.06.0297View Article
- Castillo NRF, Reed BM, Graham J, Fernández-Fernández F, Bassil NV: Microsatellite markers for raspberry and blackberry. J Am Soc Hortic Sci. 2010, 135: 271-278.
- Graham J, Smith K, Woodhead M, Russell J: Development and use of simple sequence repeat SSR markers in Rubus species. Mol Ecol Notes. 2002, 2: 250-252. 10.1046/j.1471-8286.2002.00203.x.View Article
- Graham J, Smith K, Mackenzie K, Jorgenson L, Hackett C, Powell W: The construction of a genetic linkage map of red raspberry (Rubus idaeus subsp. idaeus) based on AFLPs, genomic-SSR and EST-SSR markers. Theor Appl Genet. 2004, 109: 740-749. 10.1007/s00122-004-1687-8. 10.1007/s00122-004-1687-8View ArticlePubMed
- Graham J, Smith K, Tierney I, Mackenzie K, Hackett C: Mapping gene H controlling cane pubescence in raspberry and its association with resistance to cane botrytis and spur blight, rust and cane spot. Theor Appl Genet. 2006, 112: 818-831. 10.1007/s00122-005-0184-z. 10.1007/s00122-005-0184-zView ArticlePubMed
- Woodhead M, McCallum S, Smith K, Cardle L, Mazzitelli L, Graham J: Identification, characterisation and mapping of simple sequence repeat (SSR) markers from raspberry root and bud ESTs. Mol Breed. 2008, 22: 555-563. 10.1007/s11032-008-9198-y. 10.1007/s11032-008-9198-yView Article
- Bushakra JM, Stephens MJ, Atmadjaja AN, Lewers K, Symonds VV, Udall JA, Chagné D, Buck EJ, Gardiner SE: Construction of black (Rubus occidentalis) and red (R. idaeus) raspberry linkage maps and their comparison to the genomes of strawberry, apple, and peach. Theor Appl Genet. 2012, 125: 311-327. 10.1007/s00122-012-1835-5. 10.1007/s00122-012-1835-5View ArticlePubMed
- Graham J, Hackett C, Smith K, Woodhead M, Mackenzie K, Tierney I, Cooke D, Bayer M, Jennings N: Towards an understanding of the nature of resistance to Phytophthora root rot in red raspberry. Theor Appl Genet. 2011, 123: 585-601. 10.1007/s00122-011-1609-5. 10.1007/s00122-011-1609-5View ArticlePubMed
- Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH: Stacks: building and genotyping Loci de novo from short-read sequences. G3 (Bethesda). 2011, 1: 171-182. 10.1534/g3.111.000240View Article
- Pattison JA, Samuelian SK, Weber C: Inheritance of Phytophthora root rot resistance in red raspberry determined by generation means and molecular linkage analysis. Theor Appl Genet. 2007, 115: 225-236. 10.1007/s00122-007-0558-5. 10.1007/s00122-007-0558-5View ArticlePubMed
- Spiller M, Linde M, Hibrand-Saint Oyant L, Tsai C-J, Byrne DH, Smulders MJM, Foucher F, Debener T: Towards a unified genetic map for diploid roses. Theor Appl Genet. 2010, 122: 489-500. 10.1007/s00122-010-1463-xView ArticlePubMed
- Dale A, Moore PP, McNicol RJ, Sjulin TM, Burmistrov LA: Genetic Diversity of Red Raspberry Varieties throughout the-World. J Am Soc Hortic Sci. 1993, 118: 119-129.
- Ward J, Price J, Clement MJ, Schatz MC, Weber C, Swanson J, Bodily P, Lewers K, Fernández-Fernández F, Burns P, Velasco R, Sargent DJ, Udall J: Plant and Animal Genome XX. A Draft Assembly AndAnalysis Of The Highly Heterozygous Diploid Red Raspberry Genome. https://pag.confex.com/pag/xx/webprogram/Paper1359.html
- Jennings D: Balanced lethals and polymorphism in Rubus idaeus. Heredity. 1967, 22: 465-479.View Article
- Jennings D: Aberrant segregation of a gene in the raspberry and its association with effects on seed development. Heredity. 1972, 29: 83-90. 10.1038/hdy.1972.66. 10.1038/hdy.1972.66View Article
- Keep E: Incompatibility in Rubus with Special Reference to R. Idaeus L. Can J Genet Cytol. 1968, 10: 253-262.View Article
- CBSUed: GBS barcode splitter. 2012, http://sourceforge.net/projects/gbsbarcode/
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10: R25-10.1186/gb-2009-10-3-r25. 10.1186/gb-2009-10-3-r25PubMed CentralView ArticlePubMed
- Voorrips RE: MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002, 93: 77-78. 10.1093/jhered/93.1.77.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.