Analysis of human meiotic recombination events with a parent-sibling tracing approach
BMC Genomics volume 12, Article number: 434 (2011)
Meiotic recombination ensures that each child inherits distinct genetic materials from each parent, but the distribution of crossovers along meiotic chromosomes remains difficult to identify. In this study, we developed a parent-sibling tracing (PST) approach from previously reported methods to identify meiotic crossover sites of GEO GSE6754 data set. This approach requires only the single nucleotide polymorphism (SNP) data of the pedigrees of both parents and at least two of children.
Compared to other SNP-based algorithms (identity by descent or pediSNP), fewer uninformative SNPs were derived with the use of PST. Analysis of a GEO GSE6754 data set containing 2,145 maternal and paternal meiotic events revealed that the pattern and distribution of paternal and maternal recombination sites vary along the chromosomes. Lower crossover rates near the centromeres were more prominent in males than in females. Based on analysis of repetitive sequences, we also showed that recombination hotspots are positively correlated with SINE/MIR repetitive elements and negatively correlated with LINE/L1 elements. The number of meiotic recombination events was positively correlated with the number of shorter tandem repeat sequences.
The advantages of the PST approach include the ability to use only two-generation pedigrees with two siblings and the ability to perform gender-specific analyses of repetitive elements and tandem repeat sequences while including fewer uninformative SNP regions in the results.
Meiotic recombination is important for generating genetic diversity. Meiotic recombination occurs between homologous chromosomes during chiasmata formation, a process that is required for normal chromosomal segregation during meiosis. While variation in recombination rates is a ubiquitous feature of the human genome , the mechanisms governing the distribution of crossovers along meiotic chromosomes remain largely unclear, with the exception of the recent discovery that Prdm9 is involved in the activation of mammalian recombination hotspots [2–5]. Sex-specific effects [6–8] on regional meiotic recombination have been described. Recombination rates are approximately 1.7-fold higher in female meiosis than in male meiosis. In addition, crossover rates in males are 5-fold lower near centromeres but 10-fold higher near telomeres compared with those in females . These differences could be related to sex-specific patterns of initiation of synapses between homologs. For example, synaptonemal complex lengths are shorter in males than in females , and synapses appear preferentially in subtelomeric regions in males .
Meiotic recombination events can be measured directly or indirectly . Physical crossovers between homologous chromosomes, indicating meiotic recombination events, can be directly observed at specific time points during spermatogenesis . Alternatively, crossovers may be analyzed directly in cytogenetic analysis by labeling meiosis-related proteins, such as MLH1 . Despite the unequivocal value of direct analysis, these techniques are labor-intensive and precision is limited. Therefore, most analyses of human recombination currently rely on indirect approaches such as genetic linkage analysis of human pedigrees. This involves tracking the inheritance of alleles at multiple polymorphic markers (short tandem repeat polymorphisms, STRP; or single nucleotide polymorphisms, SNP) along the chromosomes across generations [15–17].
Molecular markers in individuals with known pedigrees can be traced to an ancestral identity using either the identity by descent (IBD) method  or the identity by state (IBS) method . Two alleles at a particular locus in the progeny are assumed to be identical if they are derived from an identical locus in a common ancestor. The IBD method requires knowledge of the genotypes of three generations to determine if the DNA segments are identical by descent from each generation. In the IBD method, shared results between each child and his/her paternal and maternal grandparents are analyzed separately. A paternal recombination event is detected when the IBD sharing "switches" from one paternal grandparent to the other. This application can be applied in the same manner for the maternal side. For instance, meiotic events can be switched between 2 SNP sites (Figure 1A and Additional File 1A). Therefore, application of the IBD method requires the pedigrees of three generations . The IBS method was used to detect meiotic recombination sites between individuals by analyzing allele sharing between siblings . Recently, Ting et al. also proposed another method for identifying meiotic recombination patterns based on two-generation pedigrees (pediSNP) . In the pediSNP method, genotypes of two children are analyzed and compared with the genotype of one parent .
Based on the distribution of SNPs in both parents and multiple siblings, meiotic cross sites in human chromosomes can be identified. This method was first proposed by Coop et al. in 2008 to trace the "informative markers" transmitted by the father to each offspring . They defined the "informative markers" as SNPs that are heterozygous in the father and homozygous in the mother. In 2009, Chowdhury et al. used two datasets, namely, the Autism Genetic Research Exchange (AGRE) and the Framingham Heart Study (FHS), to characterize the variation in recombination phenotypes . They analyzed sex differences and recombination jungles across the human genome, and described the gene loci associated with recombination phenotypes .
In this study, we have used a parent-sibling tracing (PST) approach, which was derived from two previous reports [6, 20], to analyze the Genomic Medicine Research Core Laboratory, Taiwan (GMRCL) dataset of Affymetrix SNP6.0 arrays which consists of 900 K SNP markers and the GSE6754 dataset from Gene Expression Omnibus (GEO) , which consists of 853 families. Our analyses of this dataset of 2,145 meioses resulted in a 1-Mb-resolution recombination map. In addition, we were able to characterize the relationships between recombination sites and repetitive elements as well as the relationships between recombination sites and tandem repeats sequences.
Comparison of two methods of detecting meiotic recombination sites
We used the GMRCL dataset of 900 K SNPs as a reference standard for comparison between the PST approach (Figure 1B) and previous approaches such as the IBD method  (Figure 1A). The code calling schema of PST is depicted in Figure 1B and Additional File 1B. Using chromosome 1 as an example, IBD analysis in both children could define the sites of meiotic recombination for paternal gametes. In child 1 and child 2, we observed 1 and 4 meiotic recombination events on their paternal gametes, respectively (Figures 2A and 2B). Using the PST approach, we could analyze the paternal genotypes for both children. When the paternal genotype was Aa and the maternal genotype was AA, children with Aa and AA were coded as "0: not identical between siblings". If both children were Aa and Aa [or (AA and AA)], they were coded as "1: identical between siblings" (identical genotype origin for both children). The PST approach (Figure 2C) detected the recombination sites of the combinatorial results for child 1 and child 2 as determined by IBD (Figures 2A and 2B). These results indicate that, using the SNP information of only two generations, PST can identify the origin of the recombination site. For the IBD method, information from three generations is required to determine whether the origin is from the grandfather or the grandmother. The 43 recombination sites identified in the GMRCL dataset using the IBD and PST methods are shown in Additional file 2.
Comparison of the code calling schemas between the IBD and PST methods showed that IBD identified fewer genotyping combination calls than the PST approach. For instance, when we analyzed the recombination sites in the 100-kb genomic region located at 114.6 Mb on chromosome 1 (Figures 2B and 2C, indicated with the arrow), the numbers of uninformative SNPs in the recombination site for the IBD and PST methods were 22 and 19, respectively (Figures 2D and 2E), resulting in uninformative regions of 54 kb for the IBD method (Figure 2D) and 48 kb for the PST approach (Figure 2E), respectively.
The use of the IBD and PST methods in the GMRCL sample led to the identification of 43 paternal recombination sites in child 1 and child 2. The mean numbers of uninformative SNP for the 43 paternal recombination sites were 71.2 and 36.7 for the IBD and PST methods, respectively (Table 1). The mean sizes of the uninformative regions for the 43 paternal recombination sites were 253 ± 349 kb (mean ± SD) with 110 (58 - 336) in Q2 (Q1-Q3) for the IBD method, and 167 ± 391 kb with 60 (23 - 157) in Q2 (Q1-Q3) for the PST approach (Table 1). The paired t-test showed that the PST approach resulted in significantly shorter uninformative regions than the IBD method (P < 10-10).
Analysis of the GEO dataset GSE6754 containing 11,000 SNP markers
The Affymetrix Human Mapping 10 K 2.0 Arrays (containing 10 K SNPs) were used to map autism susceptibility loci in the GSE6754 dataset . Three three-generation pedigrees (family ID: 3117, 3180, 8071) were selected to compare the usefulness of the IBD and PST methods. Since the 10 K 2.0 array covered fewer SNPs, the mean size of uninformative regions were about 20-fold higher and the number of uninformative SNPs was approximately 6-fold lower than those of SNP 6.0 Arrays. Compared to other approaches, the PST approach identified fewer uninformative SNPs and smaller uninformative genomic regions (Table 1).
In the 3864 arrays (853 families, 1721 parents, 2145 siblings) analyzed using the PST approach, the mean number of maternal recombination events was approximately 1.67-fold higher than that of paternal origin, with the highest value observed on chromosome 17 (2.00-fold) and the lowest on chromosome 22 (1.32-fold) (Table 2). The distribution of recombination events of paternal origin (mean 23.8 ± 4.1, median 22.5) and maternal origin (mean 39.5 ± 5.7, median 38.0) is presented in Figure 3A. The numbers of recombination events of each chromosome (2,145 maternal and paternal meioses) are summarized in Table 2.
In order to identify the regions with the highest and the lowest number of recombination events, we scanned the entire human genome. We first divided the genome into 2,765 bins of 1-Mb each. We then identified the number of recombination sites in each bin separately for female and male meioses. The results obtained from chromosome 1 are shown in Figure 3B (see the Additional file 3 for the results on other chromosomes). We also compared the recombination maps obtained from dataset GSE6754 with Marshfield map  (Figure 3B, middle panel), and Icelandic map  (Figure 3B, lower panel). The correlation coefficients between the data in GSE6754 map and Icelandic map and Marshfield map were r = 0.49 and r = 0.31, respectively.
To test the hypothesis that recombination rates are lower near the centromere but higher near the telomeres in men, we analyzed the correlation between the distances from the recombination sites to the centromere and the number of recombination sites. We found significant correlations (P < 0.00001) on chromosomes 1q, 2p, 3q, 4q, 5p, 5q, 6p, 6q, 7q, 8q, 9p, 9q, 10p, 10q, 11q, 12p, 12q, 16q, 18q, 19q, 20q, 21q in men. In contrast, similar correlations were found only on chromosome 1q and 6q in women (Table 3). For instance, the slope of correlation was significant in p arm of chromosome 5 in men but not in women (Figure 3C). On the other hand, both sexes showed significant correlations in the number of recombination sites near the telomere in the q arm. SNP information was not available for the p arm of chromosomes 13, 14, 15, 21, and 22.
Relation between the recombination site and repetitive elements
We compiled 57 major repetitive element classes that were characterized by RepeatMasker . Twenty-three repetitive-element classes were identified in more than 6,000 sites in the human genome. After downloading the location information of the human CpG islands from the UCSC database , we divided the genome into 2,765 bins of 1-Mb each and determined the number of repetitive-element sites in each bin. Using the 53,487 repetitive-elements on chromosome 1 as an example, we depicted the distribution of SINE/MIR (green lines in Figure 4A) and LINE/L1 sites (green lines in Figure 4C). In addition, the distributions of meiotic recombination sites (both paternal and maternal combined) are shown as blue lines. In each 1-Mb bin, we also analyzed the correlation between the number of meiotic recombination sites and the number of SINE/MIR (plotted in Figure 4B) and LINE/L1 sites (plotted in Figure 4D). The correlation coefficients between recombination sites and SINE/MIR and the correlation coefficients between recombination sites and LINE/L1 were 0.23 (P = 0.0005) and 0.29 (P = 0.00001), respectively.
The correlation coefficients and the corresponding P values for each of the 23 repetitive-elements, CpG island sites, and meiotic recombination sites are summarized in Table 4. The repetitive elements SINE/MIR, DNA/hAT-Charlie, DNA/hAT, LINE/L2, SINE/Alu, DNA/hAT-Tip100, DNA/hAT-Blackjack were positively correlated with meiotic recombination sites. In contrast, repetitive elements, which included LINE/L1, LTR/ERVK, and Low complexity (Table 4), showed negative correlation with meiotic recombination sites. In general, we found no significant differences in the distribution of maternal and paternal recombination sites. The scatter plots of the correlation analyses of repetitive elements SINE/MIR and LINE/L1 in the entire human genome are shown in Figure 5.
Relation between recombination sites and the length of tandem repeat sequences
Repetitive elements, including tandem repeat sequences, are distributed widely throughout the genome. Tandem DNA repeats are defined as a repeated pattern of two or more nucleotides. The pattern can range in length from 2 to ~100 base pairs (bp) (for example (CATG)n in a genomic region) . In this study, a total 947,696 tandem repeats sequences were identified using the Tandem Repeats Finder . The length distribution of the tandem repeats are shown in Figure 6A, where the 25, 50 and 75 percentile of the length of the tandem repeats were 4, 15 and 24 bp, respectively.
We divided the genome into 2,765 bins of 1-Mb each and determined the number of tandem repeats in each bin. We then analyzed the correlation between the number of maternal meiotic recombination sites and the number of tandem repeats (Figure 6B); the correlation coefficient was 0.11 (P < 2 × 10-7). Furthermore, we grouped tandem repeats into 4 quartiles by the length of these repeat sequences, as (Q1) 1-4, (Q2) 5-15, (Q3) 16-24 and (Q4) > 25 bp. The correlation coefficients between recombination sites and the 4 quartiles were 0.25 (P < 1 × 10-16), 0.11 (P < 2 × 10-8), 0.04 (P = 0.08) and 0.03 (P = 0.16), respectively (Figures 6C-F). These results showed that the maternal meiotic recombination sites were positively correlated with shorter repeat sequences and less correlated with longer repeat sequences. Similarly, we analyzed the correlation between the number of paternal meiotic recombination sites and the number of tandem repeats, with r = 0.12 (P < 5 × 10-9). The correlation coefficients for the 4 subgroups were 0.19 (P < 1 × 10-16), 0.09 (P < 4 × 10-6), 0.09 (P < 3 × 10-6) and 0.05 (P = 0.004), respectively (Additional file 4).
In this study, we use a PST approach to analyze the sites of meiotic recombination in two-generation pedigrees. We first tested it on a GMRCL dataset of the Affymetrix SNP 6.0 array consisting of 900 K SNP markers, followed by a 10 K GSE6754 dataset. In the GSE6754 dataset, which was previously used for mapping autism risk loci, most data are based on two-generation pedigrees (1,168 families) as this dataset contains only 29 three-generation pedigrees. Although the PST approach requires only pedigrees of two generations, it requires information from at least two siblings. The use of SNPs as genetic markers to identify recombination sites can often result in the inclusion of uninformative regions. However, the size of uninformative regions that result from the PST approach is significantly lower than that seen from the use of the IBD method (Table 1).
We next assessed whether crossovers may alter the DNA sequence by causing de novo mutations at sites of recombination. Given that the uninformative regions of PST were relatively small, eight recombination events were identified with sizes of less than 2 kb. Notably, we did not identify any sequence variation at these recombination points (data not shown). This observation needs further validation by sequencing more datasets.
The average number of recombination events observed with the PST approach was similar to the findings of other studies. The distribution of recombination events showed a mean value of 23.8 in paternal origin and 39.5 in maternal origin. Chowdhury et al reported the genome-wide recombination events in paternal origin ranged from 25.9 to 27.3 while in maternal origin ranged from 38.4 to 47.2 . Another study by Cheung et al demonstrated that the mean numbers of recombination events were 24.0 in male meiosis and 38.4 in female meiosis .
In an indirect pedigree analysis using SNPs as genetic markers, Cheung et al  reported that several recombination events appeared to occur nearer to the telomeres. Using the PST approach, we analyzed the distance between the recombination site and the centromere for each gender separately (Table 3). In male meiosis, most of the crossovers are located in the q arms, and the number of recombination events increased significantly when moving from centromeres to telomeres. Interestingly, we observed fewer recombination events in the p arms of female chromosomes, resulting in the male-to-female ratio of 1.67 (Table 2). In women, only chromosomes 1q and 6q showed a significant, positive correlation between the number of recombination sites and distance from the centromere (Table 3).
To determine the extensive sequence-context variation in recombination hotspots, Myers et al. constructed a fine-scale map of recombination rates and hotspots across the human genome based on genotypes of 1.6 million SNPs in three sample populations, including 24 European Americans, 23 African Americans, and 24 Han Chinese . The authors reported an increase of recombination hotspots in the regions surrounding coding genes, though these were preferentially located outside the transcribed regions. The analysis of the relationships between recombination hotspots and repeat elements indicated that L2 and THE1B are unusually high in hotspots, whereas L1 elements are low . In this study, we identified a similar pattern of frequent hotspots in L2 as opposed to the low number of hotspots in L1 elements (Table 4). Of note, results showed that the majority of the hotspots in both paternal and maternal meioses were similar.
Human chromosomes are characterized by prominent differences in the pattern and rate of meiotic recombination events. Significant inter-individual and gender differences also exist. The major advantages of the PST approach include the use of two-generation pedigrees with two or more siblings, fewer uninformative SNP regions, and the ability to perform gender-specific analyses of recombination hotspots (using databases derived from high density arrays such as Affymetrix SNP6.0) and repetitive elements. An accurate determination of meiotic crossovers using this approach may prove useful to explore the biology of human chromosomes.
Identification of meiotic recombination sites
In the present study we compared different SNP-based methods for detecting recombination points, i.e. IBD (Figure 1A) , and PST (Figure 1B). The code calling schema for the IBD and PST methods are depicted in the Additional Files 1A and 1B. The meiosis recombination sites were exported from the PSTReader, a MATLAB-based program (version 7.9). The PSTReader was used to define the recombination sites for the IBD and PST methods. The MATLAB source code, example data, and a standalone application can be freely downloaded from: http://www.mcu.edu.tw/department/biotec/en_page/PSTReader/index.htm.
In this study, a set of the Affymetrix Genome-Wide Human SNP array 6.0 (GMRCL dataset) consisting of 900 K SNP markers was used as a template. DNA was extracted from blood collected in a study that was approved by the Chang Gung Memorial Hospital Institute Review Board (IRB#99-0229B). SNP genotyping was performed using the SNP array 6.0 (Affymetrix, Santa Clara, CA, http://www.affymetrix.com) at the Genomic Medicine Research Core Laboratory (GMRCL), Chang Gung Memorial Hospital. The GMRCL dataset includes the genotypes of an anonymous family consisting of the paternal/maternal grandfather, paternal/maternal grandmother, father, mother and two children. The identity-delinked SNP genotypes and pedigree information for each member can be downloaded from http://www.mcu.edu.tw/department/biotec/en_page/PSTReader/index.htm.
The GSE6754 dataset was downloaded from the Gene Expression Omnibus (GEO), and contains information from 6,971 Affymetrix GeneChip Human Mapping 10 K 2.0 Arrays. Data from parental and sibling genotypes are available for 1,168 families in this dataset. To increase analytic accuracy, we excluded samples with genotyping call rates less than 90%, those lacking pedigree information, and individuals with chromosomal abnormalities (n = 22) . The remaining 3,864 arrays of 853 families (1,721 parents and 2,145 siblings) were included in the PST analysis of recombination events in human meiosis. The details on individual, families, and pedigrees are provided in Additional file 5.
Mapping of the recombination sites, repetitive elements and tandem repeat sequences
The recombination sites and repetitive elements were mapped using the hg18 (NCBI Build 36) human reference assembly. The classes and characters of major repetitive elements were downloaded from RepeatMasker , and the tandem repeat sequences were identified using the Tandem Repeats Finder program . Correlations between recombination sites and repetitive elements or tandem repeat sequences were analyzed with MATLAB (version 7.9). To assess the distribution and correlation between recombination sites and repetitive elements or tandem repeat sequences, we calculated the number of recombination sites (or repetitive elements or tandem repeat sequences) using a window width set to 1 Mb. We divided the human genome into 2765 bins of 1 Mb each and determined the number of recombination sites in each bin. The distance for each 1 Mb window was calculated based on SNP positions according to the Affymetrix data, assuming a constant crossover rate between two adjacent SNP markers. To calculate the correlation coefficients between the recombination in GSE6754 map, Icelandic map and Marshfield map, we divided the human genome into 2765 bins of 1 Mb each and determined the number of recombination sites in each bin, as described above.
identity by descent
identity by state
simple tandem repeat polymorphisms
single nucleotide polymorphisms.
Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P: Molecular Biology of THE CELL. 2008, New York: Garland Science, 5
Martinez-Perez E, Colaiacovo MP: Distribution of meiotic recombination events: talking to your neighbors. Curr Opin Genet Dev. 2009, 19: 105-112. 10.1016/j.gde.2009.02.005.
Parvanov ED, Petkov PM, Paigen K: Prdm9 controls activation of mammalian recombination hotspots. Science. 2010, 327: 835-10.1126/science.1181495.
Baudat F, Buard J, Grey C, Fledel-Alon A, Ober C, Przeworski M, Coop G, de Massy B: PRDM9 is a major determinant of meiotic recombination hotspots in humans and mice. Science. 2010, 327: 836-840. 10.1126/science.1183439.
Myers S, Bowden R, Tumian A, Bontrop RE, Freeman C, MacFie TS, McVean G, Donnelly P: Drive against hotspot motifs in primates implicates the PRDM9 gene in meiotic recombination. Science. 2010, 327: 876-879. 10.1126/science.1182363.
Coop G, Wen X, Ober C, Pritchard JK, Przeworski M: High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science. 2008, 319: 1395-1398. 10.1126/science.1151851.
Fledel-Alon A, Wilson DJ, Broman K, Wen X, Ober C, Coop G, Przeworski M: Broad-scale recombination patterns underlying proper disjunction in humans. PLoS Genet. 2009, 5: e1000658-10.1371/journal.pgen.1000658.
Kong A, Thorleifsson G, Gudbjartsson DF, Masson G, Sigurdsson A, Jonasdottir A, Walters GB, Gylfason A, Kristinsson KT, Gudjonsson SA, et al: Fine-scale recombination rate differences between sexes, populations and individuals. Nature. 2010, 467: 1099-1103. 10.1038/nature09525.
Buard J, de Massy B: Playing hide and seek with mammalian meiotic crossover hotspots. Trends Genet. 2007, 23: 301-309. 10.1016/j.tig.2007.03.014.
Tease C, Hulten MA: Inter-sex variation in synaptonemal complex lengths largely determine the different recombination rates in male and female germ cells. Cytogenet Genome Res. 2004, 107: 208-215. 10.1159/000080599.
Brown PW, Judis L, Chan ER, Schwartz S, Seftel A, Thomas A, Hassold TJ: Meiotic synapsis proceeds from a limited number of subtelomeric sites in the human male. Am J Hum Genet. 2005, 77: 556-566. 10.1086/468188.
Lynn A, Ashley T, Hassold T: Variation in human meiotic recombination. Annu Rev Genomics Hum Genet. 2004, 5: 317-349. 10.1146/annurev.genom.4.070802.110217.
Jeffreys AJ, Murray J, Neumann R: High-resolution mapping of crossovers in human sperm defines a minisatellite-associated recombination hotspot. Mol Cell. 1998, 2: 267-273. 10.1016/S1097-2765(00)80138-0.
Sun F, Trpkov K, Rademaker A, Ko E, Martin RH: Variation in meiotic recombination frequencies among human males. Hum Genet. 2005, 116: 172-178. 10.1007/s00439-004-1215-6.
Cheung VG, Burdick JT, Hirschmann D, Morley M: Polymorphic variation in human meiotic recombination. Am J Hum Genet. 2007, 80: 526-530. 10.1086/512131.
Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, Richardsson B, Sigurdardottir S, Barnard J, Hallbeck B, Masson G, et al: A high-resolution recombination map of the human genome. Nat Genet. 2002, 31: 241-247.
Matise TC, Sachidanandam R, Clark AG, Kruglyak L, Wijsman E, Kakol J, Buyske S, Chui B, Cohen P, de Toma C, et al: A 3.9-centimorgan-resolution human single-nucleotide polymorphism linkage map and screening set. Am J Hum Genet. 2003, 73: 271-284. 10.1086/377137.
Roberson ED, Pevsner J: Visualization of shared genomic regions and meiotic recombination in high-density SNP data. PLoS One. 2009, 4: e6711-10.1371/journal.pone.0006711.
Ting JC, Roberson ED, Currier DG, Pevsner J: Locations and patterns of meiotic recombination in two-generation pedigrees. BMC Med Genet. 2009, 10: 93-10.1186/1471-2350-10-93.
Chowdhury R, Bois PR, Feingold E, Sherman SL, Cheung VG: Genetic analysis of variation in human meiotic recombination. PLoS Genet. 2009, 5: e1000648-10.1371/journal.pgen.1000648.
Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R: NCBI GEO: mining tens of millions of expression profiles--database and tools update. Nucleic Acids Res. 2007, 35: D760-765. 10.1093/nar/gkl887.
Szatmari P, Paterson AD, Zwaigenbaum L, Roberts W, Brian J, Liu XQ, Vincent JB, Skaug JL, Thompson AP, Senman L, et al: Mapping autism risk loci using genetic linkage and chromosomal rearrangements. Nat Genet. 2007, 39: 319-328. 10.1038/ng1985.
Broman KW, Murray JC, Sheffield VC, White RL, Weber JL: Comprehensive human genetic maps: individual and sex-specific variation in recombination. Am J Hum Genet. 1998, 63: 861-869. 10.1086/302011.
Jurka J, Smit AFA: "Reference collections of human and rodent repetitive elements". Co-editor of the mammalian databases. 1994, [http://www.girinst.org/]
Kuhn RM, Karolchik D, Zweig AS, Wang T, Smith KE, Rosenbloom KR, Rhead B, Raney BJ, Pohl A, Pheasant M, et al: The UCSC Genome Browser Database: update 2009. Nucleic Acids Res. 2009, 37: D755-761. 10.1093/nar/gkn875.
Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999, 27: 573-580. 10.1093/nar/27.2.573.
Myers S, Bottolo L, Freeman C, McVean G, Donnelly P: A fine-scale map of recombination rates and hotspots across the human genome. Science. 2005, 310: 321-324. 10.1126/science.1117196.
Lee YS, Chao A, Chao AS, Chang SD, Chen CH, Wu WM, Wang TH, Wang HS: CGcgh: a tool for molecular karyotyping using DNA microarray-based comparative genomic hybridization (array-CGH). J Biomed Sci. 2008, 15: 687-696. 10.1007/s11373-008-9275-6.
This study was supported by grants: NSC 97-2320-B-130-001-MY2 (to YS Lee), NSC 98-3112-B-001-027 from the National Research Program for Genomic Medicine (to YS Lee and CH Chen); DOH99-TD-C-111-006 (to A Chao and TH Wang) and DOH99-TD-I-111-TM013 (to TH Wang) from the Department of Health, Taiwan; and CMRPG340463 (to TH Wang) from the Chang Gung Medical Foundation. The authors wish to thank Dr. Chi-Nue Tsai (Chang Gung University) and Dr. Shih-Tien T. Wang of Children's Hospital of Wisconsin, Milwaukee, for helpful discussion.
The authors declare that they have no competing interests.
YSL, AC, SMW and THW designed the study and prepared the manuscript. YSL, TC and CHC carried out the statistical analysis. YSL and THW carried out the Affymetrix microarray experiments, obtained the clinical materials and analyzed clinical information. All authors read and approved the final manuscript.
Yun-Shien Lee, Angel Chao contributed equally to this work.
Electronic supplementary material
Additional file 1:Calling schema. Tables with calling schema for analyzing meiosis, identity by descent (IBD) and parent-sibling tracing (PST). (DOC 59 KB)
Additional file 2:Paternal recombination site along the chromosomes. The paternal recombination site of child 1 and 2 of GMRCL dataset (CH1 and CH2, defined in Figure 1) along chromosomes are demonstrated in figures by the identity by descent (IBD) and parent-sibling tracing (PST) methods. (DOC 146 KB)
Additional file 3:Distribution of recombination events. Figures illustrating the distribution of the 2,145 paternal and 2,145 maternal recombination events in human for each chromosome. (DOC 343 KB)
Additional file 4:Correlation between tandem repeats sequences and paternal recombination sites. Distribution of the length of the tandem repeats sequences and scatter plot of the number of paternal recombination sites with the tandem repeats sequences. (DOC 236 KB)
Additional file 5:Detailed information of GSE6754 dataset. Family ID, individual ID and the pedigree relative of the analyzed 3864 samples which were downloaded from GEO, GSE 6754. (XLS 382 KB)
Authors’ original submitted files for images
About this article
Cite this article
Lee, YS., Chao, A., Chen, CH. et al. Analysis of human meiotic recombination events with a parent-sibling tracing approach. BMC Genomics 12, 434 (2011). https://doi.org/10.1186/1471-2164-12-434