- Research article
- Open Access
Identification of ovule transcripts from the Apospory-Specific Genomic Region (ASGR)-carrier chromosome
BMC Genomics volume 12, Article number: 206 (2011)
Apomixis, asexual seed production in plants, holds great potential for agriculture as a means to fix hybrid vigor. Apospory is a form of apomixis where the embryo develops from an unreduced egg that is derived from a somatic nucellar cell, the aposporous initial, via mitosis. Understanding the molecular mechanism regulating aposporous initial specification will be a critical step toward elucidation of apomixis and also provide insight into developmental regulation and downstream signaling that results in apomixis. To discover candidate transcripts for regulating aposporous initial specification in P. squamulatum, we compared two transcriptomes derived from microdissected ovules at the stage of aposporous initial formation between the apomictic donor parent, P. squamulatum (accession PS26), and an apomictic derived backcross 8 (BC8) line containing only the Apospory-Specific Genomic Region (ASGR)-carrier chromosome from P. squamulatum. Toward this end, two transcriptomes derived from ovules of an apomictic donor parent and its apomictic backcross derivative at the stage of apospory initiation, were sequenced using 454-FLX technology.
Using 454-FLX technology, we generated 332,567 reads with an average read length of 147 base pairs (bp) for the PS26 ovule transcriptome library and 363,637 reads with an average read length of 142 bp for the BC8 ovule transcriptome library. A total of 33,977 contigs from the PS26 ovule transcriptome library and 26,576 contigs from the BC8 ovule transcriptome library were assembled using the Multifunctional Inertial Reference Assembly program. Using stringent in silico parameters, 61 transcripts were predicted to map to the ASGR-carrier chromosome, of which 49 transcripts were verified as ASGR-carrier chromosome specific. One of the alien expressed genes could be assigned as tightly linked to the ASGR by screening of apomictic and sexual F1s. Only one transcript, which did not map to the ASGR, showed expression primarily in reproductive tissue.
Our results suggest that a strategy of comparative sequencing of transcriptomes between donor parent and backcross lines containing an alien chromosome of interest can be an efficient method of identifying transcripts derived from an alien chromosome in a chromosome addition line.
Apomixis, asexual reproduction through seed, is widespread among flowering plant families, but low in its frequency of occurrence . Different from sexual reproduction, apomictically derived embryos develop autonomously from unreduced ovular cells instead of through fertilization of a reduced egg by a sperm. Therefore, the progeny of an apomictic plant are genetically identical to the maternal plant [2, 3]. This trait can be used as an advanced breeding tool in agriculture since it would enable fixation of hybrid vigor and seed propagation of desirable genotypes [4–7]. No major agriculturally important crop possesses this trait [8–10]. Introgression of apomixis into crops through crossing has been impeded by factors such as polyploidy and incompatibility . Therefore, discovery of genetic mechanisms underlying apomixis will be crucial for manipulation of apomixis for introduction into target crops.
Apomixis has been classified into two types and three developmental pathways: gametophytic apomixis, including apospory and diplospory, and sporophytic apomixis, which is also known as adventitious embryony . In sporophytic apomixis, an embryo forms directly from an ovular cell and coexists with the zygotic embryo. For gametophytic apomixis, the embryo develops from an unreduced egg in an embryo sac derived through mitosis of either a somatic nucellar cell (apospory) or the megaspore mother cell (diplospory). In apospory, meiosis either does not complete or its products degenerate while aposporous initials (AIs) develop from one or more somatic nucellar cells. Both genotypes chosen for the present study are aposporous with the trait conferred by genetic elements from Pennisetum squamulatum. Aposporous P. squamulatum has four-nucleate embryo sacs that lack antipodals . Apospory in this species is inherited as a dominant Mendelian trait  and is associated with an approximately 50 Mb, heterochromatic and hemizygous chromosomal region designated the Apospory-Specific Genomic Region (ASGR), [12, 13].
Many transcriptional approaches to discover the regulatory mechanisms and downstream effects associated with apomixis in many species have been undertaken. In Brachiaria, differential display applied to apomictic and sexual ovaries at anthesis yielded two apomixis-specific fragments  while a study on earlier sporogenesis and gametogenesis stages identified eleven differentially expressed fragments . In Paspalum notatum, three expressed sequence tags (ESTs), all highly similar in sequence, showed differential expression in flowers between apomictic and sexual F1 individuals after apospory initiation . An additional 65 genes were identified as differentially expressed between sexual and aposporous plants . cDNA-AFLP analysis in Paspalum simplex yielded transcripts linked to the apomixis-controlling locus (ACL). Many of these linked fragments showed stop codons and frameshift mutations, suggesting that they are pseudogenes . cDNA-AFLP was also applied to identify apomixis candidate genes in Poa pratensis where 179 transcript-derived fragments from spikelets showed qualitative and quantitative expression differences between apomictic and sexual genotypes . The full-length sequences of two genes of interest, PpSERK (SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE) and APOSTART were obtained and their temporal and spatial expression patterns were assessed by reverse transcription polymerase chain reaction (RT-PCR) and in situ hybridization, respectively. While neither one of these two candidate genes showed apomixis- or sexual-specific expression, quantitative differences in expression between apomictic and sexual genotypes were observed .
One apomixis-specific gene was identified from a Panicum maximum ovule cDNA library and shown to be expressed in both aposporous initials and embryos at four days after anthesis [21, 22]. Additional genes have been identified in Panicum through microarray and quantitative RT-PCR analysis . In Pennisetum ciliare, differential display and suppression subtractive hybridization were used to identify gene expression differences in ovaries of sexual and apomictic accessions [24, 25]. SuperSAGE, a high-throughput differential display approach, has been used to discover several hundred transcripts with heterochronic shifts in expression between apomictic and sexual ovules at multiple stages of development [26, 27].
Formation of aposporous initials is the first and most critical event for occurrence of apospory. Because the initiation of sexual and apomictic pathways likely is activated by different signals , understanding the molecular mechanism underlying apospory initiation can provide insight into developmental regulation and downstream signaling that results in apomixis. In order to discover candidates for regulating aposporous initial specification in P. squamulatum, we compared two transcriptomes derived from microdissected ovules at the stage of aposporous initial (AI) formation between the apomictic donor parent, P. squamulatum, and its apomictic derivative backcross 8 (BC8) containing a single P. squamulatum chromosome. Initially, a P. glaucum x P. squamulatum F1 was crossed with a P. glaucum x P. purpureum F1 and hybrid apomictic individuals with good male fertility were selected . Subsequent backcrosses with tetraploid P. glaucum yielded a BC8 line that was shown by FISH to contain only one chromosome from P. squamulatum. This single chromosome common to both apomictic BC8 and P. squamulatum was the ASGR-carrier chromosome based on the transmission of the trait of apomixis and linked molecular markers . We hypothesize that candidate genes regulating aposporous initial specification and localized to the ASGR will function in both PS26 and BC8 at the same developmental stage and would be identical in sequence as they are related by descent.
The development and commercialization of new massively parallel sequencing platforms have made transcriptome sequencing faster and more affordable. One platform, developed by 454 Life Sciences Corporation, the 454 GS-FLX sequencer, is capable of producing 100 Mb of sequence data with an average read length of 250 bp per bead in a 7-h run . Successful applications of these high-throughput sequencing technologies to transcriptome analysis have been reported [33–37]. Here we present expressed sequence tags (ESTs) generated by Roche 454 high-throughput sequencing technology from dissected ovule tissues staged for aposporous initial formation from two apomictic lines chosen for their common features of apospory and single shared chromosome. Alien chromosome (ASGR-carrier chromosome) expressed transcripts were identified and tested for ASGR linkage and tissue expression.
Aposporous ovule-enriched cDNA samples for sequencing
Ovules from PS26 and BC8 around the stage of aposporous initial formation were manually dissected from pistils (Figure 1). Three biological replicates of 40 ovules each were collected for both PS26 and BC8. The yield of total RNA from each replicate was approximately 20 ng from which 15 ng was used for one-round of T7 RNA polymerase-based RNA amplification. The average yield from one round of amplification was 90 μg. For each library, equal amounts of amplified RNA from each replicate were combined and 15 μg amplified RNA was used for ds-cDNA synthesis. The majority of the ds-cDNA synthesized from amplified RNA was distributed in a size range from 200 bp to 1000 bp (Figure 2).
Assembly of sequences from PS26 and BC8 aposporous ovules
Two aposporous ovule transcriptomes, one from PS26 and the other from BC8, were sequenced using the high-throughput 454-FLX sequencer. The PS26 transcriptome library contained 332,567 reads with an average read length of 147 base pairs (bp) and the BC8 transcriptome library contained 363,637 reads with an average read length of 142 bp. Assembly by the Multifunctional Inertial Reference Assembly (MIRA) program  resulted in 33,977 contigs from the PS26 ovule transcriptome library and 26,576 contigs from the BC8 ovule transcriptome library (Additional file 1: PS26_MIRA.fasta, Additional file 2: BC8_MIRA.fasta). The number of reads per contig ranged from 1 to 759 in PS26 assemblies and 1 to 1661 in BC8 assemblies with the majority having less than 30 reads per assembly in both cases. The numbers of singletons in PS26 and BC8 libraries were 176 and 78, respectively.
Contigs from both transcriptome libraries were analyzed for biological functions using Blast2GO . For both libraries, the use of T7 amplified RNA biased the sequencing data toward the 3' UTR region as shown by the BlastX results of the Blast2GO analysis. 5,730 PS26 contigs (~17%) and 4,833 BC8 contigs (~18%) had hits against the nr database of NCBI with an E-value cut-off of e-06. For both libraries, 90% of the top BlastX hits were, in order, to Sorghum bicolor, Zea mays or Oryza sativa proteins. Blast2GO was able to fully annotate 4,400 PS26 contigs and 3,692 BC8 contigs (Figure 3).
To obtain additional functional data from the shorter reads, a study was initiated to test whether the most significant BlastN EST_other database hit (E-value cut off of e-20) could be used as a surrogate longer sequencing read for the PS26/BC8 transcripts. Approximately 55% (14,518) of the BC8 contigs had an EST_OTHERS hit ≤e-20. Blast2GO analysis was used for the BC8_EST_OTHERS best matches and compared with Blast2GO mapping results for the 3692 annotated BC8 contigs. The majority (84%) of the BC8 contigs had Blast2GO mapping data identical to the corresponding BC8_EST_OTHERS mapping data while only 5% of the BC8 contigs had >50% non-matching mapping data. Given the large percentage of identical and/or highly matching mapping data, a library of PS26_EST_OTHERS was also established using the same parameters as BC8_EST_OTHERS. Approximately 53% (18,028) of the PS26 contigs had an EST_OTHERS hit ≤e-20. Blast2GO was able to fully annotate 12,462 PS26_EST_OTHERS contigs and 10,107 BC8_EST_OTHERS contigs.
A Fisher's Exact Test (using GOSSIP; ) was done to identify significant differences of expression data between the PS26 and BC8 libraries and the PS26_EST_OTHERS and BC8_EST_OTHERS libraries. At a false discovery rate (FDR) ≤0.01, 28 GO terms were identified as different between the PS26 and BC8 libraries (Table 1). However, when the PS26_EST_OTHERS and BC8_EST_OTHERS libraries were compared at FDR <0.05 (at an FDR ≤0.01 no significant results were returned), only 7 GO terms (ribosome, translation, ribosome biogenesis, ribonucleoprotein complex biogenesis, ribonucleoprotein complex, structural constituent of ribosome, cellular component biogenesis) were identified as differentially expressed between the two libraries (Table 1).
In Silico identification of putative alien expressed transcripts
When MIRA-assembled contigs from the two libraries were analyzed by BlastN with PS26 sequences as queries and BC8 sequences as the database, a total of 118 comparisons were obtained with 100% sequence identity across an overlapping region ≥100 bp corresponding to 115 unique contigs from the PS26 database and 116 unique contigs from the BC8 database. The 118 PS26/BC8 contigs were further analyzed by aligning the corresponding PS26 and BC8 contigs with each other, resulting in 61 inter-genotype contigs with no mismatches that were aligned. The average overlapping regions of the 61 inter-genotype contigs was 241 bp (ranging from 181 bp to 419 bp) with an average number of 28 sequence reads. The remaining PS26/BC8 contigs, while initially identified by BlastN as having 100% identity over a region >100 bp, did not continue to share sequence similarity outside this region and therefore did not align over the whole contig.
Mapping and predicted function of putative ASGR-carrier chromosome transcripts
Up to four primer pairs per contig were used to test for linkage of the 61 contigs to the ASGR-carrier chromosome. Sequence characterized amplified region (SCAR) primer pairs were designed based on the PS26 contig sequence (Additional file 3, Table S1). After screening by PCR against PS26, IA4X (4 × P. glaucum), N37 (P. purpureum) and a small number of progeny from apomictic BC8 segregating for mode of reproduction, 45 contigs showed specific amplification from PS26 and apomictic BC8 but no amplification from IA4X or sexual BC8 individuals (Figure 4, Table 2) establishing linkage of 45 contigs to the ASGR-carrier chromosome. Single-strand conformation polymorphism analysis (SSCP) and a CAPS screen using two to four restriction enzymes was applied to the 14 primer pairs which amplified products in both PS26 and IA4X DNA. Four additional contigs could be linked to the ASGR-carrier chromosome using SSCP analysis (Table 2). The CAPS screen identified a Hae III polymorphism for PS26_c2552, a transcript also mapped by SSCP.
The markers from the 49 ASGR-carrier chromosome-linked contigs were initially screened on a limited number of apomictic (4) and sexual (4) F1s for mapping to the ASGR. This resulted in one contig, PS26_c9369, showing tight linkage to the ASGR as the primers amplified DNAs from only apomictic F1s but not sexual F1s (Figure 5, Table 2). The remaining primer sets did not show amplification specificity in the F1 population; both apomictic and sexual progeny amplified.
A larger F1 population of 22 individuals (10 apomictic and 12 sexual) was used to map the PS26_c9369 and PS26_c2552 transcripts. PS26_c2552 was mapped based on the Hae III polymorphism found in the CAPS screen between PS26 and IA4X and also seen in the F1 population. PS26_c2552 is unlinked to the ASGR as the CAPS polymorphism segregated 1:1 in the population but with 7 sexual and 5 apomictic individuals containing the marker. In comparison, the PS26_c9369 primers remained specific to the 10 apomictic plants and did not amplify the 12 sexual plants.
BlastX searches against NCBI databases were carried out for the 49 PS26/BC8 ASGR-carrier chromosome linked contigs and best protein hits for 18 contigs are summarized in Table 3. Because the sequences are 3' biased, a BlastN analysis against the expressed sequence tag (EST_OTHERS) database at NCBI with the remaining 31 PS26/BC8 contigs was done to find potential orthologs from other species. At an E-value cutoff of e-20, 18 contigs had EST hits (Table 3). A BlastX was performed using these EST sequences to determine if tentative protein functions could be obtained, and the best hits are listed in Table 3. The remaining 13 (27%) contigs did not have hits by either BlastX or BlastN; therefore, they were considered orphan genes.
In order to generate contiguous sequence that might enhance the potential for mapping of contigs in the F1 population and to extract a longer cDNA sequence for PS26_c9369, a cDNA library containing ~300,000 phage plaques was constructed from apomictic BC8 mature ovary and anther RNA since all 49 ASGR-carrier chromosome transcripts showed expression in these tissues by RT-PCR. Screening of the cDNA library with 27 ASGR-carrier chromosome transcript probes yielded hybridization signals for 24 probes. PCR screening with the ASGR-carrier chromosome-specific primers identified 16 ASGR-carrier chromosome clones and one clone for PS26_c9369. Additional sequence for these clones was generated.
The PS26_c9369 clone contained a 646 bp insert. BlastX analysis identified similarity to a hypothetical protein SORBIDRAFT_10g020450 (XP_002438482.1; e-value 6e-18) and Oryza sativa hypothetical protein OsJ_30933 [EAZ15525.1; e-value 4e-16] over an ~155 bp region. In both sorghum and rice, the area of similarity overlapped a pfam03004: Transposase_24 domain for those proteins. The remaining PS26_c9369 clone sequence was unique. Nine primer sets were designed from nine PS26 contigs to span introns based on predicted splicing of best hits to sorghum. Five primer sets gave strong amplification of PS26 genomic DNA. These amplicons were cloned and sequenced to identify SNPs within the PS26 genomic alleles. CAPS markers could be designed for PS26_c1580 (Hpy CH4IV) and PS26_c33813 (Hpy CH4IV). Mapping of 4 apomictic and 4 sexual F1s did not show tight linkage of these contigs to the ASGR.
Expression profiles of ASGR-linked expressed transcripts by RT-PCR
RT-PCR with RNA extracted from apomictic BC8 leaf, root, anther, and ovary tissues was completed for the 49 candidate genes mapped to the ASGR-carrier chromosome. Forty-seven were expressed in all four organ types examined (Figure 6a). However, one putative MADS-domain containing transcription factor, corresponding to contig PS26_c33813, showed amplification only in anther and ovary tissues (Figure 6b) and contig PS26_c10535, a putative Lon protease, showed expression in all organs except anther.
Transcriptional profiling has been extensively used for gene discovery in plants because the absence of introns greatly enhances the information content of the data set and eases data interpretation [41–43]. Combined with 454 high-throughput sequencing technology, transcriptome sequencing has become an approach to understand molecular events at the gene expression level on a genome-wide scale. Many successful applications of 454 sequencing technology in transcriptome sequencing and single nucleotide polymorphism (SNP) discovery have been reported [44–49] and supported our use of this technology for ovule transcriptome sequencing.
In contrast to studies aimed at identifying genes involved in apomictic reproduction through the identification of differences between apomictic and sexual genotypes, our study compared two apomictic lines for identical transcripts. We previously reported that the ASGR is sufficient to induce apomixis in sexual pearl millet [11, 12]; therefore, the trait of apomixis in BC8 is conferred by the ASGR-carrier chromosome from PS26 . In the present study, we have attempted to identify candidate genes regulating the first step of apomixis, aposporous initial development, by transcriptome analysis of ovules from both PS26 and BC8. The ovules were collected at the stage of aposporous initial development, which ranged from no apparent apospory initials (~70%) to distinct aposporous initials observed (~30%). By pooling ovules over this range of development our objective was to minimize the chance of missing genes involved in the pathway of apomixis initiation since we would predict transcription prior to, and perhaps beyond, apospory initial formation.
The two ovule transcriptomes generated had an average read length of ~150 bp, shorter than the average read length of 200-300 bases for the 454 GS FLX sequencer. The shorter than expected reads could have been due to a combination of factors in preparing the samples for sequencing such as the T7-based antisense RNA amplification method, the conversion of antisense RNA to cDNA, or during the shearing process of the cDNA to prepare the sequencing library. Another possible factor is the species itself. It has been shown that the average read length can vary among different organisms due to differences in AT/GC content .
Even with short reads and using stringent comparison conditions to decrease the number of false positive joins between highly similar but not identical transcripts from the two species, 61 putative ASGR-carrier chromosome candidate expressed genes were identified in silico, of which 49 have confirmed linkage to the ASGR-carrier chromosome. The 3' bias of the T7 amplified transcripts helped in the design of primers to discriminate between P. squamulatum and the BC8 pearl millet genome containing one P. squamulatum chromosome. Our sequencing strategy helped remove, at least to a chromosomal level, the difficulties associated with candidate gene identification by comparative gene expression analysis in apomictic and sexual systems which lack, due to the apomictic process, an ability to generate isogenic lines that vary only in their mode of reproduction. Primer specificity for 48 transcripts was not seen when we attempted to map SCARs to the ASGR using a F1 population containing many P. squamulatum chromosomes. The additional sequence generated by the phage cDNA clones allowed mapping of two more transcripts in the F1 population. Greater sequence length would be advantageous for mapping of the ASGR-carrier chromosome transcripts to the ASGR locus.
The use of the gene ontology software Blast2Go allowed comparison of both the PS26 and BC8 libraries and the PS26_EST_OTHERS and BC8_EST_OTHERS libraries created by using the most significant EST_OTHERS BlastN result as a surrogate for our sequences. The PS26 and BC8 transcriptomes were almost identical on a level 3 biological process comparison. While many biological GO terms showed expression level differences when comparing the PS26 and BC8 libraries, all but seven became non-significant when the PS26_EST_OTHERS and BC8_EST_OTHERS libraries were compared. Six of the transcriptional differences noted belong to genes involved in either ribosomal or translational functions. This difference may be caused by ploidy level difference of PS26 (an octoploid) and BC8 (a tetraploid). MIRA assembly will separate alleles of genes into different contigs. More PS26 allelic transcripts for genes involved in either ribosomal or translational functions may be expressed in PS26 than in BC8 thus leading to a higher transcript difference between the libraries.
Expression analysis of the ASGR-carrier chromosome linked genes in BC8 tissue was used to identify transcripts specific to reproductive tissue. All but two ASGR-carrier chromosome transcripts showed constitutive expression in both vegetative and reproductive tissues. The one reproduction-specific transcript (the MADS box gene) did not map to the ASGR. The transcript which could be mapped to the ASGR shows similarity to "hypothetical" proteins in both sorghum and rice containing a Transposase_24 domain. Previous sequencing of BAC clones linked to the ASGR have shown a large number of both Type I and Type II transposons at the locus [50, 13]; therefore, it is not surprising that we identified an ASGR-linked transposon transcript in our study.
Our data show that the combination of selecting specific reproductive tissues and sequencing with 454 high-throughput sequencing technology is a promising approach for identification of genes involved in different developmental events and that a need for longer transcript contigs will be a requirement to allow for easier mapping of these transcripts. Given the rapid advancements in next-generation sequencing technologies that enable very deep sequence coverage and paired-end reads, it is likely that the fine tissue dissection requiring RNA amplification of starting materials now could be eliminated to favor longer transcript assemblies.
Pennisetum squamulatum (PS26; PI 319196, 2n = 56) and backcross line 8 (BC8)-line 58were used for ovule collection. Compared with the BC7 line which was used in previous studies , the BC8-line 58 contains only one alien chromosome from PS26, the ASGR-carrier chromosome . P. glaucum (IA4X), P. purpureum (N37), 4 apomictic and 4 sexual plants from BC8-line 58(BC8 is facultative thus it produces ~ 18% sexually derived offspring were used for assigning the candidate transcript fragments to the ASGR-carrier chromosome. Twenty-two individuals from a segregating F1 population between P. squamulatum and P. glaucum were used for mapping the transcript fragments to the ASGR.
Young florets were dissected from small inflorescence sections whose anthers were at stages between premeiosis and prophase, as determined by acetocarmine staining of anther squashes. One group of florets was stored in RNALater® solution (Ambion, Austin, TX, USA) at 4°C while the other group was processed for ovary clearing by methyl salicylate  to screen for the ovary developmental stage. Ovules from thirty cleared florets were examined for each group. If the cleared sample showed AIs in less than 30% of the ovaries and the remaining ovaries were at an earlier developmental stage, then florets stored in RNALater® solution from the same section of inflorescence were used for ovule dissection. About 40 ovules per sample were collected and total RNA was extracted from the ovules with RNAqueous®-Micro Kit (Ambion). RNA integrity and quantity were analyzed with an Agilent 2100 Bioanalyser (Santa Clara, CA) at the Interdisciplinary Center for Biotechnology Research (ICBR) of the University of Florida.
RNA amplification and ds-cDNA synthesis for Roche 454 sequencing
With total RNA as starting material, mRNA was amplified by T7-based in vitro transcription following the manual of TargetAmp™2-Round aRNA Amplification Kit 2.0 (Epicentre, Madison, WI). Size range and quantity of the amplified mRNA were measured by both gel electrophoresis and Agilent 2100 Bioanalyser analysis. For each sample, an equal amount of amplified mRNA from the three biological replicates was pooled for ds-cDNA synthesis following the protocol developed by the Schnable lab . Size-range and quantity of ds-cDNA were also analyzed by both gel electrophoresis and using the Agilent 2100 Bioanalyser before submitting the samples for sequencing.
454 sequencing and processing
About 6 μg of ds-cDNA from both PS26 and BC8 was submitted to the Genome Sequencing Center at Washington University for 454-FLX sequencing. Samples of cDNA were subjected to mechanical shearing (nebulization), size selected, and blunt-end fragments were ligated to short adaptors, which provided primer target sites for both amplification and sequencing. Sequencing files (Accession #SRA030528) were submitted to the Sequence Read Archive at NCBI http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=studies. The Multifunctional Inertial Reference Assembly (MIRA) program  was used to process and assemble the sequences from each library. Adaptor sequences and low quality sequence reads were removed prior to assembly. The assembly was run as a de novo, 454 EST project with accurate assembly and polyA/T clipping. Each library of contig assemblies from PS26 and BC8 was converted to a database and analyzed with the BlastN program provided by the RCC (Research Computing Center) at the University of Georgia http://rcc.uga.edu. The PS26 library contigs were chosen as queries and the BC8 library was chosen as the database. The BlastN analysis was performed with an E-value cutoff of ≤ e-100. The BlastN output was parsed using an internal script such that only contigs with 100% identity over at least 100 bp were selected for further analysis.
BLAST analysis of the selected contigs
BlastX was used to analyze sequences mapping to the ASGR-carrier chromosome by searching against the NCBI (National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov/) databases. A BlastN analysis was conducted on contigs without significant BlastX hits (e-value ≤ e-06) to search for similar ESTs from other species. The most significant EST hit with an e-value of at least ≤e-20 was used for BlastX query to search for putative encoding proteins.
Mapping of identical PS26/BC8 contigs to the alien chromosome and/or ASGR
Fasta files containing sequences from contigs with 100% identity over at least 100 bp from both PS26 and BC8 libraries were generated. Alignment of each PS26/BC8 contig pair yielded sixty-one assemblies of PS26/BC8 contigs used as candidates for mapping to the ASGR-carrier chromosome. The 61 PS26/BC8 contigs from were used as queries with BlastN against both the PS26 and BC8 MIRA-assembled databases at an E-value cutoff of ≤e-25. The BlastN results were parsed and used to help estimate the 'uniqueness' of the contig within the transcriptome. Primers were designed based on the overlapping region of PS26 and BC8 contigs, and in some cases included further 3' sequences for primer design if the contig was unique in both databases. When multiple contigs from each database showed high similarity to each other, primers were designed based on the region with the best polymorphisms to distinguish one from another. Primers were first tested for amplification with PS26, IA4X, N37 and 4 apomictic and 4 sexual plants from a segregating population of BC8. Primer pairs which did not amplify either IA4X or sexual BC8 individuals were used for further screening with apomictic and sexual F1s to test for linkage to the ASGR.
For SSCP analysis a Bio-Rad Protean II system (Bio-Rad Laboratories, Hercules, CA) was used to separate fragments in a 1 mm thick 12% non-denaturing PAGE gel with 10% glycerol. PCR product (2 μl) was mixed with 10 μl LIS loading dye (10% sucrose, 0.01% bromophenol blue, and 0.01% xylene cyanol FF), denatured at 98°C for 10 min and cooled to RT for at least 10 min. Sample (10 μl) was loaded and the gel was run in at 200 V for 20-22 hours at 25°C. Silver staining was used to detect the SSCP fragments.
Expression patterns of transcripts mapped to the alien chromosome
Total RNA was extracted from a panel of BC8 tissues including vegetative (leaf, root), and reproductive tissues at anthesis but before pollination (anther and ovary) with QIAGEN RNeasy® Plant Mini kit (QIAGEN, Valencia, CA) following the manufacturer's protocol. First-strand cDNA was synthesized following the manufacturer's protocol of First-strand cDNA Synthesis kit (Invitrogen, Carlsbad, CA). RT-PCR reactions were performed using primer pairs which mapped to the ASGR-carrier chromosome in a total volume of 20 μl containing 1 μl of first-strand cDNA, 1 μM of each primer, 1X PCR buffer, 1.5 mM MgCl2, 0.2 mM dNTPs, and 1 unit of JumpStart™ Taq DNA polymerase (Sigma, St. Louis, MO). Amplification of contaminating genomic DNA was tested by the inclusion of controls that omitted the reverse transcriptase enzyme from the cDNA synthesis reaction, e.g. no RT controls. The PCR reaction was denatured at 94°C for 5 min followed by 35 cycles of 94°C denaturation for 30 seconds, annealing for 30 seconds at respective temperatures, and 72°C extension for 1 min. RT-PCR products were separated on a 1.5% agarose gel and stained with ethidium bromide. Gel images were captured with the Molecular Imager Gel Doc XR System (Bio-Rad Laboratories).
cDNA library construction
Ovaries and anthers collected from apomictic BC8 around anthesis but prior to fertilization were frozen in liquid nitrogen. Total RNA was extracted with the RNeasy® Plant Mini kit (QIAGEN) and then poly A+ RNA was purified from total RNA with Oligotex® mRNA Mini kit (QIAGEN) following the manufacturer's protocols. Yield of mRNA was quantified with a Nanodrop spectrophotometer (Thermo Fisher Scientific Inc., Wilmington, DE). mRNA was used for double-stranded cDNA synthesis with ZAP-cDNA® Synthesis Kit following the manufacturer's protocol (Stratagene, La Jolla, CA). Ligations, packaging, titering of the packaging reactions, and plaque lifts were conducted following the manufacturer's protocol of ZAP-cDNA® Gigapack® III Gold Cloning Kit (Stratagene).
cDNA library screening for target genes
The apomictic BC8 ovary and anther-enriched cDNA library was screened with α-32P labeled probes with transcripts mapping to the ASGR-carrier chromosome. The PCR fragments amplified from apomictic BC8 genomic DNA with the primers used for assigning a fragment to the ASGR-carrier chromosome were diluted and labeled with α-32P by PCR in a total volume of 20 μl. The labeling reaction contained ~0.1 ng primary PCR fragment, 1.25 unit Jumpstart Taq DNA polymerase (Sigma), 0.25 μM of each primer, 0.5 mM dATP/dTTP/dGTP mixture, 5 μl of α-32P-labeled dCTP (3000 Ci/mmol) and 1 × PCR buffer (10 mM Tris-HCl, 50 mM KCl, 1.5 mM MgCl2). Probes were purified by passing through homemade Sephadex G-50 (Sigma) columns, which were assembled with Ultrafree®-MC Centrifugal Filter Units (Millipore, Bedford, MA). Pre-hybridization of the membranes in hybridization buffer (0.5 M sodium phosphate, 7% SDS, 1 mM EDTA, pH 8.0) containing 0.1 mg ml-1 salmon sperm DNA, which was denatured in boiling water for 10 minutes and cooled on ice before adding to the hybridization solution, was conducted at 65°C for 4 h before addition of the labeled, denatured probe. Hybridization was conducted at 65°C overnight followed by three washes at the same temperature for 30 min each with the following buffers: 1) 1 × SSC, 0.1% SDS; 2) 0.5 × SSC, 0.1% SDS; 3) 0.1 × SSC, 0.1% SDS. After the final wash, membranes were wrapped with plastic film and exposed to x-ray film overnight at -80°C prior to manually developing with Kodak® GBX Developer and Fixer (Thermo Fisher Scientific Inc). Autoradiographs were aligned with the respective plates to recover hybridizing plaques with sterile glass pipettes. Recovered plaques were released in tubes containing 1.0 ml SM phage buffer (according to the formula in the manual of ZAP-cDNA® Gigapack® III Gold Cloning Kit) and 20 μl chloroform (Sigma). After overnight elution at 4°C, 1 μl SM buffer of each recovered sample was used for PCR to verify positive signals. Since the primary screening was carried out with a high density of plaque clones, the recovered positive plaques were purified after secondary and tertiary screens at much lower densities. Single plaques showing positive hybridization signals were recovered in 500 μl SM buffer with 10 μl chloroform (Sigma) at 4°C.
Sequencing and mapping of candidate cDNA clones to the ASGR
In vivo excision of single plaque clones was conducted using ExAssist® helper phage with SOLR® strain following the protocol in the manual of ZAP-cDNA® Gigapack® III Gold Cloning Kit (Stratagene). Single colonies containing the pBluescript double-stranded phagemid with the cloned cDNA insert were isolated and cultured in liquid Luria-Bertani (LB) medium containing 100 μg mL-1 ampicillin at 37°C overnight. An aliquot of each culture was further grown in freeze broth containing 100 μg mL-1 ampicillin at 37°C overnight and then stored at -80°C before sending out for sequencing. Sequencing was conducted with M13 primers (Georgia Genomics Facility, Athens, GA). Vector and bad quality sequences were trimmed from the original sequences with VectorNTI Advanced 10 (Invitrogen) and primers were designed with VectorNTI using the high quality cDNA sequences. Primers were then tested with apomictic and sexual F1s for linkage to the ASGR as described above.
Annotation for each library was performed using Blast2GO software, http://www.blast2go.org/start_blast2go. BlastX (database: GenBank nr/E-value cutoff: e-06), GO term mapping (default values) and Annotation (database: b2g-2009 with default values) were used. Annotations were validated and augmented using ANNEX. Libraries were compared using the Fisher's exact test with FDR value of ≤0.01 or ≤0.05.
Nogler GA: Gametophytic apomixis. Embryology of Angisoperms. Edited by: Johri BM. 1984, Springer-Verlag, Berlin, 475-518.
Koltunow AM: Apomixis: Embryo Sacs and Embryos Formed without Meiosis or Fertilization in Ovules. Plant Cell. 1993, 5: 1425-1437.
Grimanelli D, Leblanc O, Perotti E, Grossniklaus U: Developmental genetics of gametophytic apomixis. Trends Genet. 2001, 17: 597-604. 10.1016/S0168-9525(01)02454-4.
Hanna WW: Use of apomixis in cultivar development. Adv Agron. 1995, 54: 333-350.
Koltunow AM, Bicknell RA, Chaudhury AM: Apomixis: Molecular Strategies for the Generation of Genetically Identical Seeds without Fertilization. Plant Physiol. 1995, 108: 1345-1352.
Savidan Y: Apomixis: genetics and breeding. Plant Breed Rev. 2000, 18: 13-85.
van Dijk P, van Damme J: Apomixis technology and the paradox of sex. Trends Plant Sci. 2000, 5: 81-84. 10.1016/S1360-1385(99)01545-9.
Carman JG: Asynchronous expression of duplicate genes in angiosperms may cause apomixis, bispory, tetraspory, and polyembryony. Biol J Linn Soc. 1997, 61: 51-94. 10.1111/j.1095-8312.1997.tb01778.x.
Bicknell RA, Koltunow AM: Understanding apomixis: recent advances and remaining conundrums. Plant Cell. 2004, 16 (Suppl): S228-245. 10.1105/tpc.017921.
Ozias-Akins P: Apomixis: Developmental Characteristics and Genetics. Critical Reviews in Plant Sciences. 2006, 25: 199-214. 10.1080/07352680600563926.
Ozias-Akins P, Roche D, Hanna WW: Tight clustering and hemizygosity of apomixis-linked molecular markers in Pennisetum squamulatum implies genetic control of apospory by a divergent locus that may have no allelic form in sexual genotypes. Proc Natl Acad Sci USA. 1998, 95: 5127-5132. 10.1073/pnas.95.9.5127.
Goel S, Chen Z, Conner JA, Akiyama Y, Hanna WW, Ozias-Akins P: Delineation by fluorescence in situ hybridization of a single hemizygous chromosomal region associated with aposporous embryo sac formation in Pennisetum squamulatum and Cenchrus ciliaris. Genetics. 2003, 163: 1069-1082.
Akiyama Y, Conner JA, Goel S, Morishige DT, Mullet JE, Hanna WW, Ozias-Akins P: High-resolution physical mapping in Pennisetum squamulatum reveals extensive chromosomal heteromorphism of the genomic region associated with apomixis. Plant Physiol. 2004, 134: 1733-1741. 10.1104/pp.103.033969.
Leblanc O, Armstead I, Pessino SC, Ortiz JP, Evans C, Valle CD, Hayward MD: Non-radioactive mRNA fingerprinting to visualize gene expression in mature ovaries of Brachiaria hybrids derived from B. brizantha, an apomictic tropical forage. Plant Science. 1997, 126: 49-58. 10.1016/S0168-9452(97)00067-8.
Rodrigues JC, Cabral GB, Dusi DM, de Mello LV, Rigden DJ, Carneiro VT: Identification of differentially expressed cDNA sequences in ovaries of sexual and apomictic plants of Brachiaria brizantha. Plant Mol Biol. 2003, 53: 745-757.
Pessino SC, Espinoza F, Martinez EJ, Ortiz JP, Valle EM, Quarin CL: Isolation of cDNA clones differentially expressed in flowers of apomictic and sexual Paspalum notatum. Hereditas. 2001, 134: 35-42.
Laspina NV, Vega T, Seijo JG, Gonzalez AM, Martelotto LG, Stein J, Podio M, Ortiz JPA, Echenique VC, Quarin CL, Pessino SC: Gene expression analysis at the onset of aposporous apomixis in Paspalum notatum. Plant Mol Biol. 2008, 67: 615-628. 10.1007/s11103-008-9341-5.
Polegri L, Calderini O, Arcioni S, Pupilli F: Specific expression of apomixis-linked alleles revealed by comparative transcriptomic analysis of sexual and apomictic Paspalum simplex Morong flowers. J Exp Bot. 2010, 61: 1869-1883. 10.1093/jxb/erq054.
Albertini E, Marconi G, Barcaccia G, Raggi L, Falcinelli M: Isolation of candidate genes for apomixis in Poa pratensis L. Plant Mol Biol. 2004, 56: 879-894. 10.1007/s11103-004-5211-y.
Albertini E, Marconi G, Reale L, Barcaccia G, Porceddu A, Ferranti F, Falcinelli M: SERK and APOSTART. Candidate genes for apomixis in Poa pratensis. Plant Physiol. 2005, 138: 2185-2199. 10.1104/pp.105.062059.
Chen L, Miyazaki C, Kojima A, Saito A, Adachi T: Isolation and characterization of a gene expressed during early embryo sac development in apomictic guinea grass (Panicum maximum). J Plant Physiol. 1999, 154: 55-62.
Chen L, Guan L, Seo M, Hoffmann F, Adachi T: Developmental expression of ASG- 1 during gametogenesis in apomictic guinea grass (Panicum maximum). J Plant Physiol. 2005, 162: 1141-1148. 10.1016/j.jplph.2005.02.010.
Yamada-Akiyama H, Takahara M, Kikuchi S, Takamiza T, Nakagawa H, Sugita Si, Kishimoto N, Ebina M, Akiyama Y, Xu Q, Yazaki J, Tsuruta Si: Analysis of expressed sequence tags in apomictic guineagrass (Panicum maximum). J Plant Physiol. 2009, 166: 750-761. 10.1016/j.jplph.2008.10.001.
Vielle-Calzada JP, Nuccio ML, Budiman MA, Thomas TL, Burson BL, Hussey MA, Wing RA: Comparative gene expression in sexual and apomictic ovaries of Pennisetum ciliare (L.) Link. Plant Mol Biol. 1996, 32: 1085-1092. 10.1007/BF00041392.
Singh M, Burson BL, Finlayson SA: Isolation of candidate genes for apomictic development in buffelgrass (Pennisetum ciliare). Plant Mol Biol. 2007, 64: 673-682. 10.1007/s11103-007-9188-1.
Sharbel TF, Voigt ML, Corral JM, Thiel T, Varshney A, Kumlehn J, Vogel H, Rotter B: Molecular signatures of apomictic and sexual ovules in the Boechera holboellii complex. Plant J. 2009, 58: 870-882. 10.1111/j.1365-313X.2009.03826.x.
Sharbel TF, Voigt ML, Corral JM, Galla G, Kumlehn J, Klukas C, Schreiber F, Vogel H, Rotter B: Apomictic and sexual ovules of Boechera display heterochronic global gene expression patterns. Plant Cell. 2010, 22: 655-671. 10.1105/tpc.109.072223.
Ozias-Akins P, van Dijk PJ: Mendelian genetics of apomixis in plants. Annu Rev Genet. 2007, 41: 509-537. 10.1146/annurev.genet.40.110405.090511.
Dujardin M, Hanna WW: Cytogenetics of double cross hybrids between Pennisetum americanum -P. purpureum amphiploids and P. americanum X Pennisetum squamulatum interspecific hybrids. Theoretical and Applied Genetics. 1984, 69: 97-100.
Dujardin M, Hanna WW: Developing apomictic pearlmillet--characterization of a BC3 plant. J Genet Breed. 1989, 43: 145-151.
Singh M, Conner JA, Zeng Y, Hanna WW, Johnson VE, Ozias-Akins P: Characterization of apomictic BC7 and BC8 pearl millet: Meiotic chromosome behavior and construction of an ASGR-carrier chromosome-specific library. Crop Science. 2010, 50: 892-902. 10.2135/cropsci2009.05.0263.
Droege M, Hill B: The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets. J Biotechnol. 2008, 136: 3-10. 10.1016/j.jbiotec.2008.03.021.
Emrich SJ, Barbazuk WB, Li L, Schnable PS: Gene discovery and annotation using LCM-454 transcriptome sequencing. Genome Res. 2007, 17: 69-73.
Jones-Rhoades MW, Borevitz JO, Preuss D: Genome-wide expression profiling of the Arabidopsis female gametophyte identifies families of small, secreted proteins. PLoS Genet. 2007, 3: 1848-1861.
Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH: Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol. 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.x.
Guo SG, Zheng Y, Joung JG, Liu SQ, Zhang ZH, Crasta OR, Sobral BW, Xu Y, Huang SW, Fei ZJ: Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types. BMC Genomics. 2010, 11: 384-10.1186/1471-2164-11-384.
Sun C, Li Y, Wu Q, Luo HM, Sun YZ, Song JY, Lui EMK, Chen SL: De novo sequencing and analysis of the American ginseng root transcriptome using a GS FLX Titanium platform to discover putative genes involved in ginsenoside biosynthesis. BMC Genomics. 2010, 11: 262-10.1186/1471-2164-11-262.
Chevreux B, Pfisterer T, Drescher B, Driesel AJ, Muller WE, Wetter T, Suhai S: Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs. Genome Res. 2004, 14: 1147-1159. 10.1101/gr.1917404.
Conesa A, Götz S, García-Gómez JM, Terol J, Manuel Talón M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21 (18): 3674-3676. 10.1093/bioinformatics/bti610.
Bluthgen N, Brand K, Cajavec B, Swat M, Herzel H, Beule D: Biological profiling of gene groups utilizing gene ontology. Genome Inform. 2005, 16 (1): 106-115.
Malik MR, Wang F, Dirpaul JM, Zhou N, Polowick PL, Ferrie AM, Krochko JE: Transcript profiling and identification of molecular markers for early microspore embryogenesis in Brassica napus. Plant Physiol. 2007, 144: 134-154. 10.1104/pp.106.092932.
Spencer MW, Casson SA, Lindsey K: Transcriptional profiling of the Arabidopsis embryo. Plant Physiol. 2007, 143: 924-94.
Hoecker N, Keller B, Muthreich N, Chollet D, Descombes P, Piepho HP, Hochholdinger F: Comparison of maize (Zea mays L.) F1-hybrid and parental inbred line primary root transcriptomes suggests organ-specific patterns of nonadditive gene expression and conserved expression trends. Genetics. 2008, 179: 1275-1283. 10.1534/genetics.108.088278.
Morozova O, Marra MA: Applications of next-generation sequencing technologies in functional genomics. Genomics. 2008, 92: 255-264. 10.1016/j.ygeno.2008.07.001.
Cheung F, Haas BJ, Goldberg SM, May GD, Xiao Y, Town CD: Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology. BMC Genomics. 2006, 7: 272-10.1186/1471-2164-7-272.
Ohtsu K, Smith MB, Emrich SJ, Borsuk LA, Zhou R, Chen T, Zhang X, Timmermans MC, Beck J, Buckner B, Janick-Buckner D, Nettleton D, Scanlon MJ, Schnable PS: Global gene expression analysis of the shoot apical meristem of maize (Zea mays L.). Plant J. 2007, 52: 391-404. 10.1111/j.1365-313X.2007.03244.x.
Barbazuk WB, Emrich SJ, Chen HD, Li L, Schnable PS: SNP discovery via 454 transcriptome sequencing. Plant J. 2007, 51: 910-918. 10.1111/j.1365-313X.2007.03193.x.
Torres TT, Metta M, Ottenwalder B, Schlotterer C: Gene expression profiling by massively parallel sequencing. Genome Res. 2008, 18: 172-177.
Bekal S, Craig JP, Hudson ME, Niblack TL, Domier LL, Lambert KN: Genomic DNA sequence comparison between two inbred soybean cyst nematode biotypes facilitated by massively parallel 454 micro-bead sequencing. Mol Genet Genomics. 2008, 279: 535-543. 10.1007/s00438-008-0331-8.
Conner JA, Goel S, Gunawan G, Cordonnier-Pratt MM, Johnson VE, Liang C, Wang H, Pratt LH, Mullet JE, Debarry J, Yang L, Bennetzen JL, Klein PE, Ozias-Akins P: Sequence Analysis of Bacterial Artificial Chromosome Clones from the Apospory-Specific Genomic Region of Pennisetum and Cenchrus. Plant Physiol. 2008, 147 (3): 1396-1411. 10.1104/pp.108.119081.
Young BA, Sherwood RT, Bashaw EC: Cleared-pistil and thicksectioning techniques for detecting aposporous apomixis in grasses. Can J Bot. 1979, 57: 1668-1672. 10.1139/b79-204.
Nakazono M, Qiu F, Borsuk LA, Schnable PS: Laser-capture microdissection, a tool for the global analysis of gene expression in specific plant cell types: identification of genes expressed differentially in epidermal cells or vascular tissues of maize. Plant Cell. 2003, 15: 583-596. 10.1105/tpc.008102.
We thank Dr. Wayne Hanna for providing plant materials, Virgil Ed Johnson for bioinformatics assistance, and Evelyn P. Morgan for technical support. This work was funded by the National Science Foundation (award no. 0115911).
YZ and JC performed the sequence analysis. YZ collected and prepared cDNA samples for 454 sequencing. YZ and JC mapped transcripts and did expression analysis. JC performed the Blast2GO analysis. PO-A provided guidance for the study. All authors have read and approved the manuscript.
Yajuan Zeng, Joann Conner contributed equally to this work.
Electronic supplementary material
Additional file 1:PS26_MIRA.fasta. A fasta file containing the MIRA assembled contigs of the PS26 ovule transcriptome. (FASTA 9 MB)
Additional file 3:Table S1 - Primers designed for mapping transcripts to the ASGR-carrier chromosome. Microsoft word file: ASGR-Carrier Chromosome transcript primers.doc contains a table with primer sequences used for experiments to map ovule transcripts to the ASGR-carrier chromosome and the ASGR locus with annealing temperatures. (DOC 141 KB)
Authors’ original submitted files for images
About this article
Cite this article
Zeng, Y., Conner, J. & Ozias-Akins, P. Identification of ovule transcripts from the Apospory-Specific Genomic Region (ASGR)-carrier chromosome. BMC Genomics 12, 206 (2011). https://doi.org/10.1186/1471-2164-12-206
- Reverse Transcription Polymerase Chain Reaction
- Alien Chromosome
- Average Read Length
- Sexual Genotype
- Gametophytic Apomixis