Skip to main content

The molecular signatures of compatible and incompatible pollination in Arabidopsis



Fertilization in flowering plants depends on the early contact and acceptance of pollen grains by the receptive papilla cells of the stigma. Deciphering the specific transcriptomic response of both pollen and stigmatic cells during their interaction constitutes an important challenge to better our understanding of this cell recognition event.


Here we describe a transcriptomic analysis based on single nucleotide polymorphisms (SNPs) present in two Arabidopsis thaliana accessions, one used as female and the other as male. This strategy allowed us to distinguish 80% of transcripts according to their parental origins. We also developed a tool which predicts male/female specific expression for genes without SNP. We report an unanticipated transcriptional activity triggered in stigma upon incompatible pollination and show that following compatible interaction, components of the pattern-triggered immunity (PTI) pathway are induced on the female side.


Our work unveils the molecular signatures of compatible and incompatible pollinations both at the male and female side. We provide invaluable resource and tools to identify potential new molecular players involved in pollen-stigma interaction.


In flowering plants, the early interaction between the extremity of the female organ (stigma) and the male gametophyte (pollen grain) acts as a checkpoint for fertilization. This first step of the female-male interaction includes recognition by the female tissues of the male partner. In the Brassicaceae, sophisticated mechanisms have evolved that allow the epidermal cells of the stigma, also known as papillae, to reject self (or incompatible) pollen while accepting non-self (or compatible) pollen. These self/non-self recognition mechanisms underlie self-incompatibility and promote genetic variability in the species. Following compatible pollination, the dry pollen grain hydrates on the stigma papilla and ultimately germinates, producing a tube that penetrates the wall of the stigmatic cell and grows down to convey the male gametes towards the ovules for fertilization [1, 2]. By contrast, when a pollen grain is recognized as incompatible, it fails to hydrate properly and shows defective germination [3]. This rejection mechanism is initiated by a ligand-receptor interaction and is genetically controlled by a single polymorphic locus, called the S-locus [4]. The S-locus Cysteine Rich protein (SCR)/ S-locus protein 11 (SP11) located on the pollen surface interacts with its cognate S-locus Receptor Kinase (SRK) localized at the plasma membrane of the papilla cells [5, 6]. This interaction leads to the phosphorylation of SRK that triggers the downstream cascade leading to pollen rejection [1, 6, 7]. Cellular events triggered in the stigma papillae by these two pathways, compatible and incompatible, have started to be more clearly defined. Compatible pollination induces actin network orientation, calcium export and polarized secretion towards the pollen grain [8,9,10,11]. Incompatible pollen leads to inhibition of both actin polymerization and vesicular trafficking accompanied by a strong calcium influx within the stigmatic cell [9, 11, 12]. Stigmatic calcium fluxes were reported to involve the autoinhibited Ca2+-ATPase13 for pollen acceptance [8] and a glutamate receptor-like channel for pollen rejection [12]. In addition, the stigmatic EXO70A1 protein was identified as a factor required for polarized secretion during compatible pollination, which is negatively regulated in incompatible reaction [11, 13]. While these features are now well established, they constitute a narrow framework that limits the understanding of the whole recognition process. To obtain a global picture of the early fertilization events with no a priori, transcriptome approaches were conducted. The main goal was to draw up catalogues of genes whose expression is modulated during pollination so as to unravel the stigmatic response to compatible or incompatible pollen [8, 14,15,16,17]. However, the main drawback of these approaches was the impossibility to distinguish pollen and stigma derived transcripts. To address that issue, translatome analysis [18] has been applied to identify sex-specific genes expressed during pollination, but this strategy needed large amounts of tissues from a transgenic line expressing tagged ribosomes in pollen and fine techniques of biochemistry.

Here, inspired by a previous RNA-seq analysis [19], we developed a new experimental procedure, coupled with a bioinformatic analysis of sequencing data, to comprehensively unravel the dynamic events that occur both in the stigma and pollen grain following compatible and incompatible pollinations. We took advantage of the SNPs existing between two distinct Arabidopsis thaliana accessions, one used as female (Col-0) and the other as male (C24), to differentiate male and female transcripts based on a new statistical methodology, ASE-TIGAR [20]. This statistical tool can take all sequenced reads in account even those without SNPs, while previous study used only reads with SNPs [19]. Our analysis allowed the identification of 80% of mRNAs according to their parental origin and revealed transcriptional changes occurring specifically in either the stigma or pollen grain/tube. We report an unanticipated transcriptional activity triggered in stigma upon incompatible pollination. On the other hand, based on Gene Ontology enrichment study and pathway-based prediction of upregulated genes we show that following compatible pollination, components of the pattern-triggered immunity (PTI) pathway are induced on the female side. This work provides a key resource to identify potential new molecular players involved in pollen-stigma interaction.


Experimental setup to isolate transcripts from compatible and incompatible pollinations in A. thaliana

Early work showed that self-incompatibility can be restored in the self-fertile species A. thaliana by reintroducing a functional SRK-SCR gene pair isolated from its close self-incompatible relative A. lyrata [21, 22]. To analyze both compatible and incompatible reactions and take advantage of nucleotide polymorphisms between A. thaliana accessions, previously, we generated two transgenic lines: one in the Col-0 background expressing the SRK gene from the A. lyrata S14 haplotype (Col-0/SRK14) and the other in C24 background expressing the SCR gene from the same S-haplotype (C24/SCR14) [3]. Both lines were self-fertile but when stage 13-14E (according to [23]) Col-0/SRK14 stigmas were pollinated with C24/SCR14 pollen, a strong incompatible reaction was observed as deduced from the absence of pollen tube in the stigma, 1 hour after pollen deposition (Fig. 1a).

Fig. 1

Experimental and data analysis pipeline of SNP-based RNA-seq analysis. a Time course of sample collection. Flowers of Col-0/SRK14 were emasculated at stage 12, 16 h - 20 h before pollination with compatible (C24) or incompatible (C24/SCR14) pollen grains, respectively. Stigmas were harvested (dashed line) 0, 10, 60 min after compatible (C0, C10, C60) or incompatible (I10, I60) pollination for RNA extraction. Typical scanning electron microscope images observed 60 min after pollination, for compatible reaction (Col-0 /SRK14 x C24) with hydrated round pollen grains and pollen tubes, and incompatible reaction (Col-0 /SRK14 x C24/SCR14) with dehydrated ellipsoidal pollen grains and no pollen tube. Scale bar = 20 μm. b We performed whole genome sequencing and detected variants using GATK (McKenna et al., 2010). SNPs and short indels between Col-0/SRK14 and C24 were identified (1.). Then, we produced new reference genomes for Col-0/SRK14 and C24 introducing the identified SNPs into TAIR10 Col-0 genome (2.). After deriving predicted mRNA from the new genomes, we performed RNA-sequencing and got sex-specific isoform abundance by using a statistic tool ASE-TIGAR (Nariai et al., 2016) (3.). Read normalization and differentially expressed gene analysis were performed using DESeq2 (Love et al., 2014). Sequence quality was checked by FastQC ( DNA and RNA reads were cleaned with custom Perl scripts

Using these two Arabidopsis lines, we performed a compatible and incompatible pollination kinetics focusing on two time-points of the interaction to identify genes whose expression is modified following pollen-stigma interaction. We selected an early stage (10 min after pollen deposition), which corresponds to the start of pollen grain hydration [3], and a later stage (one hour after pollination) when pollen tubes reach the base of the stigma. We sequenced mRNAs extracted from pollinated stigmas at 10, and 60 min after compatible (Col-0/SRK14 x C24) pollination (C10, C60, respectively) and incompatible (Col-0/SRK14 x C24/SCR14) pollination (I10, I60, respectively) (Fig. 1a). We also harvested pollinated stigma right after compatible pollen deposition, as a control for both compatible and incompatible pollinations (C0).

SNP-based transcriptome analysis using variants between Col-0 and C24

To distinguish the parental origin of the transcripts, we developed a method based on the detection of small genomic variations, including SNPs and small insertions and deletions (indels) between Col-0 and C24 accessions. The pipeline includes three main steps (Fig. 1b). In the first step, variations between Col-0/SRK14 and C24 genomes were identified by whole-genome sequencing of the two strains with a read depth of 9.3 X for Col-0/SRK14 and 14.2 X for C24 (Additional file 1: Table S1). After cleaning, the reads were mapped on the public Arabidopsis genome sequence retrieved from the Arabidopsis information resource (TAIR) database (TAIR10, accession Col-0). Single-nucleotide polymorphisms (SNPs) and short insertions/deletions (indels) were identified between the mapped reads and the TAIR10 sequence. Reads from Col0/SRK14 and C24 covered 95.8 and 90.6% of the TAIR10 genome sequence, respectively. We identified 2032 variants (SNPs and indels) between Col-0/SRK14 reads and TAIR10 genome sequence, while we identified 732,767 variants between C24 reads and TAIR10 genome sequence. In a second step, we generated two new genome sequences, one for Col-0/SRK14 and one for C24, by introducing the SNPs identified in each line into the TAIR10 sequence. To simplify this step, only SNPs were used, and not indels (Fig. 1b). The two resulting genome sequences were compared by pairwise aligning their pseudomolecules, and we identified 616,781 SNPs between them. These two genome sequences were used as references for the subsequent steps of the project. We found that 27% of this polymorphism was in untranslated regions (UTRs) and coding sequences (CDSs) in Col-0/SRK14 genome and 31% in C24 genome and led to sequence variations in predicted mRNAs. We then pairwise aligned the pseudomolecules of each of the two new genomes to their TAIR10 counterparts, and used both the position and the sequence information of TAIR10 gene models to annotate Col-0/SRK14 and C24 genome sequences. We predicted 39,205 gene models in Col-0/SRK14 and 39,206 in C24. We extracted predicted mRNA sequences for each gene model from these annotated Col-0/SRK14 and C24 genome sequences to produce maternal and paternal reference transcripts. Combining both sets of predicted mRNAs, we obtained a list of 39,204 predicted gene models that were shared between maternal and paternal references. The number of predicted mRNAs with at least one SNP between Col-0/SRK14 and C24 was 31,271 among the total common predicted mRNAs (39204). This result allowed us to distinguish the origin of about 80% (31,271/39204 = 79.8%) of mRNAs with SNP-based analysis. The third step included mapping of the sequenced RNA reads and the estimation of sex-specific isoform abundance using ASE-TIGAR. The total length of raw reads from each condition was more than 6400 Mb (Additional file 1: Table S2). ASE-TIGAR uses a Bayesian approach to estimate allele-specific expression [20] and allowed us to utilize all sequenced reads even those without SNP to estimate gene expression. After obtaining the estimated read counts from ASE-TIGAR, we used DESeq2 [24] to normalize counts, for female and male transcripts, respectively (Fig. 1b).

From this large dataset, we first determined the contribution of each tissue (stigma vs. pollen) in mixed samples harvested immediately after compatible pollination (C0). We found that among the 47.0 million RNA reads, ASE-TIGAR assigned only 15% reads to genes without SNPs. These reads were distributed equally between Col-0/SRK14 and C24 genomes using the Bayesian statistic. Thereby, 69% of the total RNA reads were estimated to derive from Col-0/SRK14 (stigma), and 31% from C24 (pollen) (Fig. 2a). These proportions were stable over time (0, 10 min, 60 min) and independent of the pollination type (compatible, incompatible); this may reflect the relative abundance of stigmatic cells compared with pollen grains in our collected samples.

Fig. 2

Quality check of SNP-based RNA-seq analysis. a Estimated proportions of reads allocated to each tissue (31% pollen, 69% stigma) including reads without SNP (15%) that were equally distributed between pollen and stigma (shaded green, 7.5% + shaded purple, 7.5%). b Hexbin plot to visualize distribution of gene expression in stigma (nFPKM stigma) and pollen (nFPKM pollen) at C0. Gene count per hexagon is represented using a color gradient from light grey to orange. Genes without SNP are plotted outside the graph based on their stigma expression (nFPKM stigma)

Then, we analyzed the relative abundance of each transcript between stigma and pollen. To do so, transcript abundance was computed by ASE-TIGAR and was expressed as Fragments Per Kilo base of exon per Million reads mapped (FPKM), which account for sequencing depth and gene length. We normalized FPKM (nFPKM) by dividing the FPKM of each transcript by the ratio of the transcript counts from stigma [nFPKM (stigma)] or pollen [nFPKM (pollen)] at C0 (Fig. 2a, Additional file 2: Table S3). Values of nFPKM were displayed on a hexbin [25] plot to visualize the distribution of gene expression levels. Genes without SNP did not show any particular pattern (Fig. 2b, bottom line). Gene expression levels were widely distributed, but almost all highly expressed genes were very specific to female or to male, suggesting the existence of distinct transcript signatures between stigma and pollen (Fig. 2b).

Post-validation of the SNP-based analysis

To assess the global consistency of our datasets with already published studies, we compared the results of our SNP-based expression analysis with tissue-specific transcripts reported from microarray experiments [26,27,28]. Stigma-associated transcripts from our analysis showed the highest correlation with transcriptome from unpollinated stigmas, and only a weak correlation with transcriptomes from mature pollen or growing pollen tube (Fig. 3a). Conversely, our pollen-associated transcriptome showed a very high correlation with male transcriptomes and almost no correlation with female transcriptomes or transcriptomes from vegetative tissues (Fig. 3a). Correlations between another SNP-based analysis, which identified pistil- and pollen tube-specific transcripts 8 h after pollination [19] showed similar trends to our stigma and pollen transcripts (Fig. 3a, two rightmost columns).

Fig. 3

Validation of the SNP-based analysis. a Heat map of Pearson’s correlation coefficient between the stigma / pollen transcripts from the SNP-based analysis and transcript information from previously published data. b Number of stigma or pollen (preferentially / specifically) -expressed genes at C0 selected by nFPKM (Additional file 3: Table S4). c RT-PCR and sequence analysis of stigma or pollen specifically-expressed genes at C0. Genes are selected among the top 20 specifically-expressed genes; their rank according to their expression level are presented. RNAs were extracted from pollinated stigmas at C0 and we analyzed their SNP-information by RT-PCR and sequencing. SNP number corresponds to the number of SNPs present in the sequenced regions. d Top ten GO term enrichment categories (biological processes) of stigma or pollen preferentially-expressed genes at C0. Enrichment analysis was performed with the one thousand top expressed genes in stigma (left) or pollen (right), respectively. Selection criteria for genes analysed in c (sex-specific), and d (sex-preferential), are described in text

To have a global view of expressed genes in stigma and pollen at C0, we constructed three classes of genes from calculated nFPKM: expressed genes, sex-preferentially and sex-specifically expressed genes (see Methods for precise criteria of gene selection) (Additional file 3: Table S4). Briefly, we defined genes that showed nFPKM > 1 as expressed genes. A total of 21,335 genes were expressed in at least one condition; 16,964 and 6857 genes were expressed in stigma and pollen respectively (Fig. 3b). Genes that were expressed at least a ten-fold higher in stigma than in pollen were defined as stigma-preferentially expressed genes, and those that were at least one hundred-fold higher expressed in stigma than in pollen as stigma-specifically expressed genes (and vice versa for pollen preferentially and specifically expressed genes) (Fig. 3b, Additional file 3: Table S4). We first focused on sex-specifically expressed genes and analyzed the top 20 from stigma and pollen gene lists (Additional file 3: Table S4) using the ThaleMine database ( Heatmaps of gene expression levels based on Cheng et al., 2017 [29] were generated (Additional file 1: Fig. S1). Stigma genes were clearly excluded from pollen even though many of them were also expressed in various tissues. By contrast, most pollen genes showed expression restricted to pollen and stage 12-inflorescences, which contain mature pollen. This result is consistent with the expression pattern that we observed on the hexbin plot (Fig. 2b). Moreover, within the top 20 stigma genes, we found seven of the 11 top genes ranked by expression levels in stigma RNA-Seq and microarray datasets [30], and as the top 1 pollen gene, we found CPK34 (AT5G19360), the protein product of which is involved in pollen tube regulation [31]. Finally, to confirm the specificity of expression of these genes in stigma or pollen, we carried out RT-PCR and sequenced the cDNAs of SNP-containing regions using C0 sample as template. From the top 20 specifically expressed genes, we selected genes whose location of SNPs permitted designing of primers. As expected, based on the SNPs identified in their sequence, we found that mRNAs from stigma-specific genes came only from Col-0/SRK14 (stigma), whereas mRNAs from the pollen-specific genes emanated only from C24 (pollen) (Fig. 3c).

To further characterize the pollen vs. stigma associated transcripts, we looked for Gene Ontology (GO) term enrichment at C0 within the list of sex-preferentially expressed genes (False Discovery Rate, FDR < 0.05) [32, 33]. Although GO terms may be somewhat subjective or not fully consolidated by functional tests, with the current annotation, we observed a clear difference between the two sets of transcripts (Fig. 3d). From GO term enrichment of the top 1000 preferentially expressed genes either in stigma or pollen (Additional file 3: Table S4, Additional file 4: Table S5), our analysis revealed high enrichment of several GO terms associated with metabolism in stigmas (such as “photosynthesis”, “mitochondrial-”, and “-metabolic process”) suggesting an active metabolic state of stigmatic cells (Fig. 3d left). Conversely, GO terms on the pollen side were specific to pollen functions such as “pollen tube growth”, “pollen sperm cell differentiation” and “cell tip growth” (Fig. 3d right).

The transcriptome of unpollinated stigmas was previously reported through laser microdissection of Arabidopsis stigmatic cells [32]. Among the top 100 expressed genes in this analysis, 44 were common with the top 100 expressed genes in stigma at C0, whereas no common genes were found with the top 100 expressed genes in pollen at C0.

Altogether, these results validate our SNP-based workflow, which allows identification of female- and male-derived transcripts from a combination of tissues following pollination.

Gene expression dynamics triggered after pollination

To examine the transcriptomic response of pollen and stigma following compatible or incompatible pollination, we first performed a principal component analysis (PCA) [34, 35] using the gene expression levels of each biological replicate in all conditions (Fig. 4a).

Fig. 4

Gene expression dynamics after pollination. a PCA of stigma (left) and pollen (right) transcripts. Biological replicates (dots or triangles) in all pollination conditions were analyzed. b Venn diagrams showing the number of upregulated (FC > 2) genes in stigma (left) and pollen (right), after compatible (green and violet) or incompatible (grey) pollinations. c Gene enrichment categories of up-regulated genes after incompatible pollination in stigma according to GO terms of biological processes. Genes exclusively induced after incompatible reaction (b, left; 4 + 104) were analyzed. Only enrichment categories with significant FDR (< 0.05) are shown. d Gene enrichment categories of up-regulated genes after compatible pollination in stigma according to GO terms of biological processes. Genes exclusively induced after compatible reaction (b, left; 158 + 163 + 623) were analyzed. Only top 20 enrichment categories are shown

Expression levels in stigma and pollen were treated separately (Fig. 4a left and right, respectively). The total explained variance of the first two principal components (PC1 and PC2) is around 42% (28% + 14% respectively) for stigma and 35% (24% + 11% respectively) for pollen. The following two principal components (PC3 and PC4) are around 19% (10% + 9% respectively) for stigma and 15% (8% + 7% respectively) for pollen. Comparing the PCA of stigma and of pollen transcripts suggests different dynamics in each tissue. On the stigma side, the PCA shows that both compatible and incompatible pollinations triggered a transcriptional response, as along the PC1 axis, capturing almost 30% of the explained variance, samples are temporally sorted from 0 min to 60 min, thus suggesting that PC1 can be interpreted as a time axis. For the second axis, explaining 14% of the observed variability, compatible samples are located in the lower part whereas incompatible samples are located in the upper part, close to those of the starting point (C0). This seems to indicate that this axis is oriented by the compatible/incompatible effect. Meanwhile, on the pollen side, all the samples except C60 cluster together. Again, PC1 seems to capture the temporality of the compatible response, however we cannot conclude that PC2 is related to the compatible response effect since early compatible and incompatible clusters overlap on this axis. The clear response pattern of pollen in C60 samples is consistent with the massive changes displayed by compatible pollen after 1 h, which hydrated and germinated a pollen tube growing in the stigmatic tissue.

To get a quantitative view of the number of genes whose expression was modified during the course of pollination, we performed a differentially expressed gene (DEG) analysis (FC, Fold Change more than 2 between one condition and the zero-time point, padj < 0.1; Additional file 2: Table S3). We found more genes that were upregulated in at least one condition than downregulated (Additional file 1: Table S6). Then, we used a Venn diagram representation only with upregulated genes (FC > 2, padj < 0.1) to access the dynamics of gene expression upon pollination (Fig. 4b, Additional file 5: Table S7). The Venn diagram of upregulated genes in stigma (Fig. 4b left) showed an induction of gene expression in incompatible reactions, with 45 genes at I10 (corresponding to 0.2% (45/21335) of the total expressed genes) and 344 genes at I60 (1.6%). By comparison, a massive and progressive change of gene expression, following compatible pollination was detected, with 414 upregulated genes at C10 (2%) and 1038 (4.9%) at C60 (Fig. 4b left). On the pollen side (Fig. 4b right), very few genes showed altered expression, except in condition C60, where 551 genes were upregulated (2.6%). This correlates well with the dynamics of gene expression detected by PCA (Fig. 4a). Altogether, our data show a molecular reprogramming in stigma in response to both incompatible and compatible pollination, although more moderate in incompatible stigma. Meanwhile, compatible pollen grains reactivate their transcriptomic activity following germination and pollen tube growth in papilla cells, whereas the incompatible pollen remains almost quiescent.

Pattern-triggered immunity (PTI) pathway induced after compatible pollination in stigma

To investigate the molecular events occurring during pollination, we applied a GO term enrichment analysis to genes that were exclusively upregulated in stigma after incompatible (104 + 4 = 108) and compatible (158 + 163 + 623 = 944) pollination (Fig. 4b left). In the case of the incompatible reaction, we found only 9 GO terms with high confidence (FDR < 0.05) (Additional file 6: Table S8, Fig. 4c). Nevertheless, it is worth noting that the two upmost enriched GO categories are related to water or fluid movements (water transport (GO:0006833), fluid transport (GO:0042044)). As Pollen hydration is highly controlled at the surface of Brassicaceae stigma [3], we may assume that these GO term categories reflect the regulation of water fluxes from the stigmatic cells towards the incompatible pollen.

At the opposite, after compatible pollination, there were 126 terms with high confidence (FDR < 0.05) among them, 85 had a very high confidence (FDR < 0.005) (Additional file 6: Table S8). These GO terms and enrichment were dramatically different from those preferentially expressed at C0 (Fig. 3d left, Fig. 4d). Among the top 20 upmost enriched categories, we found many GO terms related to defense response (defense response to insect (GO:0002213), response to chitin (GO:0010200), response to molecule of bacterial origin (GO:0002237), plant-type hypersensitive response (GO:0009626), host programmed cell death induced by symbiont (GO:0034050). Besides, we also found several GO categories related to wounding (regulation of jasmonic acid mediated signaling pathway (GO:2000022), response to wounding (GO:0009611), ethylene-activated signaling pathway (GO:0009873), response to jasmonic acid (GO:0009753)). The abundance of these GO term categories is likely to reflect the response of stigmatic cells to damages/stresses caused by the pollen tube while growing in the female tissues.

The rapid and global transcriptomic changes in stigma after compatible pollination motivated us to search for molecular pathways activated. To predict such pathways, we mapped the 944 upregulated genes to KEGG pathways [35]. We found many metabolism-related pathways, signaling pathways such as hormone-signaling, plant-pathogen interaction and MAPK pathways. Interestingly, the plant-pathogen interaction pathway known as PTI induced by bacterial flg22 and EF-Tu [36] was predicted with 5 sequentially upregulated genes (SERK4, MEKK1, MKK4, MKK6, WRKY22, with FC between 2.2 and 2.9) in compatible situation but not in incompatible situation (Fig. 5, Additional file 7: Table S9), and among them, two genes (MKK6 and WRKY22) were rapidly induced from 10 min. One gene of this pathway, the receptor-like kinase EF-Tu Receptor (EFR) involved in bacterial pathogen associated molecular patterns (PAMP) was upregulated at I60 (FC = 2.2, padj = 1E-06). We cannot rule out that this might be biologically relevant in incompatible response. It is worth noting that SERK4, BAK1, EFR, FLS2, CERK1, all genes known to encode receptor-like kinases involved in plant-pathogen responses, were expressed preferentially in stigma at C0 (Additional file 3: Table S4), and among them, SERK4 was upregulated at C60 (FC = 2.3) (Fig. 5, Additional file 7: Table S9).

Fig. 5

PTI pathway predicted to be induced in stigma after KEGG pathway mapping analysis. PTI pathway modified according to the information from Bigeard et al. 2015, Bi et al. 2018, and Lian et al. 2018. Genes in green boxes were upregulated (FC > 2) after compatible pollination, at 10 min and 60 min (light green), or only at 60 min (dark green: genes with SNPs, hatched dark green: genes without SNP but predicted stigma specific with ECM, see Supplementary file 8 Table S10). Gene in grey box was upregulated (FC > 2) 60 min after incompatible pollination. White boxes mean genes without SNPs (with no ECM prediction), not induced, or induced after both (compatible/incompatible) pollination (see Supplementary file 7 Table S9). Genes in bold were expressed at C0; nFPKM (stigma) > 1. RLCKs = Receptor-Like Cytoplasmic Kinases

To predict tissue specificity of genes without SNP, we developed a method based on gene expression correlation using the ATTED-II database ( [16, 17, 37] (see methods). Using this Expression Correlation Method (ECM), additional genes were predicted to be stigma specific (Additional file 8: Table S10). Interestingly, MPK3 and WRKY33 two hallmarks of the PTI pathway, were predicted to be stigma specific and were highly induced following compatible (FC = 6.3 and 10.0, respectively) compared with incompatible (FC = 1.5 and 2.8, respectively) reaction (Additional file 7: Table S9). Altogether, these analyses indicate that the PTI pathway is rapidly induced in stigma only after compatible pollination.

Selectively upregulated genes after pollination in stigma and in pollen

To identify new molecular players involved in pollen-stigma interaction, we strengthened our criteria to reduce potential candidates by retaining only genes exclusively upregulated after pollination (108 genes in incompatible situation, 944 genes in compatible situation; Venn diagram Fig. 4b, Additional file 5 Table S7) and sorted according to their fold change ratio between both pollination conditions (see methods for precise criteria). With these criteria, almost no gene were selectively induced 10 min after pollination and led us to focus on the 60-min pollination time (Fig. 6, Additional file 9: Table S11). In response to compatible pollen, we found five genes related to pathogen defense processes in the top 10 stigma genes (see discussion) which is in accordance with our KEGG pathway analysis. On the pollen side, several induced genes were already reported as pollen grain and/or pollen tube expressed genes (see discussion), underlining the reliability of our approach in discriminating pollen versus stigma transcripts. In the incompatible pollination, the uppermost upregulated genes in stigma and pollen were clearly distinct from those induced following pollen acceptance (see discussion), confirming the specific transcriptional changes linked to compatibility and incompatibility responses.

Fig. 6

Selectively induced genes after pollination. From gene lists generated using a Venn diagram representation (Fig. 4b) we identified genes selectively induced 60 min after pollination in stigma (left) and in pollen (right) with criteria applied to the FC ratio between compatible and incompatible situations. The uppermost selectively induced genes sorted by FC ratio from the largest to the smallest (up to 10) are shown. * genes described in the discussion section


Novel pipeline to analyse sex-specific transcriptome

To decipher the mechanisms that control reproduction in flowering plants, we extracted the global transcriptomic signatures associated with both pollen and stigmatic cells following their interaction during compatible or incompatible pollination. Although transcriptomes of pollinated stigmas have been recently published [16, 17], these analyses did not permit the distinction between pollen and stigma transcripts. In the present work, we provide a comprehensive transcriptomic dataset for stigma-pollen interactions using a novel SNP-based analysis. We took advantage of a recently developed statistical tool called ASE-TIGAR, which is based on a Bayesian approach to estimate allele-specific expression in diploid cells [20]. Nariai et al. showed an accurate estimation of gene expression from RNA-seq with ASE-TIGAR and identified some autosomal genes as allele-specific genes in a human reference lymphoblastoid cell line. We applied this new tool to discriminate transcripts from female-male mixed tissues during the first steps of the fertilization process. Our experimental design and bioinformatic pipeline allowed us to unveil the transcriptomic response of the pollen/pollen tube and that of the stigma following compatible or incompatible pollination at two early (10 min and 60 min) time points of the male-female interaction (Fig. 1). It is worth noting that another SNP-based pipeline has recently been published [19], identifying transcripts derived from the pistil and pollen tubes collected 8 h after compatible pollination. Although we cannot compare the two analyses directly, as pollination time points were different and only compatibility was studied by Leydon et al., both analyses succeeded in distinguishing male and female transcripts as presented in Fig. 3. However, while Leydon et al. used 12% of reads that had SNPs among the total sequenced reads, in our analysis we took all the RNA-seq reads into account, including those without SNP [19, 20]. This difference in the approaches has likely contributed to making our analysis more exhaustive, as exemplified by the remarkable segregation we found between male transcripts and those derived from female and vegetative tissues (Fig. 3a). In addition, the new expression correlation method (ECM) we developed here, allowed us to predict sex-specificity for some genes without SNP. The sex preferentially- or specific-expressed gene lists we drew up contain many of the already identified female or male specific genes and reported GO term enrichment [8, 11, 13, 30, 32] (Fig. 3d). Together with the correlation analysis (Fig. 3a) and experimental evaluations (Fig. 3c), we can conclude that our female- / male-transcripts globally represent stigma−/pollen-transcripts.

Global molecular dynamics after compatible and incompatible pollination

While the Venn diagram (Fig. 4b) shown that transcriptional activity occurs after pollination, the percentage of upregulated genes remains however low, varying from 0.2 to 4.9%. This is in accordance with another transcriptomic analysis in Arabidopsis [16], which reported only a few percent of difference between transcripts before and 60 min after pollination for both compatible and incompatible interactions.

Even though our data are consistent with other transcriptomic analyses of pollinated stigmas [8, 16, 32], the clear distinction we made between male and female transcripts also reveals unanticipated characteristics (Figs. 3 and 4). The Venn diagram suggested a rapid transcriptomic response in the stigma for compatible (with 231 genes exclusively induced at C10) but not for incompatible pollination (with only 4 exclusively upregulated genes at I10) (Fig. 4b left). This suggests that key molecules required for the incompatibility reaction are likely to be already present in the stigma. This may be consistent with the observation that the pollen rejection response in A. thaliana lines exhibiting a restored incompatibility system is extremely rapid and occurs within minutes [3, 12].

While no cellular changes were detectable by SEM on stigmas 1 hour after incompatible pollination (Fig. 1a), we identified more than one hundred genes exclusively upregulated in stigmas at I60 (104 genes, Fig. 4b left). This demonstrates that the stigma undergoes molecular changes following incompatible pollination. As the incompatibility reaction is maintained for several days, at least as long as SRK is properly expressed in the stigma, we may propose that among the stigma upregulated genes there are key factors required for proper development of the SI response and its maintenance over time.

Plant responses to pollen and pathogens share conserved molecular mechanisms

The similarity between the stigma-pollen interaction and plant-pathogen interaction has been discussed and has drawn researchers’ attention for a long time [10, 38,39,40]. Several transcriptomic analyses of pollinated stigmas/pistils reported enrichment in transcripts linked to stress or defense response in female tissues [19, 27, 40,41,42] and it was suggested that the self-incompatible response shared common mechanism with plant-pathogen interaction as it is a response to block the invader. Based on a time course transcriptome analysis of compatible and incompatible reactions in Brassica napus, Zhang and collaborators speculated that both compatible and incompatible reactions had close parallels with plant-pathogen interactions. Although we found some upregulated defense related genes in the incompatible reaction (see below), KEGG analysis indicates that components of PTI pathway were induced only during compatible response. (Fig. 5, Additional file 7: Table S9). This result was not completely unexpected since pollen tube growth resembles pathogen attack from several angles, for example invasive growth of an external organism, cell wall digestion, and calcium burst [8, 43, 44]. However, it is unclear why PTI pathway, whose function is to block the pathogen development, is induced in the stigma while pollen tube growth is promoted. We may suppose that components of this PTI pathway participate in the acceptance of pollen grains. Another alternative would be that penetration of the pollen tube in the papilla cell wall provide a route of entry for bacterial and fungal pathogens and hence, the PTI response would prevent infection by pathogens during compatible pollination.

Upregulated genes induced after SC response in pollen and stigma

Among the top 10 genes selectively upregulated 1 hour after compatible pollination in stigma (Fig. 6, Additional file 9: Table S11), we found the two closely related genes, AT1G56240 and AT1G56250 encode proteins containing a lectin domain, known to bind specific carbohydrate structures [45] and a F-box domain. F-box proteins have been reported to be components of the SCF (SKP1-CUL1-F-box protein) ubiquitin ligase complex, where they mediate polyubiquitination of target proteins for their subsequent proteasome-dependent degradation [46]. The three others upregulated genes related to plant defense, encode a product with a Toll/interleukin-1 receptor (TIR) domain (AT2G20142, AT3G04220, AT5G41740, Fig. 6, Additional file 9: Table S11). The Arabidopsis genome contains 135 genes encoding proteins with a TIR domain that could be associated with others motifs, including nucleotide-binding site (NBS) and leucine-rich repeat (LRR), [47]. Association of TIR-NBS-LRR domains is found in several disease resistance (R) protein involved in pathogen recognition [48] and, one R-genes is upregulated in response to pollen recognition (AT5G41740).

The highest upregulated gene in pollen, selectively induced 60 min after compatible pollination (Fig. 6, Additional file 9: Table S11), with a very high fold change induction compared with incompatible pollination (> 100 fold) is AT5G35380. The encoded protein belongs to the Receptor-Like Cytoplasmic protein Kinase family (RLCK), whose members contain a domain with predicted kinase activity but lack an extracellular domain. Several RLCKs have been described as required for successful sexual reproduction and function downstream of kinase receptors [49]. Upstream of the kinase domain, the RLCK AT5G35380 contains a domain with similarity to the bacterial Universal Stress Protein A (UPSA). In bacteria, UPSA is implicated in coordinating the metabolic transition observed when bacterial cells are growing on minimal medium with glucose as the sole carbon source [50,51,52]. UPSA domains have been found in 44 A. thaliana proteins but their functions in plant cells remain enigmatic [53]. We may postulate that increased levels of AT5G35380 during pollen germination and early pollen tube growth could be involved in the metabolic transition from a quiescent desiccated pollen grain towards an active germinating grain and growing pollen tube in coordination with an as-yet uncharacterized kinase receptor.

Among the others pollen genes, we found the transcription factor AtMYB120 (AT5G55020) that, together with two closely related MYBs (AtMYB97 and AtMYB101), is required for pollen tube growth arrest and sperm cell release [54]. These three MYB factors regulate the expression of a set of pollen tube genes [19, 54] among them, one plant-specific antimicrobial peptides from the thionin family (AT5G26673) and one Subtilisin-like protease (AT1G66220), which are both present in our top 10 list. We also found increased expression of the K+-transporter /proton exchanger gene CHX21 (AT2G31910), the corresponding protein localizes to the endoplasmic reticulum of pollen tubes and is involved in the reorientation of pollen tube growth towards the ovules (Fig. 6, Additional file 9: Table S11) [55]. Altogether, our data suggest that genes described as involved in late stage of pollen tube guidance within the pistil tissues, may also have roles in earlier stages of male-female interaction and could function in guiding the pollen tube throughout its journey in the stigma/style.

Upregulated genes in pollen and stigma following SI response

Astonishingly, the two highest upregulated genes in the pollen and stigma after 1 hour of incompatible pollination belong to the same family of genes and encode F-box/Kelch repeat proteins (Fig. 6, Additional file 9: Table S11). F-box/Kelch repeat proteins are involved in various processes such as response to biotic [56,57,58] and abiotic stress [59, 60] hormone signaling [61], metabolic pathways [62,63,64,65], cell expansion [66], and circadian clock regulation [67,68,69]. The F-box/Kelch repeat protein gene KFB50/KMD4 (AT3G59940), has three homologs designated KMD1 (AT1G80440), KMD2 (AT1G15670) and KMD3 (AT2G44130) [61]. All four encoded proteins have redundant functions in negatively regulating cytokinin response by degrading type-B Arabidopsis response regulators (ARRs) [61]. Interestingly, analysis of triple mutants for the three cytokinin receptors (CRE1, AHK2 and AHK3) revealed that cytokinin perception is essential for pollen function as cre1 ahk2 ahk3 pollen, while capable of germinating in vitro, cannot germinate on wild-type stigmas [70]. Based on these data, we may suggest that the increased expression of KFB50/KMD4 in incompatible pollen grains impairs cytokinin response and hence inhibits germination. How the signaling cascade initiated in the papilla cell following SCR-SRK interaction may induce changes in KFB50/KMD4 expression in the pollen remains yet elusive. In the stigma, the F-box/Kelch repeat protein gene AT4G39580 is strongly induced (FC ~ 7) following SI reaction (Additional file 9: Table S11). In Brassica sp. and A. lyrata, SI response in the stigma has been associated with the activation of a U-box/ARM-repeat E3 ligase (ARC1) that is believed to degrade compatibility factors required for pollen germination and tube growth [71]. Although A. thaliana does not have a functional ARC1 gene [72], co-expression of B. napus ARC1 or A. lyrata ARC1 along with A. lyrata SCR-SRK gene pair confers a strong SI response in transgenic A. thaliana [13]. Our work shows that SCR14-SRK14 interaction can induce a strong SI response in A. thaliana without the need of co-expressing ARC1 [3]. We might suggest that the F-box/Kelch repeat protein AT4G39580 could substitute for ARC1 in the proteasome degradation pathway leading to self-pollen rejection.

The two other uppermost upregulated genes in stigmas are the cysteine-rich receptor-like protein kinase genes CRK41 (AT4G00970) and CRK31 (AT4G11470) (Fig. 6, Additional file 9: Table S11). In Arabidopsis, CRKs form a large family of 44 members that are involved in a broad variety of biological processes including plant growth and development as well as plant response to biotic and abiotic stresses [73]. Following activation of the receptor-like protein kinase FLAGELLIN-SENSING2 (FLS2) by its ligand flagellin (flg22), CRK28 was recently shown to physically interact with another receptor-like kinase (RLK), the BRASSINOSTEROID INSENSITIVE 1-associated receptor kinase1 (BAK1) and to associate with the FLS2-BAK1 immune complex [74]. In addition, CRK28 was also found to bind to the closely related CRK29. As abundance of these two CRKs increased after flg22 perception and that CRK28 overexpression enhanced disease resistance to Pseudomonas syringae, it is tempting to speculate that CRK41 and CRK31 might similarly interact with the activated SRK following SCR binding to regulate the SI signaling cascade. Among the other upregulated genes in the stigma, PIRL2 (AT3G26500) protein is involved in gametophyte development [75], FLOT1 (AT5G25250) in a clathrin-independent endocytosis pathway [76], the degradation of which is enhanced following flg22 elicitation [77], ROH1 (AT1G63930) in secretion of seed coat pectin and exocyst function [78], QQS (AT3G30720) in plant defense [79], and MLO12 (AT2G39200) in defense response to fungi as a susceptibility factor to powdery mildew pathogens [80] (Fig. 6, Additional file 9: Table S11). The functions of these SI upregulated genes are mostly related to cell signaling pathways and plant response to stress, and for some of them to cellular mechanisms that have already been described to operate in SI such as RLK signaling, proteasome-dependent degradation of proteins, exocyst-dependent secretion and endocytosis [81]. Internalization of SRK was reported in Brassica oleracea following self-pollination and this was associated with SRK degradation, suggesting that initiation of the signaling cascade occurs at the plasma membrane and that endocytosis of the activated receptor is involved in the downregulation of SI pathway [82]. Recently, clathrin-mediated endocytosis was shown to be unnecessary for SRK signaling [83]. Linked with this latter work, it is of particular interest to note an increased level of FLOT1 expression in our transcriptomic analysis. Indeed, FLOT1 protein is required for Sterol-Dependent Endocytosis (SDE) of membrane-associated proteins, which is another mode of endocytosis that works independently of clathrin [84]; besides, FLOT1 is degraded following internalization [77]. All together, we might hypothesize that endocytosis of the activated SCR-SRK complex would depend on SDE, and that neo-synthesis of FLOT1 would be necessary to fine-tune the SI response.


During reproduction, the mixture of parental transcripts makes challenging the characterization of male- and female-specific patterns of gene expression. Our work, based on SNP-based RNA-seq analysis coupled with the statistical methodology ASE-TIGAR, allows us to distinguish the parental origin of 80% of transcripts and provides an accurate view of dynamic transcriptional changes occurring both in pollen and stigma following compatible and incompatible pollinations. We report hundreds of genes whose expression is upregulated during the compatibility response. Remarkably, KEGG prediction identifies components of PTI pathways that are activated in the stigma upon compatible pollination. We speculate that this defense response may contribute to the signaling cascade leading to pollen acceptance or, alternatively, may protect the pistil, which is invaded by hundreds of pollen tubes, from concomitant pathogen attacks. We also report the upregulation of a hundred genes in the stigma following self-pollen rejection, that are likely to participate in the regulation or maintenance of SI over time. These genes may have retained their functionality in A. thaliana, which is in agreement with the estimation of recent transition to selfing in this species [85, 86]. Overall, our work provides substantial resources and innovative tools to identify novel molecular players in the fertilization process.


Plant material and growth condition

Arabidopsis thaliana C24 (N22680) and Col-0 (N22681) seeds were obtained from NASC [87]. We generated transgenic plants like previously reported [3]. AlSRK14 construct was introduced in Col-0 (Col-0/SRK14). AlSCR14 construct was introduced in C24 (C24/SCR14). All plants were grown in growth chambers under long-day cycles (16 h light/8 h dark at 21 °C/19 °C). Transgenic seeds are available upon request.

Environmental scanning electron microscopy (SEM)

Flowers from stages 12 were emasculated and pollinated on plants with mature pollen. One hour after pollination, pistils were cut in the middle of the ovary, deposited on a SEM platform and observed under Hirox SEM SH-3000 at − 20 °C, with an accelerating voltage of 10 kV. Images were processed with ImageJ software 1.53b version.

Genomic DNA and mRNA preparation

For genomic DNA extraction, about 2 mL volume young inflorescences of Col-0/SRK14 and C24 were harvested, respectively. After grinding the material in liquid nitrogen, DNA was extracted as described previously [88]. For RNA extraction, late stage 12 [23] Col-0/SRK14 flowers were emasculated and 16-20 h after emasculation stigmas were pollinated with compatible (C24) or incompatible (C24/SCR14) pollen. 0, 10 or 60 min after compatible pollination (C0, C10 and C60) or incompatible pollination (I10, I60), 50 stigmas were harvested manually using fine tweezers, then frozen in liquid nitrogen and stored at − 80 °C until further processing. After grinding the material in liquid nitrogen, RNA was extracted and purified by using Arcturus® PicoPure® RNA Isolation Kit (Applied Biosystems / Thermo Fisher Scientific) following the manufacturer instruction except that we added a DNAse treatment (Qiagen, catalog#79254). Five biological replicates of RNA at each point were prepared, then four replicates were selected for sequencing based on their quality and quantity.

Whole genome sequencing and variant calling

Library preparation and whole genome sequencing of Col-0/SRK14 and C24 were performed by HELIXIO (Clermont-Ferrand, France; with TruSeq® DNA PCR-free Library Preparation kit (Illumina) and NextSeq500 platform (Illumina) applying paired-end sequencing (2 × 150 bp) (Additional file 1: Table S1). We then proceeded to the analysis of the sequencing data using GATK3.5 after quality check using FastQC (, trimming and pairing of the resulting reads using custom Perl scripts. We aligned the reads to the Col-0 public genome (TAIR10 public genome refers to the TAIR9 genome sequence and the TAIR10 annotation for Arabidopsis Col-0, which are both available from For simplicity, we refer to them as simply TAIR10 because no genome sequence changes were made between the TAIR9 and TAIR10 annotation releases) with BWA [89] applied GATK [90] base quality score recalibration, indel realignment, duplicate removal, and performed SNPs and indels discovery across all samples according to GATK Best Practices recommendations [91, 92]. After checking the depth of coverage of two samples by using GATK Depth Of Coverage (Additional file 1: Fig. S2), we performed filtering applying depth 3 for Col-0/SRK14 and depth 6 for C24 and homozygous for both, then obtained the complete set of variants for each genome as VCF files.

Production of new reference genomes

Col-0/SRK14 and C24 genome sequences were derived from TAIR10 genome, by introducing the called variants (only SNPs and not indels) in the sequence, by using GATK Fasta Alternate Reference Maker. Gene annotations from TAIR10 genome were transferred onto the two new genome sequences using RATT [93] in “Strain” mode (a gene is annotated if it exhibits at least 95% similarity with the gene model) and seqret from EMBOSS suite [94]. To characterize polymorphism (only SNPs) between obtained Col-0/SRK14 and C24 genome sequences, inside and outside predicted genes, we aligned chromosome sequences using LAST (v. 938) [95], after training LAST on chromosomes 1.

RNA sequencing and expression analysis

Library preparation and RNA sequencing were performed by HELIXIO with TruSeq® Standard mRNA sample Preparation kit (Illumina) and NextSeq500 platform (Illumina) applying paired-end sequencing (2 × 75 bp). We then proceeded to the analysis of the sequencing data using ASE-TIGAR after quality check by FastQC, trimming and pairing of the resulting reads using custom Perl scripts. First, we derived mRNA reference sequences from new reference genomes and merged them in one FASTA file, then mapped the clean reads on the mRNA reference with Bowtie2 [96]. We then run ASE-TIGAR with SAM files produced by mapping and the mRNA reference ( [20].

Raw sequencing data is available on NCBI database under the SRA accession number SRP154565 (

Differential gene expression analysis and calculation of nFPKM

Differential expression analysis on the whole transcriptome was performed using DESeq2 [24]. After performing principal component analysis (PCA) for all biological replicates at each condition, we removed one replicate at I10 from female samples and one at I60 from male samples, as they were isolated from the other replicates. All the analyses, including the presented PCA, were performed with the sample set after the removal.

Normalized FPKM (nFPKM) was calculated by dividing stigma- or pollen-FPKM by female or male transcript proportion at each condition (C0, C10, C60, I10, I60).

Comparison with published datasets

To check the consistency of our RNA-seq data with previously published microarray data [26,27,28], for each pair of condition, we computed a Pearson correlation coefficient (r) over all genes with SNPs from our variant analysis, using mean of estimated read counts from three or four biological replicates for RNA-seq and mean absolute intensity for multiple microarray replicates. We also computed Pearson correlation coefficients (r) between previously published RNA-seq data [19] and the microarray data sets. Then, the heat map was drawn using an original R-script (R version 3.4.2) and libraries Hmisc, reshape2 and ggplot2.

Criteria to select expressed genes, sex–preferentially or specifically expressed genes

Among genes with SNPs between Col-0 and C24/SRK14, we defined expressed genes, sex-preferentially or sex-specifically expressed genes at C0 (Fig. 3b, Additional file 3: Table S4), with criteria applied to nFPKM. We defined stigma-expressed genes as [nFPKM (stigma) > 1] and stigma-preferentially expressed genes as [nFPKM (stigma) > 1 ∩ nFPKM (stigma) ≥ 10 × nFPKM (pollen)]. Stigma-specifically expressed genes were selected among stigma-preferentially expressed genes applying more stringent criteria. Stigma-specifically expressed genes are defined as [stigma preferentially expressed genes ∩ nFPKM (pollen) ≤ 1 ∩ nFPKM (stigma) > 100 ∩ nFPKM (stigma) > 100 x nFPKM (pollen)], then sorted by nFPKM (stigma) from the largest to the smallest. Vice-versa for pollen (preferentially / specifically) expressed genes. The same criteria were applied for other pollination time-points (Additional file 8: Table S10).

GO term and enrichment analysis and pathway analysis

GO and gene enrichment analyses were performed using Gene Ontology Consortium website ( and PANTHER 13.1 software [97, 98]. We used “GO biological process complete” for the classification. GO terms with FDR < 0.05 were considered as significantly enriched.

Pathway analysis was performed using and KEGG PATHWAY Database using KEGG Mapper software ( [35].

RT-PCR and sequencing

For experimental evaluation of the SNP-based analysis with reverse transcription polymerase chain reaction (RT-PCR) and sequencing, we selected genes among the top-20 specifically expressed genes that have SNPs at suitable positions allowing discrimination of parental origin (stigma vs. pollen) (Fig. 3c). RNA at C0 was purified as described above. cDNA was generated using SuperScript® VILO™ cDNA Synthesis Kit (Invitrogen). Primers were designed by Primer3web ( [99] at the flanking region of SNP including sites. PCR was performed with GoTaq (Promega). PCR products were purified using PCR clean-up Gel extraction kit (Macherey-Nagel) and then sequenced.

Criteria to select induced genes in stigma and pollen after compatible/incompatible responses, based on FC

Among genes with SNPs between Col-0 and C24/SRK14, we selected genes which are induced after compatible and incompatible pollination in stigma or in pollen respectively with criteria applied to FC. We selected induced genes 10 min after compatible reaction in stigma as [FC_Col-0/SRK14_C10/C0 > 2] with padj < 0.1. We proceeded in the same way for 60 min time-point, for incompatible situation and for pollen (Additional file 5: Table S7).

Genes selectively upregulated in stigma after compatible pollination were selected among those exclusively upregulated after compatible pollination (Venn diagram Fig. 4b), applying criteria on FC Col-0/SRK14_C60/I60 > 2, with padj < 0.1, then sorted by FC_Col-0/SRK14_C60/I60 from the largest to the smallest. Vice-versa for incompatible situation and for pollen (Additional file 9: Table S11).

Prediction of sex-specificity for genes without SNP between Col-0/SRK14 and C24

To infer sex-specificity of genes without SNP between Col-0/SRK14 and C24 but with high expression levels in our RNA-seq experiment, we used the database ATTED-II ( [37] to select genes with correlated expression patterns. Only genes with nFPKM (stigma+pollen) ≥ 350 were considered for specificity prediction, as we showed that most of these genes were sex-specific (Fig. 2b). Based on the genes having SNPs between the two accessions, this subset contains 86.3% sex-preferentially- or sex-specifically-expressed genes at C0. For each target gene, we retrieved its ATTED-II record from either the Ath-m v7.1 dataset (based on 16,033 microarray hybridizations, [37]) or, if not available, the Ath-r dataset v3.0 (based on 2120 RNA-seq experiments). Based on the mutual rank, we kept the 50 most highly correlated genes having polymorphism between Col-0/SRK14 and C24, and we looked at their sex-specificity in our dataset. Using the 505 target genes with SNPs, we determined simple ad-hoc criteria to predict sex-specificity: target genes with expression pattern correlated with more than 35 stigma-specific genes were considered stigma-specific; target genes correlated with more than 25 pollen-specific genes were considered pollen-specific. With these thresholds, the specificity prediction was correct for 96.7% of stigma-specific genes and 97.8% of pollen-specific genes; 93 genes out of 505 remained without prediction and the prediction was wrong in only 12 cases (Additional file 8: Table S10). We applied the same criteria to the 161 target genes with no SNPs and nFPKM (stigma+pollen) ≥ 350, and we were able to predict stigma-specificity for 129 of them and pollen specificity for 28 genes (Additional file 8 Table S10). We named this method “Expression Correlation Method (ECM)”.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding authors on reasonable request.







S-locus Cysteine Rich protein


S-locus protein 11


S-locus Receptor Kinase


Adenosine triphosphatase


Exocyst complex component 70 A1


Ribonucleic acid


Deoxyribonucleic acid


Single nucleotide polymorphism


Allele-specific expression-transcript isoform abundance estimation with gapped alignment of RNA-seq data


Pattern-triggered immunity


Small insertions and deletions


The Arabidopsis information resource


Untranslated region


Coding sequences


Differential expression analysis for sequence count data


Fragments per kilo base of transcript per million reads mapped


Normalized FPKM


Hexagonal binning


Calcium-dependent protein kinase 34


Reverse transcriptase-polymerase chain reaction


False discovery rate


Gene ontology


Principal component analysis


Principal components


Differentially expressed gene analysis


Fold change


Adjusted p-value


Somatic embryogenesis receptor kinase


Mitogen-activated protein kinase kinase kinase


Mitogen-activated protein kinase kinase


WRKYGQK containing transcription factor


Receptor-like kinase


Elongation factor thermo unstable


EF-Tu receptor


Bacterial pathogen associated molecular patterns


Brassinosteroid insensitive 1-associated receptor kinase 1




Flagellin-sensing 2


Chitin elicitor receptor kinase 1

ATTED-II database:

Arabidopsis thaliana trans-factor and cis-element prediction database


Expression correlation method


Mitogen-activated protein kinase


Kyoto encyclopedia of genes and genomes


Scanning electron microscopy


SKP1-CUL1-F-box protein


Toll/interleukin-1 receptor


Nucleotide-binding site


Leucine-rich repeat


Receptor-like cytoplasmic protein kinase


Universal stress protein A



CHX21 :

Cation/H+ exchanger 21


Kelch repeat F-box


Kiss me deadly


Type-B Arabidopsis response regulators


Cytokinin receptors 1


Histidine kinase homologs


Armadillo repeat containing 1


Cysteine-rich receptor-like protein kinase


Plant intracellular ras-group-related LRR 2


Flotillin-like protein 1


Czech roh meaning corner 1


Qua-quine starch


Barley mildew resistance locus o


Sterol-dependent endocytosis


Nottingham Arabidopsis stock centre


Genome analysis toolkit


Fast quality control




Variant call format


National center for biotechnology information


Rapid annotation transfer tool


European molecular biology open software suite


Genome-scale sequence comparison


Sequence alignment map


  1. 1.

    Doucet J, Lee HK, Goring DR. Pollen acceptance or rejection: a tale of two pathways. Trends Plant Sci. 2016;21:1058–67.

    CAS  PubMed  Article  Google Scholar 

  2. 2.

    Mizuta Y, Higashiyama T. Chemical signaling for pollen tube guidance at a glance. J Cell Sci. 2018;131:jcs208447.

    PubMed  Article  CAS  Google Scholar 

  3. 3.

    Rozier F, Riglet L, Kodera C, Bayle V, Durand E, Schnabel J, et al. Live-cell imaging of early events following pollen perception in self-incompatible Arabidopsis thaliana. J Exp Bot. 2020;71:2513–26.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  4. 4.

    Bateman AJ. Self-incompatibility systems in angiosperms: III. Cruciferae. Heredity. 1955;9:53–68.

    Article  Google Scholar 

  5. 5.

    Kachroo A, Schopfer CR, Nasrallah ME, Nasrallah JB. Allele-specific receptor-ligand interactions in brassica self-incompatibility. Science. 2001;293:1824–6.

    CAS  PubMed  Article  Google Scholar 

  6. 6.

    Takayama S, Shimosato H, Shiba H, Funato M, Che F-S, Watanabe M, et al. Direct ligand–receptor complex interaction controls Brassica self-incompatibility. Nature. 2001;413:534–8.

    CAS  PubMed  Article  Google Scholar 

  7. 7.

    Cabrillac D, Cock JM, Dumas C, Gaude T. The S -locus receptor kinase is inhibited by thioredoxins and activated by pollen coat proteins. Nature. 2001;410:220–3.

    CAS  PubMed  Article  Google Scholar 

  8. 8.

    Iwano M, Igarashi M, Tarutani Y, Kaothien-Nakayama P, Nakayama H, Moriyama H, et al. A pollen coat-inducible autoinhibited Ca2+−ATPase expressed in stigmatic papilla cells is required for compatible pollination in the Brassicaceae. Plant Cell. 2014;26:636–49.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  9. 9.

    Iwano M, Shiba H, Matoba K, Miwa T, Funato M, Entani T, et al. Actin dynamics in papilla cells of Brassica rapa during self- and cross-pollination. Plant Physiol. 2007;144:72–81.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  10. 10.

    Elleman CJ, Dickinson HG. Identification of pollen components regulating pollination-specific responses in the stigmatic papillae of Brassica oleracea. New Phytol. 1996;133:197–205.

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Safavian D, Goring DR. Secretory activity is rapidly induced in stigmatic papillae by compatible pollen, but inhibited for self-incompatible pollen in the Brassicaceae. PLoS One. 2013;8:e84286.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  12. 12.

    Iwano M, Ito K, Fujii S, Kakita M, Asano-Shimosato H, Igarashi M, et al. Calcium signalling mediates self-incompatibility response in the Brassicaceae. Nat Plants. 2015;1:15128.

    CAS  PubMed  Article  Google Scholar 

  13. 13.

    Indriolo E, Safavian D, Goring DR. The ARC1 E3 ligase promotes two different self-pollen avoidance traits in Arabidopsis. Plant Cell. 2014;26:1525–43.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  14. 14.

    Swanson R, Edlund AF, Preuss D. Species specificity in pollen-pistil interactions. Annu Rev Genet. 2004;38:793–818.

    CAS  PubMed  Article  Google Scholar 

  15. 15.

    Sankaranarayanan S, Jamshed M, Samuel MA. Proteomics approaches advance our understanding of plant self-incompatibility response. J Proteome Res. 2013;12:4717–26.

    CAS  PubMed  Article  Google Scholar 

  16. 16.

    Matsuda T, Matsushima M, Nabemoto M, Osaka M, Sakazono S, Masuko-Suzuki H, et al. Transcriptional characteristics and differences in Arabidopsis stigmatic papilla cells pre- and post-pollination. Plant Cell Physiol. 2015;56:663–73.

    CAS  PubMed  Article  Google Scholar 

  17. 17.

    Zhang T, Gao C, Yue Y, Liu Z, Ma C, Zhou G, et al. Time-course Transcriptome analysis of compatible and incompatible pollen-stigma interactions in Brassica napus L. Front Plant Sci. 2017;8.

  18. 18.

    Lin S-Y, Chen P-W, Chuang M-H, Juntawong P, Bailey-Serres J, Jauh G-Y. Profiling of Translatomes of in vivo-grown pollen tubes reveals genes with roles in Micropylar guidance during pollination in Arabidopsis. Plant Cell. 2014;26:602–18.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  19. 19.

    Leydon AR, Weinreb C, Venable E, Reinders A, Ward JM, Johnson MA. The molecular dialog between flowering plant reproductive partners defined by SNP-informed RNA-sequencing. The Plant Cell, Vol. 29:984–1006

  20. 20.

    Nariai N, Kojima K, Mimori T, Kawai Y, Nagasaki M. A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes. BMC Genomics. 2016;17.

  21. 21.

    Kusaba M, Dwyer K, Hendershot J, Vrebalov J, Nasrallah JB, Nasrallah ME. Self-incompatibility in the genus Arabidopsis: characterization of the S locus in the outcrossing a. lyrata and its autogamous relative a. thaliana. Plant Cell. 2001;13:627–43.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  22. 22.

    Nasrallah ME. Generation of self-incompatible Arabidopsis thaliana by transfer of two S locus genes from a. lyrata. Science. 2002;297:247–9.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  23. 23.

    Smyth DR, Bowman JL, Meyerowitz EM. Early flower development in Arabidopsis. Plant Cell. 1990;2:755–67.

    CAS  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15.

  25. 25.

    Carr DB, Littlefield RJ, Nicholson WL, Littlefield JS. Scatterplot matrix techniques for large N. J Am Stat Assoc. 1987;82:424–36.

    Google Scholar 

  26. 26.

    Qin Y, Leydon AR, Manziello A, Pandey R, Mount D, Denic S, et al. Penetration of the stigma and style elicits a novel Transcriptome in pollen tubes, pointing to genes critical for growth in a pistil. PLoS Genet. 2009;5:e1000621.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  27. 27.

    Boavida LC, Borges F, Becker JD, Feijo JA. Whole genome analysis of gene expression reveals coordinated activation of signaling and metabolic pathways during pollen-pistil interactions in Arabidopsis. Plant Physiol. 2011;155:2066–80.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  28. 28.

    Honys D, Twell D. Transcriptome analysis of haploid male gametophyte development in Arabidopsis. Genome Biol. 2004;5:R85.

    PubMed  PubMed Central  Article  Google Scholar 

  29. 29.

    Cheng C-Y, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. Plant J. 2017;89:789–804.

    CAS  PubMed  Article  Google Scholar 

  30. 30.

    Doucet J, Truong C, Frank-Webb E, Lee HK, Daneva A, Gao Z, et al. Identification of a role for an E6-like 1 gene in early pollen-stigma interactions in Arabidopsis thalianaPreprint. Plant Biol. 2019.

  31. 31.

    Myers C, Romanowsky SM, Barron YD, Garg S, Azuse CL, Curran A, et al. Calcium-dependent protein kinases regulate polarized tip growth in pollen tubes. Plant J. 2009;59:528–39.

    CAS  PubMed  Article  Google Scholar 

  32. 32.

    Osaka M, Matsuda T, Sakazono S, Masuko-Suzuki H, Maeda S, Sewaki M, et al. Cell type-specific Transcriptome of Brassicaceae stigmatic papilla cells from a combination of laser microdissection and RNA sequencing. Plant Cell Physiol. 2013;54:1894–906.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  33. 33.

    Thomas PD. The gene ontology and the meaning of biological function. In: Dessimoz C, Škunca N, editors. The gene ontology handbook. New York: Springer; 2017. p. 15–24.

    Google Scholar 

  34. 34.

    Ringnér M. What is principal component analysis? Nat Biotechnol. 2008;26:303–4.

    PubMed  Article  CAS  Google Scholar 

  35. 35.

    Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45(Database issue):D353–61.

    CAS  PubMed  Article  Google Scholar 

  36. 36.

    Bigeard J, Colcombet J, Hirt H. Signaling mechanisms in pattern-triggered immunity (PTI). Mol Plant. 2015;8:521–39.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  37. 37.

    Obayashi T, Aoki Y, Tadaka S, Kagaya Y, Kinoshita K. ATTED-II in 2018: a plant Coexpression database based on investigation of the statistical property of the mutual rank index. Plant Cell Physiol. 2018;59:e3.

    PubMed  Article  CAS  Google Scholar 

  38. 38.

    Dickinson H. Dry stigmas, water and self-incompatibility in brassica. Sex Plant Reprod. 1995;8:1–10.

    Article  Google Scholar 

  39. 39.

    Nasrallah J. Recognition and rejection of self in plant self-incompatibility: comparisons to animal histocompatibility. Trends Immunol. 2005;26:412–8.

    CAS  PubMed  Article  Google Scholar 

  40. 40.

    Mondragón-Palomino M, John-Arputharaj A, Pallmann M, Dresselhaus T. Similarities between reproductive and immune pistil Transcriptomes of Arabidopsis species. Plant Physiol. 2017;174:1559–75.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  41. 41.

    Tung C-W. Genome-wide identification of genes expressed in Arabidopsis pistils specifically along the path of pollen tube growth. Plant Physiol. 2005;138:977–89.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  42. 42.

    Osaka M, Nabemoto M, Maeda S, Sakazono S, Masuko-Suzuki H, Ito K, et al. Genetic and tissue-specific RNA-sequencing analysis of self-compatible mutant TSC28 in Brassica rapa L. toward identification of a novel self-incompatibility factor. Genes Genet Syst. 2019;94:167–76.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  43. 43.

    Sanati Nezhad A, Geitmann A. The cellular mechanics of an invasive lifestyle. J Exp Bot. 2013;64:4709–28.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  44. 44.

    Marsollier A-C, Ingram G. Getting physical: invasive growth events during plant development. Curr Opin Plant Biol. 2018;46:8–17.

    PubMed  Article  PubMed Central  Google Scholar 

  45. 45.

    Eggermont L, Verstraeten B, Van Damme EJM. Genome-wide screening for Lectin motifs in Arabidopsis thaliana. Plant Genome. 2017;10:1-17.

  46. 46.

    Hua Z, Vierstra RD. The Cullin-RING ubiquitin-protein ligases. Annu Rev Plant Biol. 2011;62:299–334.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  47. 47.

    Jebanathirajah JA, Peri S, Pandey A. Toll and interleukin-1 receptor (TIR) domain-containing proteins in plants: a genomic perspective. Trends Plant Sci. 2002;7:388–91.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  48. 48.

    Gururani MA, Venkatesh J, Upadhyaya CP, Nookaraju A, Pandey SK, Park SW. Plant disease resistance genes: current status and future directions. Physiol Mol Plant Pathol. 2012;78:51–65.

    CAS  Article  Google Scholar 

  49. 49.

    Liang X, Zhou J-M. Receptor-like cytoplasmic kinases: central players in plant receptor kinase–mediated signaling. Annu Rev Plant Biol. 2018;69:267–99.

    CAS  PubMed  Article  Google Scholar 

  50. 50.

    Nyström T, Neidhardt FC. Isolation and properties of a mutant of Escherichia coli with an insertional inactivation of the uspA gene, which encodes a universal stress protein. J Bacteriol. 1993;175:3949–56.

    PubMed  PubMed Central  Article  Google Scholar 

  51. 51.

    Nyström T, Neidhardt FC. Expression and role of the universal stress protein, UspA, of Escherichia coli during growth arrest. Mol Microbiol. 1994;11:537–44.

    PubMed  Article  Google Scholar 

  52. 52.

    Tao H, Bausch C, Richmond C, Blattner FR, Conway T. Functional genomics: expression analysis of Escherichia coli growing on minimal and Rich Media. J Bacteriol. 1999;181:6425–40.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  53. 53.

    Kerk D, Bulgrien J, Smith DW, Gribskov M. Arabidopsis proteins containing similarity to the universal stress protein domain of bacteria. Plant Physiol. 2003;131:1209–19.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  54. 54.

    Leydon AR, Beale KM, Woroniecka K, Castner E, Chen J, Horgan C, et al. Three MYB transcription factors control pollen tube differentiation required for sperm release. Curr Biol. 2013;23:1209–14.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  55. 55.

    Lu Y, Chanroj S, Zulkifli L, Johnson MA, Uozumi N, Cheung A, et al. Pollen tubes lacking a pair of K+ transporters fail to target ovules in Arabidopsis. Plant Cell. 2011;23:81–93.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  56. 56.

    Curtis RHC, Pankaj S, Powers SJ, Napier J, Matthes MC. The Arabidopsis F-box/Kelch-repeat protein At2g44130 is upregulated in giant cells and promotes nematode susceptibility. Mol Plant-Microbe Interact MPMI. 2013;26:36–43.

    CAS  PubMed  Article  Google Scholar 

  57. 57.

    Thiel H, Hleibieh K, Gilmer D, Varrelmann M. The P25 pathogenicity factor of beet necrotic yellow vein virus targets the sugar beet 26S proteasome involved in the induction of a hypersensitive resistance response via interaction with an F-box protein. Mol Plant-Microbe Interact MPMI. 2012;25:1058–72.

    CAS  PubMed  Article  Google Scholar 

  58. 58.

    Wang J, Yao W, Wang L, Ma F, Tong W, Wang C, et al. Overexpression of VpEIFP1, a novel F-box/Kelch-repeat protein from wild Chinese Vitis pseudoreticulata, confers higher tolerance to powdery mildew by inducing thioredoxin z proteolysis. Plant Sci Int J Exp Plant Biol. 2017;263:142–55.

    CAS  Google Scholar 

  59. 59.

    Paquis S, Mazeyrat-Gourbeyre F, Fernandez O, Crouzet J, Clément C, Baillieul F, et al. Characterization of a F-box gene up-regulated by phytohormones and upon biotic and abiotic stresses in grapevine. Mol Biol Rep. 2011;38:3327–37.

    CAS  PubMed  Article  Google Scholar 

  60. 60.

    Jia Y, Gu H, Wang X, Chen Q, Shi S, Zhang J, et al. Molecular cloning and characterization of an F-box family gene CarF-box 1 from chickpea (Cicer arietinum L.). Mol Biol Rep. 2012;39:2337–45.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  61. 61.

    Kim HJ, Chiang Y-H, Kieber JJ, Schaller GE. SCFKMD controls cytokinin signaling by regulating the degradation of type-B response regulators. Proc Natl Acad Sci. 2013;110:10028–33.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  62. 62.

    Zhang X, Gou M, Liu C-J. Arabidopsis Kelch repeat F-box proteins regulate phenylpropanoid biosynthesis via controlling the turnover of phenylalanine ammonia-lyase. Plant Cell. 2013;25:4994–5010.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  63. 63.

    Zhang X, Gou M, Guo C, Yang H, Liu C-J. Down-regulation of Kelch domain-containing F-box protein in Arabidopsis enhances the production of (poly) phenols and tolerance to ultraviolet radiation. Plant Physiol. 2015;167:337–50.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  64. 64.

    Zhang X, Abrahan C, Colquhoun TA, Liu C-J. A Proteolytic regulator controlling Chalcone synthase stability and flavonoid biosynthesis in Arabidopsis. Plant Cell. 2017;29:1157–74.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  65. 65.

    Borah P, Khurana JP. The OsFBK1 E3 ligase subunit affects anther and root secondary Cell Wall thickenings by mediating turnover of a Cinnamoyl-CoA Reductase. Plant Physiol. 2018;176:2148–65.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  66. 66.

    Franciosini A, Lombardi B, Iafrate S, Pecce V, Mele G, Lupacchini L, et al. The Arabidopsis COP9 SIGNALOSOME INTERACTING F-BOX KELCH 1 protein forms an SCF ubiquitin ligase and regulates hypocotyl elongation. Mol Plant. 2013;6:1616–29.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  67. 67.

    Han L, Mason M, Risseeuw EP, Crosby WL, Somers DE. Formation of an SCF (ZTL) complex is required for proper regulation of circadian timing. Plant J Cell Mol Biol. 2004;40:291–301.

    CAS  Article  Google Scholar 

  68. 68.

    Nelson DC, Lasswell J, Rogg LE, Cohen MA, Bartel B. FKF1, a clock-controlled gene that regulates the transition to flowering in Arabidopsis. Cell. 2000;101:331–40.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  69. 69.

    Sawa M, Nusinow DA, Kay SA, Imaizumi T. FKF1 and GIGANTEA complex formation is required for day-length measurement in Arabidopsis. Science. 2007;318:261–5.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  70. 70.

    Kinoshita-Tsujimura K, Kakimoto T. Cytokinin receptors in sporophytes are essential for male and female functions in Arabidopsis thaliana. Plant Signal Behav. 2011;6:66–71.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  71. 71.

    Stone SL. ARC1 is an E3 ubiquitin ligase and promotes the Ubiquitination of proteins during the rejection of self-incompatible brassica pollen. Plant Cell Online. 2003;15:885–98.

    Article  Google Scholar 

  72. 72.

    Indriolo E, Tharmapalan P, Wright SI, Goring DR. The ARC1 E3 ligase gene is frequently deleted in self-compatible Brassicaceae species and has a conserved role in Arabidopsis lyrata self-pollen rejection. Plant Cell. 2012;24:4607–20.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  73. 73.

    Bourdais G, Burdiak P, Gauthier A, Nitsch L, Salojärvi J, Rayapuram C, et al. Large-scale Phenomics identifies primary and fine-tuning roles for CRKs in responses related to oxidative stress. PLoS Genet. 2015;11:e1005373.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  74. 74.

    Yadeta KA, Elmore JM, Creer AY, Feng B, Franco JY, Rufian JS, et al. A cysteine-rich protein kinase associates with a membrane immune complex and the cysteine residues are required for cell death. Plant Physiol. 2017;173:771–87.

    CAS  PubMed  Article  Google Scholar 

  75. 75.

    Forsthoefel NR, Klag KA, Simeles BP, Reiter R, Brougham L, Vernon DM. The Arabidopsis plant intracellular Ras-group LRR (PIRL) family and the value of reverse genetic analysis for identifying genes that function in gametophyte development. Plants Basel Switz. 2013;2:507–20.

    Google Scholar 

  76. 76.

    Li R, Liu P, Wan Y, Chen T, Wang Q, Mettbach U, et al. A membrane microdomain-associated protein, Arabidopsis Flot1, is involved in a clathrin-independent endocytic pathway and is required for seedling development. Plant Cell. 2012;24:2105–22.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  77. 77.

    Yu M, Liu H, Dong Z, Xiao J, Su B, Fan L, et al. The dynamics and endocytosis of Flot1 protein in response to flg22 in Arabidopsis. J Plant Physiol. 2017;215:73–84.

    CAS  PubMed  Article  Google Scholar 

  78. 78.

    Kulich I, Cole R, Drdová E, Cvrčková F, Soukup A, Fowler J, et al. Arabidopsis exocyst subunits SEC8 and EXO70A1 and exocyst interactor ROH1 are involved in the localized deposition of seed coat pectin. New Phytol. 2010;188:615–25.

    CAS  PubMed  Article  Google Scholar 

  79. 79.

    Qi M, Zheng W, Zhao X, Hohenstein JD, Kandel Y, O’Conner S, et al. QQS orphan gene and its interactor NF-YC4 reduce susceptibility to pathogens and pests. Plant Biotechnol J. 2019;17:252–63.

    CAS  PubMed  Article  Google Scholar 

  80. 80.

    Kusch S, Panstruga R. Mlo-based resistance: an apparently universal “weapon” to defeat powdery mildew disease. Mol Plant-Microbe Interact. 2017;30:179–89.

    CAS  PubMed  Article  Google Scholar 

  81. 81.

    Jany E, Nelles H, Goring DR. The molecular and cellular regulation of Brassicaceae self-incompatibility and self-pollen rejection. In: International review of cell and molecular biology: Elsevier; 2019. p. 1–35.

  82. 82.

    Ivanov R, Gaude T. Endocytosis and Endosomal regulation of the S-receptor kinase during the self-incompatibility response in Brassica oleracea. Plant Cell Online. 2009;21:2107–17.

    CAS  Article  Google Scholar 

  83. 83.

    Yamamoto M, Nishio T, Nasrallah JB. Activation of self-incompatibility signaling in transgenic Arabidopsis thaliana is independent of AP2-based Clathrinmediatedendocytosis. G3 (Bethesda). 2018;8(7):2231–9.

  84. 84.

    Fan L, Li R, Pan J, Ding Z, Lin J. Endocytosis and its regulation in plants. Trends Plant Sci. 2015;20:388–97.

    CAS  PubMed  Article  Google Scholar 

  85. 85.

    Bechsgaard JS. The transition to self-compatibility in Arabidopsis thaliana and evolution within S-haplotypes over 10 Myr. Mol Biol Evol. 2006;23:1741–50.

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  86. 86.

    Shimizu KK, Tsuchimatsu T. Evolution of Selfing: recurrent patterns in molecular adaptation; 2015. p. 33.

    Google Scholar 

  87. 87.

    Scholl RL, May ST, Ware DH. Seed and molecular resources for Arabidopsis. Plant Physiol. 2000;124:1477–80.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  88. 88.

    Verger S, Chabout S, Gineau E, Mouille G. Cell adhesion in plants is under the control of putative O-fucosyltransferases. Dev Camb Engl. 2016;143:2536–40.

    CAS  Google Scholar 

  89. 89.

    Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinforma Oxf Engl. 2009;25:1754–60.

    CAS  Article  Google Scholar 

  90. 90.

    McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  91. 91.

    DePristo MA, Banks E, Poplin RE, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  92. 92.

    Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43:11.10.1–11.10.33.

    Article  Google Scholar 

  93. 93.

    Otto TD, Dillon GP, Degrave WS, Berriman M. RATT: rapid annotation transfer tool. Nucleic Acids Res. 2011;39:e57.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  94. 94.

    Rice P, Longden I, Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet TIG. 2000;16:276–7.

    CAS  PubMed  Article  Google Scholar 

  95. 95.

    Hamada M, Ono Y, Asai K, Frith MC, Hancock J. Training alignment parameters for arbitrary sequencers with LAST-TRAIN. Bioinformatics. 2017;33:926–8.

    CAS  PubMed  Google Scholar 

  96. 96.

    Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25.

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  97. 97.

    Mi H, Muruganujan A, Thomas PD. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 2013;41(Database issue):D377–86.

    CAS  PubMed  Google Scholar 

  98. 98.

    Thomas PD, Campbell MJ, Kejariwal A, Mi H, Karlak B, Daverman R, et al. PANTHER: a library of protein families and subfamilies indexed by function. Genome Res. 2003;13:2129–41.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  99. 99.

    Koressaar T, Remm M. Enhancements and modifications of primer design program Primer3. Bioinforma Oxf Engl. 2007;23:1289–91.

    CAS  Article  Google Scholar 

Download references


We thank Yvon Jaillais, Olivier Hamant, Gwyneth Ingram for critical reading of the manuscript, SiCE team members for fruitful discussions and comments, Fabrice Besnard for critical reading of the manuscript and for specific advice and script sharing relative to SNP calling, Naoki Nariai for the help to perform analysis with ASE-TIGAR, Alexander R. Leydon for the information for SNP-based RNAseq analysis. Agnès Attard for the discussion of the bioinformatics data analysis. We gratefully acknowledge the PSMN (Pôle Scientifique de Modélisation Numérique) of the ENS de Lyon to share their computing resources.


Funding for experiments was provided by the French Agence Nationale de la Recherche (Grant ANR-14-CE11–0021). CK was supported by an ANR fellowship. The funders had no role in either the study design, analysis and interpretation of data, or in writing the manuscript.

Author information




CK, JJ, MDR developed the data analysis pipeline and performed the bioinformatics analyses. AL, JL supported the bioinformatics analyses. AL helped with interpretation of the transcriptomic data. CK is responsible for all experiments and analysis performed in the study except described below. CK and LR performed the image acquisition by scanning electron microscope. CK, LR, FR, IFL harvested plant materials and extracted RNA. CK, TG, IFL designed the study and CK, JJ, TG, IFL wrote the manuscript. All the authors contributed to the discussion. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Chie Kodera or Isabelle Fobis-Loisy.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

We have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Known gene expression information for sex-specifically expressed genes. Top 20 sex-specifically expressed genes were analysed with ThaleMine database ( Heat maps of RNA-seq based gene expression levels (Cheng et al., 2016) for each list were derived (stigma: upper, pollen: lower). * genes analyzed by RT-PCR and sequencing in Fig. 3c. Figure S2. Depth coverage of Col-0/SRK14 and C24 reference genomes after variants calling. Distributions of depth coverage before filtering were obtained by using GATK Depth Of Coverage. After checking depth coverage, Col-0/SRK14 variants were filtered at depth 3 and C24 variants were filtered at depth 6. Table S1. Read length and depth of whole genome sequencing. Col-0/SRK14 and C24 genomes were sequenced with NextSeq500 platform (Illumina) applying paired-end sequencing (2×150 bp). Sequence quality was checked by FastQC ( Read parts with quality score less than 26 were trimmed and reads shorter than 50bp were discarded. F = forward sequence, R = reverse sequence. Table S2. Length of sequenced RNA reads. Four RNA replicates selected from five samples at each pollination condition were sequenced with NextSeq500 platform (Illumina) applying paired-end sequencing (2×75 bp). Sequence quality was checked by FastQC ( Read parts with quality score less than 26 were trimmed and reads shorter than 40bp were discarded. F=forward sequence, R = reverse sequence. Table S6. Number of up- and down-regulated genes. Genes up-regulated (FC > 2.0, padj < 0.1) and down-regulated (FC < − 2.0, padj < 0.1) were selected.

Additional file 2: Table S3.

Relative abundance of each transcript between stigma and pollen

Additional file 3: Table S4.

Expressed genes, sex-preferentially and sex-specifically expressed genes.

Additional file 4: Table S5.

GO term enrichment of the top 1000 preferentially expressed genes in stigma or in pollen

Additional file 5: Table S7.

Genes analyzed with a Venn diagram representation

Additional file 6: Table S8.

GO term enrichment of genes induced after pollination in stigma

Additional file 7: Table S9.

Genes in the PTI pathway induced in stigma after compatible pollination.

Additional file 8: Table S10.

Prediction of sex-specificity by ECM

Additional file 9: Table S11.

Selectively induced genes after compatible/incompatible responses in stigma or pollen.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Kodera, C., Just, J., Da Rocha, M. et al. The molecular signatures of compatible and incompatible pollination in Arabidopsis. BMC Genomics 22, 268 (2021).

Download citation


  • RNA sequencing
  • SNPs
  • Pollen-stigma interaction
  • Compatible / incompatible pollination
  • Male-female transcriptome
  • PTI pathway