- Research article
- Open Access
Targeting environmental adaptation in the monocot model Brachypodium distachyon: a multi-faceted approach
© Dell’Acqua et al.; licensee BioMed Central Ltd. 2014
- Received: 29 July 2014
- Accepted: 4 September 2014
- Published: 18 September 2014
The local environment plays a major role in the spatial distribution of plant populations. Natural plant populations have an extremely poor displacing capacity, so their continued survival in a given environment depends on how well they adapt to local pedoclimatic conditions. Genomic tools can be used to identify adaptive traits at a DNA level and to further our understanding of evolutionary processes. Here we report the use of genotyping-by-sequencing on local groups of the sequenced monocot model species Brachypodium distachyon. Exploiting population genetics, landscape genomics and genome wide association studies, we evaluate B. distachyon role as a natural probe for identifying genomic loci involved in environmental adaptation.
Brachypodium distachyon individuals were sampled in nine locations with different ecologies and characterized with 16,697 SNPs. Variations in sequencing depth showed consistent patterns at 8,072 genomic bins, which were significantly enriched in transposable elements. We investigated the structuration and diversity of this collection, and exploited climatic data to identify loci with adaptive significance through i) two different approaches for genome wide association analyses considering climatic variation, ii) an outlier loci approach, and iii) a canonical correlation analysis on differentially sequenced bins. A linkage disequilibrium-corrected Bonferroni method was applied to filter associations. The two association methods jointly identified a set of 15 genes significantly related to environmental adaptation. The outlier loci approach revealed that 5.7% of the loci analysed were under selection. The canonical correlation analysis showed that the distribution of some differentially sequenced regions was associated to environmental variation.
We show that the multi-faceted approach used here targeted different components of B. distachyon adaptive variation, and may lead to the discovery of genes related to environmental adaptation in natural populations. Its application to a model species with a fully sequenced genome is a modular strategy that enables the stratification of biological material and thus improves our knowledge of the functional loci determining adaptation in near-crop species. When coupled with population genetics and measures of genomic structuration, methods coming from genome wide association studies may lead to the exploitation of model species as natural probes to identify loci related to environmental adaptation.
- Landscape genomics
- Brachypodium distachyon
- Genotyping by sequencing
- Population genetics
- Association mapping
One of the most ambitious objectives of natural variation studies is to provide a description of functional variability in natural populations . The ability of a living organism to endure environmental challenges depends on the portion of genetic variation with adaptive implications  that sustains the formation of ecotypes through ecological evolution . In plant sciences, being able to identify the genetic determinants of complex traits may help enhance crops . The discovery of the genetic bases of complex traits with adaptive significance in model species  and in crops [6, 7] is often the first step towards molecular breeding programs [8, 9].
Domestication and breeding, however, have caused a severe reduction of crop diversity, whose extant genetic variation is much smaller than that of their wild relatives [10, 11]. This limits the diversity in which to search for adaptation, thus hindering our ability to identify favourable allelic combinations. Focusing on natural populations of the wild relatives of crops, with their broader genetic diversity, could help overcome this limitation and even allow new ground to be broken. As geographical objects, natural populations might be used to study the relation between the genetic and ecologic diversity in search of adaptive traits. Genomic synteny would then allow the targeting of homologous candidate adaptive genes in the crop of interest [12, 13]. The environment can be considered as an unceasing breeder selecting for successful alleles, providing this approach potential downfalls in an agronomic perspective.
The relation between genetic and climatic variation in natural populations has already been explored in humans [14, 15], and genetic determinants for fitness variation in different environments have been described in Arabidopsis thaliana. Environmental data was gradually introduced in population genetics practises, being addressed by some landscape genetics and landscape genomics [17, 18], thereby being able to describe adaptive variability by means of the differential distribution of alleles on an ecological basis . This can be done either through i) outlier detection or ii) association methods . Outlier detection relies on Wright’s fixation index Fst to identify loci under selection through their differentiation from the basal and neutral genomic variation . Although widely used in animal species [23, 24] and less frequently in plant species , outlier detection can be biased by genetic structure and limited sensitivity . In addition, it does not explicitly address environmental variation. On the other hand, association methods are based on marker - trait regressions and they directly target quantitative measures of the environment. The statistical framework of association methods is largely similar to that of genome-wide association studies (GWAS), which were originally developed in humans  to map complex trait determinants. GWAS are increasingly applied to plants [28, 29], where generally higher minor allele frequencies, multi-trait directional selection, and extensive linkage disequilibrium simplify their application .
When considering organisms with limited displacing abilities such as plants, association methods might accommodate quantitative environmental data as a response variable rather than phenotypes, and map genomic associations with climate [31–33]. Whilst outlier loci methods perform better with the strongest signatures of selection, association methods are appropriate to ascertain weak selection , and may lead to the identification of soft sweep signatures of low intensity selection [34, 35]. Outlier detection and association methods were merged in an investigation into Populus and Teosinte, thus leading to the identification of loci with clear adaptive significance towards climate. A study in Medicago joined the association approach with an ex situ phenotypic evaluation, confirming the reliability of these methods . In all cases, great focus is needed on to the interrelation of genetic variation and spatial displacement, as false statistical signals might arise when spatial structuration mirrors environmental adaptation . The dependency of genetic diversity upon spatial diversity, though rarely considered in depth, can heavily influence the outcome of both these methods.
Merging population genomics and landscape data requires two sources of information. The landscape derives from geographical information systems (GISs), which can be used to couple quantitative geographical data with biological sampling [40, 41] and model the spatial relations of individuals. Global climate models developed for GISs  link climatic information with sampled individuals, providing both quantitative environmental data for each individual studied and a means for controlling spatial bias over genetic diversity. The genomics, in fact, must first consider the disturbance caused by the many evolutionary forces other than selection , as well as disturbance due to unknown demography that might add noise to association approaches . High-throughput genotyping data are needed in order to provide the widest possible representation of the variation at a genome level, and thus efficiently control the many forces acting at such scale. The lowering of DNA sequencing costs together with the application of strategies for the reduction of genome complexity  makes DNA sequencing itself a means for discovering and analysing molecular markers . Genotyping-by-sequencing (GBS)  is a reductionist strategy, and is increasingly employed in ecological genomics studies .
In this paper, we identify loci linked to environmental adaptation in Turkish accessions of the grass species Brachypodium distachyon (L.) P. Beauv. Brachypodium distachyon is the leading model species for small grain monocots and temperate grasses , with an ancestral range spanning the Middle and Near East, and currently including most of the temperate areas of the world . Until recently, B. distachyon was deemed to have three distinct cytotypes of 2n = 10, 20 and 30 chromosomes: a recent study identified three different taxonomic entities, of which B. distachyon has the 2n = 10 chromosome set . B. distachyon genome (approximately 271 Mbp) was completely sequenced in the inbred line Bd21 . Natural populations of B. distachyon have already been extensively collected in Turkey, showing high intra-population homozygosity and a high level of inter-population genetic diversity . This was an interesting condition to test the possibility to search for environmental adaptation whilst accounting for structuration.
We explored the possibility of identifying the relation between climate and genomic features in a starting panel of 82 B. distachyon individuals collected in nine locations scattered across a 1000-km transect in Turkey. By this, we wanted to exploit both methods developed in the landscape genomics field and in the GWAS community. Bringing landscape genomics closer to complex traits mapping, especially in an agronomical perspective, might open a significant perspective in the field. We employed a GBS approach to provide a genome-wide representation of molecular diversity in these B. distachyon individuals. The sampling locations were monitored on a GIS system to obtain climatic data for each individual, at the same time controlling for the spatial distribution of genetic diversity. The data was processed using the complementary characteristics of outlier and association approaches in order to identify signatures of adaptation at a molecular level.
We found that the association and outlier methods mostly targeted soft and hard sweeps of selection, respectively. GWAS and landscape genomics method jointly identified 15 genes involved in B. distachyon adaptation. We also found that transposable elements were differentially distributed across the genomes of local groups, some with a pattern matching the climatic diversity of the sampling transect.
Our method could be extended by including more genotypes and by targeting additional environments and environmental variables. Once the biological material is characterized, this might aggregate additional data and thus extend our capacity to understand the molecular bases of adaptation. B. distachyon could then be used as a natural probe to report functional variations in a broad set of environmental situations.
GIS analyses and sampling
Biological material included in the study
18 Mart Üniv. Kampus
Yenice Balya arası
Balya Yenice arası II
Kütahya Tavşanlı çıkışı
Kaymaz Mesire yeri Eskişehir
Polatlı- Haymana arası
Çanakkale Bursa Yolu Başlangıcı
Genotyping by sequencing
Genomic distribution of transposable elements
Diversity analyses and population genetics
Distance and diversity among populations
A structure analysis conducted with Bayesian methods pointed to the existence of five distinct genetic clusters, thus extending the geographical pattern that had already emerged from the previous analyses. Samples from sampling locations A, B, C2, and H were all assigned to the largest cluster. D, F, G mostly accounted for the second largest cluster. All samples from region E but one clustered in a third cluster. Samples from region C shown some ancestry with those from C2 but acted as a separate cluster. The fifth cluster was contributed in small amounts by individuals from sampling location F. Overall, the spatial genetic diversity displays strong structuration, but little correlation with spatial distance.
Genomic loci with adaptive significance
Genes emerging from association analysis with climatic variables
5 kb upstream
5 kb upstream
5 kb upstream
5 kb upstream
5 kb upstream
5 kb upstream
5 kb upstream
5 kb upstream
Positional enrichment of SNPs identified by association and outlier methods
> 5 kbp upstream
< 5 kbp upstream
Predicted genes involved
Putative genes involved in adaptation
We assayed the functional role of EASs detected by both association methods, as representatives of the strongest signal for adaptation (Table 4). Environmental PC1 targets most of the genic EASs. Bradi1g03700, a 60s ribosomal protein L36-3-like, is probably involved in expression control. In maize, the 60s subunit is involved in flooding responses  and might be related to environmental stress responses. The same EAS targets on the reverse strand Bradi1g02575, bearing an oxidoreductase activity domain. PC1 also targets a MYB transcription factor (Bradi2g38560) a class of proteins involved in plant responses processes, including those to abiotic stresses. MYB are a strategic targets for crop improvement . Notably, we detected three outlier loci less than 100 Kb downstream this association [Additional file 3]. The phosphoprotein phosphatase Bradi1g71690, is likely involved in cellular signalling. Signalling is also contributed by Bradi3g28560 (transferase activity). This predicted gene encodes for a 3-ketoacyl-CoA synthase, whose elective biological processes include wax synthesis  and response to cold and light stimulus (http://www.uniprot.org). The energetic balance of the cell is possibly contributed by Bradi1g73170, a sucrose transmembrane transporter targeted by PC1, and Bradi4g04710, targeted by PC2 and involved in the mitochondrial respiration chain. Within 500 Kb of this locus, two outlier loci are found [Additional file 3]. The EAS at 44,083,155 bp on chromosome 3, identified by both PC1 and PC3 is in the vicinity of a set of protein coding genes of unknown function.
The twofold gain of genotyping by sequencing
Although genome re-sequencing offers the most inclusive possible overview of the genomic variability of small genome species [60, 61], methods based on the reduction of genome complexity such as GBS represent a cheaper and versatile alternative to genotype any species of interest in multiplex. However, due to the technical variations inherent in the protocol, a GBS run can yield an unbalanced representation of samples . Here we showed that such an unbalanced distribution might mask biological reasons that are actually worth investigating.
In our case, the persistence of phylogenetic relationships among samples when using P/A regions as genetic markers (Figures 2 and 3) suggest that there is an inheritable pattern that is consistent with the differential distribution of transposable elements (TE) . In this case P/A regions may either result from the loss of the cut site because of TE movement, or from the impairing of the methyl-sensitive ApeKI cleavage as a consequence of the presence of methylated DNA regions.
In both cases, the coverage of sequencing reads will show a gap as a result of the failure of the enzymatic cut. P/A regions are clearly enriched in TE, as demonstrated by the fact that the average TE content in those regions is significantly higher than that of the entire genome (45.96% versus 26.48%). The enrichment is even more dramatic when the TE content of regions not classified as P/A regions is taken into account (14.52%, more than three times less). This evidence strongly suggests that TE displacement has a role in the P/A polymorphism.
Two simple scenarios could be envisaged to explain the data: i) TEs inserted into the Bd21 reference genome after it separated from the other populations (or TEs inserted before Bd21 separated, but were then removed from some of the populations) thus giving rise to P/A polymorphism; ii) TEs are present in orthologous regions of both Bd21 and resequenced samples, but they are methylated only in some of the resequenced regions.
As far as we are aware this structural variation, as revealed by GBS, has never been reported before. We believe that it is of great importance as it may introduce a significant bias in genomic imputation. Our findings might stimulate further studies on the adaptive role of the differential distribution of transposable elements in B. distachyon natural populations. Our CCA analysis identified some of the P/A as being strongly related to environment, especially to PC1 and PC2 (Figure 7). Modifications in methylation patterns associated with transposable elements have already been reported to influence a set of genes in 20 maize lines .
Approaching environmental associations
Brachypodium distachyon proved to be an effective model for the application of landscape genomics. The high Fst value between local groups (Table 3) is in accordance with the expectancy for self-fertilizing plants . The depletion of intra-population variation in presence autogamy is exacerbated by selective sweeps, background selection, and possibly recurrent extinctions and recolonizations , as likely in our case. Our results might appear to be in contrast with those from B. distachyon populations from the Iberian peninsula, where SSR and ISSR markers showed an unexpectedly high intra-population variation . We believe this might derive from the markers used, as SSR and ISSR sites change at a higher pace than coding regions targeted by GBS. In addition, as the authors suggest , the high variation in the Iberian populations might be linked to the proximity to the distribution limit of B. distachyon. On a broader scale, our SNP-based survey showed that the genetic diversity did not linearly correlate with spatial distances. As expected local groups are highly differentiated, yet share similarities with individuals far away (Figures 2, and 4). This is why the correlation between cGD and physical distance is not significant, but spatial structuration is both evident from the population graph and sPCA analyses (Figure 4).
Are we thus looking at isolation by distance (IBD)? IBD is the direct consequence of the limited dispersal of alleles, causing populations that are spatially near to share more similarities with each other than populations far away . This phenomenon affects the exploitability of the molecular data derived from sampling natural populations . The samples under study, though, do not show IBD in these terms. This is largely due to the split between locales A, B and H and all the others. While gene flow between local groups is low, there is no clear spatial pattern in the distribution of the genetic diversity.
Given the erratic nature of the sampling, we cannot rule out that patterns of gene flow between populations apply, at a finer scale, to IBD, as it is outside the scope of this work. However, the use of these results is a key feature for our association mapping approach. IBD, as in general spatial structuration, can mirror environmental association, leading to high rates of false positives. This was demonstrated in , where association methods without correction for population structure (such as SAM  and outlier loci discovery methods) found more significant associations than justified from the data if run in conditions of IBD. This happens in association methods because both climate and genetic variability have strong spatial dependencies which might lead to bias when overlapped. Hierarchical structure tests are also known to be possibly biased by IBD [39, 70]. Our analyses showed extensive non-linear spatial structuration, as expected since the autogamous reproduction of B. distachyon. This finding is in line with a previous survey performed with 43 SSR markers on 56 Turkish populations , where B. distachyon accessions split into two distinct phylogenetic clades differing in terms of vernalization habits and morphological features without belonging clearly to a specific geographical area.
However, the absence of a diversity gradient did not rule out structuration. We thus performed our association approach by considering structuration in order to avoid overrepresentation of false positives. This was done both with a hierarchical structure and a PCA with LFMM and CMLM, respectively, and we showed that the two different approaches yield similar results though differing in magnitude in terms of the statistical association found.
Our results are an empirical confirmation of what emerged in a simulation study testing the performance of five outlier-based and three correlation methods under explicit models for selection, demography and spatial relations . In that study, the outlier detection implemented in Bayescan outperformed the other methods under any migration model, while all correlation-based methods proved powerful yet prone to bias due to structuration within and among populations. Nevertheless, if coupled with methods accounting for cryptic genomic structure, such methods could reduce type I and type II errors, especially in autogamous species. The portion of differentiated loci was in line with other studies , confirming that the use of a conservative FDR threshold (5%) and SNP filtering lowered the noise resulting from the use of a high number of polymorphisms.
In-gene polymorphisms are not the sole ones involved in environmental adaptation . In fact, SNPs in genes and 5 kb window upstream of the genes (i.e. potentially involved in the regulation of gene expression) show an almost equal contribution to significant associations . This also emerges from our association and outlier loci analyses, which revealed the EASs and outlier loci were enriched for genic and gene-related regions (Table 5).
Lack of congruence between methods
An interesting point concerns the differences that emerged between outlier and association methods, which here report little loci in common. This result does not seem to fit the early tendency of seeing outlier loci as a confirmation for EAS validity and vice versa [69, 74]. Instead it highlights that association and outlier analyses estimate complementary aspects of functional adaptation, as recently suggested in similar studies .
The association approach is not dependent upon population genetic parameters, instead it targets a limited set of quantitative environmental characteristics. Complex traits targeted by means of correlative approaches, and especially those regarding climatic adaptation, are expected to reveal small changes in allele frequencies that push populations to a new optimum . In this sense, polygenic selection  would seem to favour the simultaneous presence of multiple alleles rather than a complete fixation at the loci involved , resulting in the co-occurrence of different haplotypes at any given genomic location . This contributes to the lack of congruence between the two methods, as a fainter signature of selection is less likely to be detected by outlier detection methods . Unsurprisingly, a low intensity selection causes Bayescan to fail the most . In addition, the LD-correction for false discovery rate possibly has an excessive number of type II errors . However, these kinds of studies benefit from a more conservative threshold than from a permissive approach. A few loci are in fact expected to have high enough effects to be confidently detected.
Conversely, outlier methods do not depend, at least not directly, on environmental data. Loci identified by Bayescan but not by association methods might represent a set of loci under selection from factors not considered in the association analysis, such as fire regimes, soil composition, anthropic disturbance, grazing pressure, pathogens, and so on. Outlier methods are also affected by the assumptions about the null distribution used to compare loci, making the demographic history and structure of populations able to bias the outcome of the analysis [79, 80]. We argue that, at the net of false positives and negatives that might be effectively but not completely controlled by both methods, loci identified by both methods represent alternative portions of adaptive variation. Outliers represent the pool of loci under the strongest selection, whereas EASs represent the sum of the present and historical multilocus variations related to the environmental features considered.
A closer evaluation of the genes related to EASs identified by both the association methods provided a varied set of putative functions (Table 4). The annotation of Brachypodium distachyon is currently based mostly on in silico models, and therefore needs a careful evaluation of the functional relevance of EASs, which was outside the scopes of our experiments. Yet, we identified a set of genes, including a MYB transcription factor pointed by association and outlier loci, which already suggests the potential downstream applicability of these methods. Owing to the nature of LD, however, a less-than complete coverage sequencing cannot achieve the single-gene definition in association: our analyses revealed that the genome of our B. distachyon collection could be split into 734 LD blocks. To achieve a higher definition, more recombination events should be sampled, i.e. more individuals are needed. This is one of the strengths of this approach: since it is modular it allows the stratification of environmental and biological data in an integrated framework to map for adaptation in B. distachyon.
We strongly support the application of next generation sequencing approaches to landscape genomics as a fast and modular tool for the discovery of adaptive traits, particularly in sequenced species. The application of landscape genomics to plants akin to crops can directly address adaptive variation that would be of great interest from an applied perspective. We noted that, when structuration is accounted for, the methodological effort to discover loci responsible for environmental adaptation might trace back to GWAS. This means that advances and statistics built by the complex trait mapping community could be exploited to gather information in the field.
Our results derive from a modular method that can be extended in order to deal with any relevant environmental questions. Although our initial set of genotypes and environmental variables is limited, we believe that this and similar collections will soon be enlarged to provide a better capacity to map environmental adaptation. B. distachyon - like other model species - is thus not only an effective laboratory tool, but also a natural probe. By exploiting their geographical distribution, these model species could be used to identify functional variation, and ultimately genomic loci, whose evolution was shaped for survival well before artificial selection took place. We envisage this approach being directly applied to crops, focusing either on their wild relatives or landraces, to cleverly incorporate in agronomy the results of natural selection efforts.
GIS analysis and sampling
The plant material studied comes from B. distachyon seeds collected in Turkey . We focused on B. distachyon populations spanning from the western Dardanelles strait to the eastern region beyond lake Tuz in order to cover a continuous and comprehensive environmental gradient. This region was analysed by coupling DIVA GIS and BioClim data derived from Worldclim 2.5 minutes data (years ~1950-2000, ~5 Km) . The function of the most limiting factor in Ecocrop model in DIVA GIS was used to identify a subset of locations maximizing climatic differences, reporting for each grid (5 × 5 Km) the BioClim variable with the lowest score with regard to general biological features for grasses.
A subset of nine local groups was chosen accordingly (Figure 1; Table 1). The sampling point C2 was chosen nearby C for control purposes. In each location, a minimum of 10 individuals were sampled as individual spikes bearing mature seeds. To ensure that the sampled individuals were reproducing (i.e. had non-zero fitness), we collected seeds rather than green tissues. Collection points were associated with GPS coordinates (±6 m), hence WGS84 coordinates were used to extract local altitude values and BioClim data from the Worldclim 2.5 database. BioClim is made up of 19 variables, the result of processing raw measures of rainfall and temperature. Using the full set of BioClim variables in correlation analyses might result in augmented noise without any real information gain , thus a PCA was conducted in R  over the 20 normalized environmental variables to extract the first three PCs.
At least five seeds from each spike were pooled, and all sample pools underwent the same germination routine. Seeds representing each original individual were sown in separate Petri dishes with moist turf and underwent six weeks of vernalization in the dark. Seeds were then transferred to 1:1 turf and pebbly soil, and germinated in separated pots in a growth chamber (16 h 25°C light/ 8 h 21°C dark). Green tissues were collected in equal proportions from the resulting seedlings, so as to reconstitute the full allelic set of each original natural accession. Genomic DNA was extracted using the GeneElute Plant Genomic DNA Miniprep extraction kit (Sigma-Aldrich, St Louis, MO) following the suggested protocol. Four inbred lines developed by Dr. John Vogel in Albany, CA, USA, and the Bd21 inbred lines were added to the sample pool as reference. A total of 96 samples were selected for the following analyses.
The Genotyping-by-Sequencing (GBS) protocol is based on genome complexity reduction and multiplexed DNA sequencing for SNP discovery . The protocol required a new adapter titration before being applied to B. distachyon. Total genomic DNA was digested with ApeKI restriction enzyme (120’ at 75°C; New England Biolabs, Ipswich, MA). Adapters were titrated by ligating Bd21 genomic fragments to increasing concentrations of adapters in separate reactions, then piping them through GBS library construction. After the library quality had been evaluated on a Bioanalyzer 2100 (Agilent Technologies, Palo Alto, CA), 6 ng of adapters per 100 ng of genomic DNA were deemed appropriate for all samples.
After adapter ligation with T4 ligase (New England Biolabs, Ipswich, MA) for 60’ at 22°C, then 30’ at 64°C, samples were pooled in two 48-plex cohorts and subjected to PCR amplification with high-fidelity Phusion DNA polymerase (New England Biolabs, Ipswich, MA) using adapter-specific primers. The two 48-plex libraries were treated following the Illumina pair-end sequencing protocol, and then sequenced in separate lanes on a Genome Analyzer II (Illumina, Inc., San Diego, CA) at IGA Services, Udine, Italy.
An ad hoc script, available upon request, was used to carry out the following process on GBS Illumina reads: i) reads were sorted according to their barcode, ii) barcodes were removed from reads, iii) reads were trimmed according to their overall quality using the rNA program . Trimmed reads were mapped onto the B. distachyon reference genome  using BWA software  run with the following settings: −n 3 -o 1 -e 1 -l 28, i.e. allowing three mismatches, disallowing long gaps, and using a seed length of 28 nucleotides. The results were analysed using the GATK pipeline . GATK was used as it is the gold standard of SNP calls [87, 88]. At the time of the analyses Tassel software  was not capable of analysing paired-end sequencing data, and thus would have caused the loss of much information. The recommended identification and realignment of questionable aligned regions was carried out, and the actual SNP calls were made using the following settings: −stand_call_conf 50.0 -stand_emit_conf 10.0 -dcov 500 -out_mode EMIT_ALL_CONFIDENT_SITES. Alignments were edited and reformatted using SAM tools  and Picard tools (http://picard.sourceforge.net). Samples below the 9th percentile of the distribution of read counts were discarded, thus reducing the number of individuals from 96 to 87, of which 82 were from field collection. Reads were mapped on the reference Bd21 genome sequence, and polymorphic positions were extracted.
The vcf files produced by GATK were parsed using a Perl script (available upon request): the analysis was limited to SNPs deemed as having PASSED by GATK (Phred-like quality score 50, i.e. α < 0.001%). All polymorphic positions missing in over 20% of the samples were discarded, and loci were filtered for minor allele frequency (MAF) of 5%.
The reference genome was split into arbitrary 1,000 bp bins, and the amount of reads mapped per bin per sample was counted to assess the consistency of the distribution of the reads. We labeled as Presence/Absence (P/A) regions those bins that were present in the reference genome but did not produce any read in any of the samples from one to eight of the groups (A-H) tested. In “absence” bins, no samples sharing the same geographical origin mapped any read, whilst one or more of the other groups did (with at least 1,000 sequenced reads per sample mapping on average). The content of transposable elements (TE) was assessed separately for P/A and non P/A regions using RepeatMasker  and a collection of B. distachyon TE as a repeat library (ftp://ftpmips.helmholtz-muenchen.de/plants/brachypodium).
A phylogeny comprising both natural accessions and inbred lines was derived from shared SNPs. SplitsTree4  was used to build a NJ phylogeny based on uncorrected P distances, and bootstrapping was used in 1000 replicates to build a bootstrap network based on all the alternative splits that had occurred . The degree of kinship among individuals was estimated from molecular data in R/GAPIT  using VanRaden's  method. P/A regions were used to derive binary markers (1/0) to mark the presence or absence of sequences in each genomic bin in each local group, and a distance matrix was calculated on the basis of Jaccard distances, hence considering shared states only. This method does not require any assumption on the biological nature of P/A regions.
Gene flow dynamics underlying the geographical sampling can affect the results of the analyses, and need to be considered in landscape genomics practises [39, 71]. Genepop 4.1.4  was used to estimate Wright’s fixation index (Fst) . The genetic distance among local groups was measured as the conditional genetic distance (cGD) , a measure derived from population graphs , which by accounting for spatial variance outperformed classical measures of genetic distance [96, 98]. In a population graph each population or group of individuals is identified by a node on a graph, and nodes are connected by edges whose length (cGD) is inversely related to the genetic covariance between populations. Null length, i.e. unconnected nodes, represent populations lacking allelelic exchange. cGD values were regressed over spatial distances.
The spatial pattern of genetic diversity was explored at a finer scale with a spatial PCA  in R/adegenet . This method summarizes both the spatial structure and the genetic diversity among individuals, thus enabling global and local spatial structures to be differentiated. Structure  was used in admixture model to survey the number of cryptic genetic clusters (K) present in the dataset. The most likely K was identified by structure harvester .
Where Y is the vector of phenotypic/climate values, and X and Z are the known design matrix. The fixed effects (genetic marker, intercept and population structure (Q)) are represented by the unknown vector β; random additive genetic effects are represented by the unknown vector u, while e represents the non-observed residuals. Kinship is included in the computation of u and e variance. The most significant PCs computed over molecular markers and the Structure clustering were evaluated as Q by assessing the normal fit of the model on quantile-quantile plots.
To control for false positives we applied an LD-corrected Bonferroni. The Bonferroni method is conservative in that it divides the target threshold (e.g. 0.05) by the number of tests performed. However GWAS is not necessarily a collection of completely independent tests [78, 106]. This is because the genetic and functional linkage among markers, expressed by LD, causes SNPs to be inherited in linkage blocks rather than independently. This is especially true in natural populations of autogamous plants with extensive LD . R/trio  was used to compute pairwise LD in 500 marker windows (8 Mbp on average). The normalized D’ LD measure was used to identify LD blocks where strong LD was defined by an upper confidence bound of D’ > 0.98 and a lower confidence bound of D’ > 0.7. Strong evidence of recombination was provided wherever the upper bound of D’ was lower than 0.9, according to Gabriel's method . We established a threshold corresponding to one false association out of ten (0.1) and divided it by the number of linkage blocks in order to have LD-corrected Bonferroni FDR.
The same dataset was tested to detect outlier loci (i.e. loci under selection) using Bayescan 2.1 . This method entails decomposing Fst values in a locus-specific component (α; shared by all populations), and a population-specific component (β; shared by all loci). The departure of α from the equilibrium suggests selection operating on a given locus. The 5% FDR threshold provided by Bayescan was used as a significance threshold. Brachypodium distachyon genome V1.2 annotation (ftp://ftpmips.helmholtz-muenchen.de/plants/brachypodium/v1.2) was used to locate EASs and outliers either more than 5 kb upstream, within 5 kb upstream, and within predicted genes with R/GenomicRanges . The limit of 5 kbp was chosen as being representative of possible cis regulatory regions . To avoid redundancy, SNPs falling at the same time into a predicted genic region and 5 kb upstream of another predicted genic region, were considered once and genic only. The list of outliers was compared with that of the EASs significant for either of the two association methods. SNPs identified by at least two methods were further discussed as strong adaptation candidates.
P/A regions as binary markers were used in a canonical correspondence analysis (CCA)  with R/vegan . A CCA is used in ecological studies to evaluate the amount of variability of a matrix of observations X is explained by a matrix of descriptive variables Y referring to the same sites where observations are made. Typically, CCA is used to assess the unconstrained relation between environmental factors and species distribution, but can also be used to associate climate gradients with molecular data . We used CCA to evaluate the linear relation existing between P/A regions and environmental PC with 999 permutations.
This work was fully supported by the Doctoral Programme in Agrobiodiversity of Scuola Superiore Sant’Anna, Pisa, Italy.
- Feder ME, Mitchell-Olds T: Evolutionary and ecological functional genomics. Nat Rev Genet. 2003, 4: 649-655. 10.1038/nrg1128.Google Scholar
- Storz JF: Using genome scans of DNA polymorphism to infer adaptive population divergence. Mol Ecol. 2005, 14: 671-688. 10.1111/j.1365-294X.2005.02437.x.PubMedGoogle Scholar
- Hughes AL: Adaptive Evolution of Genes and Genomes. 1999, New York: Oxford University PressGoogle Scholar
- Mauricio R: Mapping quantitative trait loci in plants: uses and caveats for evolutionary biology. Nat Rev Genet. 2001, 2: 370-381. 10.1038/35072085.PubMedGoogle Scholar
- DeRose-Wilson L, Gaut BS: Mapping salinity tolerance during Arabidopsis thaliana germination and seedling growth. PloS One. 2011, 6: e22832-10.1371/journal.pone.0022832.PubMed CentralPubMedGoogle Scholar
- Almeida GD, Makumbi D, Magorokosho C, Nair S, Borém A, Ribaut J-M, Bänziger M, Prasanna BM, Crossa J, Babu R: QTL mapping in three tropical maize populations reveals a set of constitutive and adaptive genomic regions for drought tolerance. TAG Theor Appl Genet Theor Angew Genet. 2013, 126: 583-600. 10.1007/s00122-012-2003-7.Google Scholar
- Motomura Y, Kobayashi F, Iehisa JCM, Takumi S: A major quantitative trait locus for cold-responsive gene expression is linked to frost-resistance gene Fr-A2 in common wheat. Breed Sci. 2013, 63: 58-67. 10.1270/jsbbs.63.58.PubMed CentralPubMedGoogle Scholar
- Collard BCY, Mackill DJ: Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philos Trans R Soc Lond B Biol Sci. 2008, 363: 557-572. 10.1098/rstb.2007.2170.PubMed CentralPubMedGoogle Scholar
- Mir RR, Zaman-Allah M, Sreenivasulu N, Trethowan R, Varshney RK: Integrated genomics, physiology and breeding approaches for improving drought tolerance in crops. TAG Theor Appl Genet Theor Angew Genet. 2012, 125: 625-645. 10.1007/s00122-012-1904-9.Google Scholar
- Haudry A, Cenci A, Ravel C, Bataillon T, Brunel D, Poncet C, Hochu I, Poirier S, Santoni S, Glémin S, David J: Grinding up wheat: a massive loss of nucleotide diversity since domestication. Mol Biol Evol. 2007, 24: 1506-1517. 10.1093/molbev/msm077.PubMedGoogle Scholar
- Feuillet C, Langridge P, Waugh R: Cereal breeding takes a walk on the wild side. Trends Genet TIG. 2008, 24: 24-32. 10.1016/j.tig.2007.11.001.Google Scholar
- Zamir D: Improving plant breeding with exotic genetic libraries. Nat Rev Genet. 2001, 2: 983-989. 10.1038/35103590.PubMedGoogle Scholar
- Hajjar R, Hodgkin T: The use of wild relatives in crop improvement: a survey of developments over the last 20 years. Euphytica. 2007, 156: 1-13. 10.1007/s10681-007-9363-0.Google Scholar
- Cavalli-Sforza LL, Menozzi P, Piazza A: The History and Geography of Human Genes. 1996, Princeton, N.J.: Abridged edition. Princeton University PressGoogle Scholar
- Hancock AM, Witonsky DB, Alkorta-Aranburu G, Beall CM, Gebremedhin A, Sukernik R, Utermann G, Pritchard JK, Coop G, Di Rienzo A: Adaptations to climate-mediated selective pressures in humans. PLoS Genet. 2011, 7: e1001375-10.1371/journal.pgen.1001375.PubMed CentralPubMedGoogle Scholar
- Fournier-Level A, Korte A, Cooper MD, Nordborg M, Schmitt J, Wilczek AM: A map of local adaptation in Arabidopsis thaliana. Science. 2011, 334: 86-89. 10.1126/science.1209271.PubMedGoogle Scholar
- Manel S, Schwartz MK, Luikart G, Taberlet P: Landscape genetics: combining landscape ecology and population genetics. Trends Ecol Evol. 2003, 18: 189-197. 10.1016/S0169-5347(03)00008-9.Google Scholar
- Storfer A, Murphy MA, Evans JS, Goldberg CS, Robinson S, Spear SF, Dezzani R, Delmelle E, Vierling L, Waits LP: Putting the “landscape” in landscape genetics. Heredity. 2007, 98: 128-142. 10.1038/sj.hdy.6800917.PubMedGoogle Scholar
- Wagner HH, Fortin M-J: A conceptual framework for the spatial analysis of landscape genetic data. Conserv Genet. 2013, 14: 253-261. 10.1007/s10592-012-0391-5.Google Scholar
- Schoville SD, Bonin A, François O, Lobreaux S, Melodelima C, Manel S: Adaptive Genetic Variation on the Landscape: Methods and Cases. Annu Rev Ecol Evol Syst. 2012, 43: 23-43. 10.1146/annurev-ecolsys-110411-160248.Google Scholar
- Wright S: The Interpretation of Population Structure by F-Statistics with Special Regard to Systems of Mating. Evolution. 1965, 19: 395-10.2307/2406450.Google Scholar
- Beaumont MA, Nichols RA: Evaluating Loci for Use in the Genetic Analysis of Population Structure. Proc R Soc Lond B Biol Sci. 1996, 263: 1619-1626. 10.1098/rspb.1996.0237.Google Scholar
- Nielsen EE, Hemmer-Hansen J, Poulsen NA, Loeschcke V, Moen T, Johansen T, Mittelholzer C, Taranger G-L, Ogden R, Carvalho GR: Genomic signatures of local directional selection in a high gene flow marine organism; the Atlantic cod (Gadus morhua). BMC Evol Biol. 2009, 9: 276-10.1186/1471-2148-9-276.PubMed CentralPubMedGoogle Scholar
- DeFaveri J, Jonsson PR, Merilä J: Heterogeneous Genomic Differentiation in marine threespine sticklebacks: adaptation along an environmental gradient. Evol Int J Org Evol. 2013, 67: 2530-2546. 10.1111/evo.12097.Google Scholar
- Bothwell H, Bisbing S, Therkildsen NO, Crawford L, Alvarez N, Holderegger R, Manel S: Identifying genetic signatures of selection in a non-model species, alpine gentian (Gentiana nivalis L.), using a landscape genetic approach. Conserv Genet. 2013, 14: 467-481. 10.1007/s10592-012-0411-5.Google Scholar
- Narum SR, Hess JE: Comparison of F(ST) outlier tests for SNP loci under selection. Mol Ecol Resour. 2011, 11 (Suppl 1): 184-194.PubMedGoogle Scholar
- Hirschhorn JN, Daly MJ: Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005, 6: 95-108.PubMedGoogle Scholar
- Ingvarsson PK, Street NR: Association genetics of complex traits in plants. New Phytol. 2011, 189: 909-922. 10.1111/j.1469-8137.2010.03593.x.PubMedGoogle Scholar
- Weigel D: Natural variation in Arabidopsis: from molecular genetics to ecological genomics. Plant Physiol. 2012, 158: 2-22. 10.1104/pp.111.189845.PubMed CentralPubMedGoogle Scholar
- Hamblin MT, Buckler ES, Jannink J-L: Population genetics of genomics-based crop improvement methods. Trends Genet TIG. 2011, 27: 98-106. 10.1016/j.tig.2010.12.003.Google Scholar
- Eckert AJ, van Heerwaarden J, Wegrzyn JL, Nelson CD, Ross-Ibarra J, González-Martínez SC, Neale DB: Patterns of population structure and environmental associations to aridity across the range of loblolly pine (Pinus taeda L., Pinaceae). Genetics. 2010, 185: 969-982. 10.1534/genetics.110.115543.PubMed CentralPubMedGoogle Scholar
- Eckert AJ, Bower AD, González-Martínez SC, Wegrzyn JL, Coop G, Neale DB: Back to nature: ecological genomics of loblolly pine (Pinus taeda, Pinaceae). Mol Ecol. 2010, 19: 3789-3805. 10.1111/j.1365-294X.2010.04698.x.PubMedGoogle Scholar
- Poncet BN, Herrmann D, Gugerli F, Taberlet P, Holderegger R, Gielly L, Rioux D, Thuiller W, Aubert S, Manel S: Tracking genes of ecological relevance using a genome scan in two independent regional population samples of Arabis alpina. Mol Ecol. 2010, 19: 2896-2907. 10.1111/j.1365-294X.2010.04696.x.PubMedGoogle Scholar
- Hermisson J, Pennings PS: Soft Sweeps. Genetics. 2005, 169: 2335-2352. 10.1534/genetics.104.036947.PubMed CentralPubMedGoogle Scholar
- Hancock AM, Witonsky DB, Ehler E, Alkorta-Aranburu G, Beall C, Gebremedhin A, Sukernik R, Utermann G, Pritchard J, Coop G, Di Rienzo A: Colloquium paper: human adaptations to diet, subsistence, and ecoregion are due to subtle shifts in allele frequency. Proc Natl Acad Sci U S A. 2010, 107 (Suppl 2): 8924-8930.PubMed CentralPubMedGoogle Scholar
- Keller SR, Levsen N, Olson MS, Tiffin P: Local adaptation in the flowering-time gene network of balsam poplar, Populus balsamifera L. Mol Biol Evol. 2012, 29: 3143-3152. 10.1093/molbev/mss121.PubMedGoogle Scholar
- Pyhäjärvi T, Hufford MB, Mezmouk S, Ross-Ibarra J: Complex Patterns of Local Adaptation in Teosinte. Genome Biol Evol. 2013, 5: 1594-1609. 10.1093/gbe/evt109.PubMed CentralPubMedGoogle Scholar
- Yoder JB, Stanton-Geddes J, Zhou P, Briskine R, Young ND, Tiffin P: Genomic Signature of Adaptation to Climate in Medicago truncatula. Genetics. 2014, 196: 1263-1275. 10.1534/genetics.113.159319.PubMed CentralPubMedGoogle Scholar
- Meirmans PG: The trouble with isolation by distance. Mol Ecol. 2012, 21: 2839-2846. 10.1111/j.1365-294X.2012.05578.x.PubMedGoogle Scholar
- Kozak KH, Graham CH, Wiens JJ: Integrating GIS-based environmental data into evolutionary biology. Trends Ecol Evol. 2008, 23: 141-148. 10.1016/j.tree.2008.02.001.PubMedGoogle Scholar
- Chan LM, Brown JL, Yoder AD: Integrating statistical genetic and geospatial methods brings new power to phylogeography. Mol Phylogenet Evol. 2011, 59: 523-537. 10.1016/j.ympev.2011.01.020.PubMedGoogle Scholar
- Hijmans RJ, Guarino L, Cruz M, Rojas E: Computer tools for spatial analysis of plant genetic resources data: 1. DIVA-GIS. Plant Genet Resour Newsl. 2001, 127: 15-19.Google Scholar
- Mitchell-Olds T, Willis JH, Goldstein DB: Which evolutionary processes influence natural genetic variation for phenotypic traits?. Nat Rev Genet. 2007, 8: 845-856. 10.1038/nrg2207.PubMedGoogle Scholar
- Mitchell-Olds T: Complex-trait analysis in plants. Genome Biol. 2010, 11: 423-Google Scholar
- Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA: Rapid SNP discovery and genetic mapping using sequenced RAD markers. PloS One. 2008, 3: e3376-10.1371/journal.pone.0003376.PubMed CentralPubMedGoogle Scholar
- Pasaniuc B, Rohland N, McLaren PJ, Garimella K, Zaitlen N, Li H, Gupta N, Neale BM, Daly MJ, Sklar P, Sullivan PF, Bergen S, Moran JL, Hultman CM, Lichtenstein P, Magnusson P, Purcell SM, Haas DW, Liang L, Sunyaev S, Patterson N, de Bakker PIW, Reich D, Price AL: Extremely low-coverage sequencing and imputation increases power for genome-wide association studies. Nat Genet. 2012, 44: 631-635. 10.1038/ng.2283.PubMed CentralPubMedGoogle Scholar
- Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE: A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PloS One. 2011, 6: e19379-10.1371/journal.pone.0019379.PubMed CentralPubMedGoogle Scholar
- Narum SR, Buerkle CA, Davey JW, Miller MR, Hohenlohe PA: Genotyping-by-sequencing in ecological and conservation genomics. Mol Ecol. 2013, 22: 2841-2847. 10.1111/mec.12350.PubMed CentralPubMedGoogle Scholar
- Draper J, Mur LAJ, Jenkins G, Ghosh-Biswas GC, Bablak P, Hasterok R, Routledge APM: Brachypodium distachyon. A New Model System for Functional Genomics in Grasses. Plant Physiol. 2001, 127: 1539-1555. 10.1104/pp.010196.PubMed CentralPubMedGoogle Scholar
- Opanowicz M, Vain P, Draper J, Parker D, Doonan JH: Brachypodium distachyon: making hay with a wild grass. Trends Plant Sci. 2008, 13: 172-177. 10.1016/j.tplants.2008.01.007.PubMedGoogle Scholar
- Catalán P, Müller J, Hasterok R, Jenkins G, Mur LAJ, Langdon T, Betekhtin A, Siwinska D, Pimentel M, López-Alvarez D: Evolution and taxonomic split of the model grass Brachypodium distachyon. Ann Bot. 2012, 109: 385-405. 10.1093/aob/mcr294.PubMed CentralPubMedGoogle Scholar
- International Brachypodium Initiative: Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature. 2010, 463: 763-768. 10.1038/nature08747.Google Scholar
- Vogel JP, Tuna M, Budak H, Huo N, Gu YQ, Steinwand MA: Development of SSR markers and analysis of diversity in Turkish populations of Brachypodium distachyon. BMC Plant Biol. 2009, 9: 88-10.1186/1471-2229-9-88.PubMed CentralPubMedGoogle Scholar
- Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006, 38: 904-909. 10.1038/ng1847.PubMedGoogle Scholar
- Vilhjálmsson BJ, Nordborg M: The nature of confounding in genome-wide association studies. Nat Rev Genet. 2013, 14: 1-2. 10.1038/nri3591.PubMedGoogle Scholar
- Slavov GT, Nipper R, Robson P, Farrar K, Allison GG, Bosch M, Clifton-Brown JC, Donnison IS, Jensen E: Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis. New Phytol. 2014, 201: 1227-1239. 10.1111/nph.12621.PubMed CentralPubMedGoogle Scholar
- Bailey-Serres J, Vangala S, Szick K, Lee C: Acidic Phosphoprotein Complex of the 60S Ribosomal Subunit of Maize Seedling Roots (Components and Changes in Response to Flooding). Plant Physiol. 1997, 114: 1293-1305. 10.1104/pp.114.4.1293.PubMed CentralPubMedGoogle Scholar
- Ambawat S, Sharma P, Yadav NR, Yadav RC: MYB transcription factor genes as regulators for plant responses: an overview. Physiol Mol Biol Plants. 2013, 19: 307-321. 10.1007/s12298-013-0179-1.PubMed CentralPubMedGoogle Scholar
- Todd J, Post-Beittenmiller D, Jaworski JG: KCS1encodes a fatty acid elongase 3-ketoacyl-CoA synthase affecting wax biosynthesis in Arabidopsis thaliana. Plant J. 1999, 17: 119-130. 10.1046/j.1365-313X.1999.00352.x.PubMedGoogle Scholar
- Huang X, Lu T, Han B: Resequencing rice genomes: an emerging new era of rice genomics. Trends Genet. 2013, 29: 225-232. 10.1016/j.tig.2012.12.001.PubMedGoogle Scholar
- Cao J, Schneeberger K, Ossowski S, Günther T, Bender S, Fitz J, Koenig D, Lanz C, Stegle O, Lippert C, Wang X, Ott F, Müller J, Alonso-Blanco C, Borgwardt K, Schmid KJ, Weigel D: Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet. 2011, 43: 956-963. 10.1038/ng.911.PubMedGoogle Scholar
- Beissinger TM, Hirsch CN, Sekhon RS, Foerster JM, Johnson JM, Muttoni G, Vaillancourt B, Buell CR, Kaeppler SM, de Leon N: Marker Density and Read Depth for Genotyping Populations Using Genotyping-by-Sequencing. Genetics. 2013, 193: 1073-1081. 10.1534/genetics.112.147710.PubMed CentralPubMedGoogle Scholar
- Takuno S, Gaut BS: Gene body methylation is conserved between plant orthologs and is of evolutionary consequence. Proc Natl Acad Sci U S A. 2013, 110: 1797-1802. 10.1073/pnas.1215380110.PubMed CentralPubMedGoogle Scholar
- Eichten SR, Briskine R, Song J, Li Q, Swanson-Wagner R, Hermanson PJ, Waters AJ, Starr E, West PT, Tiffin P, Myers CL, Vaughn MW, Springer NM: Epigenetic and genetic influences on DNA methylation variation in maize populations. Plant Cell. 2013, 25: 2783-2797. 10.1105/tpc.113.114793.PubMed CentralPubMedGoogle Scholar
- Charlesworth D, Pannell J: Mating systems and population genetic structure in the light of coalescent theory. Integrating Ecol Evol Spat Context. 2001, London: Blackwell Scientific, 73-95.Google Scholar
- Ingvarsson P: A Metapopulation Perspective on Genetic Diversity and Differentiation in Partially Self-Fertilizing Plants. Evolution. 2002, 56: 2368-2373. 10.1111/j.0014-3820.2002.tb00162.x.PubMedGoogle Scholar
- Hammami R, Jouve N, Soler C, Frieiro E, González JM: Genetic diversity of SSR and ISSR markers in wild populations of Brachypodium distachyon and its close relatives B. stacei and B. hybridum (Poaceae). Plant Syst Evol. 2014, doi:10.1007/s00606-014-1021-0Google Scholar
- Wright S: Isolation by Distance. Genetics. 1943, 28: 114-138.PubMed CentralPubMedGoogle Scholar
- Joost S, Bonin A, Bruford MW, Després L, Conord C, Erhardt G, Taberlet P: A spatial analysis method (SAM) to detect candidate loci for selection: towards a landscape genomics approach to adaptation. Mol Ecol. 2007, 16: 3955-3969. 10.1111/j.1365-294X.2007.03442.x.PubMedGoogle Scholar
- Frantz AC, Cellina S, Krier A, Schley L, Burke T: Using spatial Bayesian methods to determine the genetic structure of a continuously distributed population: clusters or isolation by distance?. J Appl Ecol. 2009, 46: 493-505. 10.1111/j.1365-2664.2008.01606.x.Google Scholar
- De Mita S, Thuillet A-C, Gay L, Ahmadi N, Manel S, Ronfort J, Vigouroux Y: Detecting selection along environmental gradients: analysis of eight methods and their effectiveness for outbreeding and selfing populations. Mol Ecol. 2013, 22: 1383-1399. 10.1111/mec.12182.PubMedGoogle Scholar
- Stinchcombe JR, Hoekstra HE: Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits. Heredity. 2008, 100: 158-170. 10.1038/sj.hdy.6800937.PubMedGoogle Scholar
- Li X, Zhu C, Yeh C-T, Wu W, Takacs EM, Petsch KA, Tian F, Bai G, Buckler ES, Muehlbauer GJ, Timmermans MCP, Scanlon MJ, Schnable PS, Yu J: Genic and nongenic contributions to natural variation of quantitative traits in maize. Genome Res. 2012, 22: 2436-2444. 10.1101/gr.140277.112.PubMed CentralPubMedGoogle Scholar
- Manel S, Conord C, Després L: Genome scan to assess the respective role of host-plant and environmental constraints on the adaptation of a widespread insect. BMC Evol Biol. 2009, 9: 288-10.1186/1471-2148-9-288.PubMed CentralPubMedGoogle Scholar
- Turchin MC, Chiang CWK, Palmer CD, Sankararaman S, Reich D, Hirschhorn JN, Genetic Investigation of ANthropometric Traits (GIANT) Consortium: Evidence of widespread selection on standing variation in Europe at height-associated SNPs. Nat Genet. 2012, 44: 1015-1019. 10.1038/ng.2368.PubMed CentralPubMedGoogle Scholar
- Cutter AD, Payseur BA: Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet. 2013, 14: 262-274. 10.1038/nrg3425.PubMed CentralPubMedGoogle Scholar
- Strasburg JL, Sherman NA, Wright KM, Moyle LC, Willis JH, Rieseberg LH: What can patterns of differentiation across plant genomes tell us about adaptation and speciation?. Philos Trans R Soc Lond B Biol Sci. 2012, 367: 364-373. 10.1098/rstb.2011.0199.PubMed CentralPubMedGoogle Scholar
- Johnson RC, Nelson GW, Troyer JL, Lautenberger JA, Kessing BD, Winkler CA, O’Brien SJ: Accounting for multiple comparisons in a genome-wide association study (GWAS). BMC Genomics. 2010, 11: 724-10.1186/1471-2164-11-724.PubMed CentralPubMedGoogle Scholar
- Teshima KM, Coop G, Przeworski M: How reliable are empirical genomic scans for selective sweeps?. Genome Res. 2006, 16: 702-712. 10.1101/gr.5105206.PubMed CentralPubMedGoogle Scholar
- Excoffier L, Hofer T, Foll M: Detecting loci under selection in a hierarchically structured population. Heredity. 2009, 103: 285-298. 10.1038/hdy.2009.74.PubMedGoogle Scholar
- Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A: Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 2005, 25: 1965-1978. 10.1002/joc.1276.Google Scholar
- Beaumont LJ, Hughes L, Poulsen M: Predicting species distributions: use of climatic parameters in BIOCLIM and its impact on predictions of species’ current and future distributions. Ecol Model. 2005, 186: 251-270. 10.1016/j.ecolmodel.2005.01.030.Google Scholar
- R Development Core Team: R: A Language and Environment for Statistical Computing. 2013, Vienna: Austria: R Foundation for Statistical ComputingGoogle Scholar
- Vezzi F, Del Fabbro C, Tomescu AI, Policriti A: rNA: a fast and accurate short reads numerical aligner. Bioinforma Oxf Engl. 2012, 28: 123-124. 10.1093/bioinformatics/btr617.Google Scholar
- Li H, Durbin R: Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.PubMed CentralPubMedGoogle Scholar
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20: 1297-1303. 10.1101/gr.107524.110.PubMed CentralPubMedGoogle Scholar
- Farrer RA, Henk DA, MacLean D, Studholme DJ, Fisher MC: Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects. Sci Rep. 2013, 3: 1512-PubMed CentralPubMedGoogle Scholar
- Liu X, Han S, Wang Z, Gelernter J, Yang B-Z: Variant Callers for Next-Generation Sequencing Data: A Comparison Study. PLoS ONE. 2013, 8: e75619-10.1371/journal.pone.0075619.PubMed CentralPubMedGoogle Scholar
- Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES: TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007, 23: 2633-2635. 10.1093/bioinformatics/btm308.PubMedGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The sequence alignment/map format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.PubMed CentralPubMedGoogle Scholar
- Smit A, Hubley R, Green P: RepeatMasker Open-3.0. 1996, [http://www.repeatmasker.org/]Google Scholar
- Huson DH: SplitsTree: analyzing and visualizing evolutionary data. Bioinforma Oxf Engl. 1998, 14: 68-73. 10.1093/bioinformatics/14.1.68.Google Scholar
- Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006, 23: 254-267.PubMedGoogle Scholar
- Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Gore MA, Buckler ES, Zhang Z: GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012, 28: 2397-2399. 10.1093/bioinformatics/bts444.PubMedGoogle Scholar
- VanRaden PM: Efficient methods to compute genomic predictions. J Dairy Sci. 2008, 91: 4414-4423. 10.3168/jds.2007-0980.PubMedGoogle Scholar
- Dyer RJ, Nason JD, Garrick RC: Landscape modelling of gene flow: improved power using conditional genetic distance derived from the topology of population networks. Mol Ecol. 2010, 19: 3746-3759. 10.1111/j.1365-294X.2010.04748.x.PubMedGoogle Scholar
- Dyer RJ, Nason JD: Population Graphs: the graph theoretic shape of genetic structure. Mol Ecol. 2004, 13: 1713-1727. 10.1111/j.1365-294X.2004.02177.x.PubMedGoogle Scholar
- Phillipsen IC, Lytle DA: Aquatic insects in a sea of desert: population genetic structure is shaped by limited dispersal in a naturally fragmented landscape. Ecography. 2013, 36: 731-743. 10.1111/j.1600-0587.2012.00002.x.Google Scholar
- Jombart T, Devillard S, Dufour A-B, Pontier D: Revealing cryptic spatial patterns in genetic variability by a new multivariate method. Heredity. 2008, 101: 92-103. 10.1038/hdy.2008.34.PubMedGoogle Scholar
- Jombart T, Ahmed I: adegenet 1.3–1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011, 27: 3070-3071. 10.1093/bioinformatics/btr521.PubMed CentralPubMedGoogle Scholar
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155: 945-959.PubMed CentralPubMedGoogle Scholar
- Earl DA, von Holdt BM: STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012, 4: 359-361. 10.1007/s12686-011-9548-7.Google Scholar
- Frichot E, Schoville SD, Bouchard G, François O: Testing for Associations between Loci and Environmental Gradients Using Latent Factor Mixed Models. Mol Biol Evol. 2013, 30: 1687-1699. 10.1093/molbev/mst063.PubMed CentralPubMedGoogle Scholar
- Zhang Z, Ersoz E, Lai C-Q, Todhunter RJ, Tiwari HK, Gore MA, Bradbury PJ, Yu J, Arnett DK, Ordovas JM: Mixed linear model approach adapted for genome-wide association studies. Nat Genet. 2010, 42: 355-360. 10.1038/ng.546.PubMed CentralPubMedGoogle Scholar
- Yu J, Pressoir G, Briggs WH, Bi IV, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006, 38: 203-208. 10.1038/ng1702.PubMedGoogle Scholar
- Gao X, Becker LC, Becker DM, Starmer JD, Province MA: Avoiding the high Bonferroni penalty in genome-wide association studies. Genet Epidemiol. 2010, 34: 100-105.PubMed CentralPubMedGoogle Scholar
- Flint-Garcia SA, Thornsberry JM, S E, IV B: Structure of Linkage Disequilibrium in Plants*. Annu Rev Plant Biol. 2003, 54: 357-374. 10.1146/annurev.arplant.54.031902.134907.PubMedGoogle Scholar
- Holger S, Qing L, Christoph N, Margaret Taub I, Ingo R: trio: Testing of SNPs and SNP Interactions in Case-Parent Trio Studies. 2014, [http://www.bioconductor.org/packages/release/bioc/html/trio.html]Google Scholar
- Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D: The structure of haplotype blocks in the human genome. Science. 2002, 296: 2225-2229. 10.1126/science.1069424.PubMedGoogle Scholar
- Foll M, Gaggiotti O: A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics. 2008, 180: 977-993. 10.1534/genetics.108.092221.PubMed CentralPubMedGoogle Scholar
- Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ: Software for Computing and Annotating Genomic Ranges. PLoS Comput Biol. 2013, 9: e1003118-10.1371/journal.pcbi.1003118.PubMed CentralPubMedGoogle Scholar
- Ter Braak CJF: Canonical Correspondence Analysis: A New Eigenvector Technique for Multivariate Direct Gradient Analysis. Ecology. 1986, 67: 1167-1179. 10.2307/1938672.Google Scholar
- Oksanen J, Blanchet F, Kindt R, Legendre P, Minchin P, O’Hara R, Simpson G, Solymos P, Stevens M, Wagner H: vegan: Community Ecology Package. 2013, [http://cran.r-project.org/web/packages/vegan/index.html]Google Scholar
- Sork VL, Aitken SN, Dyer RJ, Eckert AJ, Legendre P, Neale DB: Putting the landscape into the genomics of trees: approaches for understanding local adaptation and population responses to changing climate. Tree Genet Genomes. 2013, 9: 901-911. 10.1007/s11295-013-0596-x.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.