Horvath R, Josephs EB, Pesquet E, Stinchcombe JR, Wright SI, Scofield D, Slotte T
Mol. Biol. Evol. - (-) - [2021-09-08; online 2021-09-08]
Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large non-coding fraction, functional non-coding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional non-coding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favourable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element (TE) insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on non-coding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.
Bioinformatics Support for Computational Resources [Service]
NGI Stockholm (Genomics Applications) [Service]
NGI Stockholm (Genomics Production) [Service]
National Genomics Infrastructure [Service]
PubMed 34498072
DOI 10.1093/molbev/msab270
Crossref 10.1093/molbev/msab270
pii: 6366554