Section: Evolutionary Biology
Topic:
Evolution,
Genetics/Genomics,
Population biology
The landscape of nucleotide diversity in Drosophila melanogaster is shaped by mutation rate variation
10.24072/pcjournal.267 - Peer Community Journal, Volume 3 (2023), article no. e40.
Get full text PDF Peer reviewed and recommended by PCIWhat shapes the distribution of nucleotide diversity along the genome? Attempts to answer this question have sparked debate about the roles of neutral stochastic processes and natural selection in molecular evolution. However, the mechanisms of evolution do not act in isolation, and integrative models that simultaneously consider the influence of multiple factors on diversity are lacking; without them, confounding factors lurk in the estimates. Here we present a new statistical method that jointly infers the genomic landscapes of genealogies, recombination rates and mutation rates. In doing so, our model captures the effects of genetic drift, linked selection and local mutation rates on patterns of genomic variation. We then formalize a causal model of how these micro-evolutionary mechanisms interact, and cast it as a linear regression to estimate their individual contributions to levels of diversity along the genome. Our analyses reclaim the well-established signature of linked selection in Drosophila melanogaster, but we estimate that the mutation landscape is the major driver of the genome-wide distribution of diversity in this species. Furthermore, our simulation results suggest that in many evolutionary scenarios the mutation landscape will be a crucial factor shaping diversity, depending notably on the genomic window size. We argue that incorporating mutation rate variation into the null model of molecular evolution will lead to more realistic inferences in population genomics.
Type: Research article
Barroso, Gustavo V 1, 2; Dutheil, Julien Y 1, 3
@article{10_24072_pcjournal_267, author = {Barroso, Gustavo V and Dutheil, Julien Y}, title = {The landscape of nucleotide diversity in {\protect\emph{Drosophila} melanogaster} is shaped by mutation rate variation}, journal = {Peer Community Journal}, eid = {e40}, publisher = {Peer Community In}, volume = {3}, year = {2023}, doi = {10.24072/pcjournal.267}, url = {https://peercommunityjournal.org/articles/10.24072/pcjournal.267/} }
TY - JOUR AU - Barroso, Gustavo V AU - Dutheil, Julien Y TI - The landscape of nucleotide diversity in Drosophila melanogaster is shaped by mutation rate variation JO - Peer Community Journal PY - 2023 VL - 3 PB - Peer Community In UR - https://peercommunityjournal.org/articles/10.24072/pcjournal.267/ DO - 10.24072/pcjournal.267 ID - 10_24072_pcjournal_267 ER -
%0 Journal Article %A Barroso, Gustavo V %A Dutheil, Julien Y %T The landscape of nucleotide diversity in Drosophila melanogaster is shaped by mutation rate variation %J Peer Community Journal %D 2023 %V 3 %I Peer Community In %U https://peercommunityjournal.org/articles/10.24072/pcjournal.267/ %R 10.24072/pcjournal.267 %F 10_24072_pcjournal_267
Barroso, Gustavo V; Dutheil, Julien Y. The landscape of nucleotide diversity in Drosophila melanogaster is shaped by mutation rate variation. Peer Community Journal, Volume 3 (2023), article no. e40. doi : 10.24072/pcjournal.267. https://peercommunityjournal.org/articles/10.24072/pcjournal.267/
PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.evolbiol.100636
Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.
[1] Inferring the landscape of recombination using recurrent neural networks, bioRxiv (2019) | DOI
[2] Annotating non-coding regions of the genome, Nature Reviews Genetics, Volume 11 (2010) no. 8, pp. 559-571 | DOI
[3] Hitchhiking effects of recurrent beneficial amino acid substitutions in the Drosophila melanogaster genome, Genome Research, Volume 17 (2007) no. 12, pp. 1755-1762 | DOI
[4] Mechanisms in Molecular Biology, Cambridge University Press, 2019 | DOI
[5] Inference of recombination maps from a single pair of genomes and its application to ancient samples, PLOS Genetics, Volume 15 (2019) no. 11 | DOI
[6] gvbarroso/iSMC: v0.0.24 (Software package and Source code), Zenodo, 2023a | DOI
[7] gvbarroso/ismc_dm_analyses: v0.0.1 (Scripts), Zenodo, 2023b | DOI
[8] Quantifying the determinants of the genome-wide diversity in Drosophila using iSMC (Data), FigShare, 2023c | DOI
[9] Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster, Nature, Volume 356 (1992) no. 6369, pp. 519-520 | DOI
[10] Population genomics: Whole-genome analysis of polymorphism and divergence in Drosophila simulans, PLoS Biology, Volume 5 (2007) no. 11 | DOI
[11] Using genomic data to infer historic population dynamics of nonmodel organisms, Annual Review of Ecology, Evolution, and Systematics, Volume 49 (2018) no. 1, pp. 433-456 | DOI
[12] Variants of the protein PRDM9 differentially regulate a set of human meiotic recombination hotspots highly active in African populations, Proceedings of the National Academy of Sciences, Volume 108 (2011) no. 30, pp. 12378-12383 | DOI
[13] Direct estimation of mutations in great apes reconciles phylogenetic dating, Nature Ecology and Evolution, Volume 3 (2019) no. 2, pp. 286-292 | DOI
[14] Detecting positive selection in the genome, BMC Biology, Volume 15 (2017) no. 1 | DOI
[15] Variation in recombination rate affects detection of outliers in genome scans under neutrality, Molecular Ecology, Volume 29 (2020) no. 22, pp. 4274-4279 | DOI
[16] Natural selection is unlikely to explain why species get a thin slice of π, bioRxiv (2021) | DOI
[17] Molecular population genetics, Genetics, Volume 205 (2017) no. 3, pp. 1003-1035 | DOI
[18] Adaptive evolution is substantially impeded by Hill–Robertson interference in Drosophila, Molecular Biology and Evolution, Volume 33 (2016) no. 2, pp. 442-455 | DOI
[19] Impact of mutation rate and selection at linked sites on DNA variation across the genomes of humans and other Homininae, Genome Biology and Evolution, Volume 12 (2020) no. 1, pp. 3550-3561 | DOI
[20] Nearly neutral evolution across the Drosophila melanogaster genome, Molecular Biology and Evolution, Volume 35 (2018), pp. 2685-2694 | DOI
[21] Genome-wide fine-scale recombination rate variation in Drosophila melanogaster, PLoS Genetics, Volume 8 (2012) no. 12 | DOI
[22] Background selection 20 years on: The Wilhelmine E. Key 2012 invitational lecture, Journal of Heredity, Volume 104 (2013) no. 2, pp. 161-171 | DOI
[23] Molecular population genomics: a short history, Genetics Research, Volume 92 (2010) no. 5-6, pp. 397-411 | DOI
[24] Effective population size and patterns of molecular evolution and variation, Nature Reviews Genetics, Volume 10 (2009) no. 3, pp. 195-205 | DOI
[25] Population genetics from 1966 to 2016, Heredity, Volume 118 (2017) no. 1, pp. 2-9 | DOI
[26] The effect of deleterious mutations on neutral molecular variation, Genetics, Volume 134 (1993) no. 4, pp. 1289-1303 | DOI
[27] Background selection as null hypothesis in population genomics: insights and challenges from Drosophila studies, Philosophical Transactions of the Royal Society B: Biological Sciences, Volume 372 (2017) no. 1736 | DOI
[28] Background selection as baseline for nucleotide variation across the Drosophila genome, PLoS Genetics, Volume 10 (2014) no. 6 | DOI
[29] The many landscapes of recombination in Drosophila melanogaster, PLoS Genetics, Volume 8 (2012) no. 10 | DOI
[30] Ensembl 2022, Nucleic Acids Research, Volume 50 (2022) no. D1 | DOI
[31] Genomic signatures of selection at linked sites: Unifying the disparity among species, Nature Reviews Genetics, Volume 14 (2013) no. 4, pp. 262-274 | DOI
[32] Nonparametric coalescent inference of mutation spectrum history and demography, Proceedings of the National Academy of Sciences, Volume 118 (2021) no. 21 | DOI
[33] Biological sequence analysis: Probabilistic models of proteins and nucleic acids, Cambridge University Press, 1998 | DOI
[34] Towards more realistic models of genomes in populations: The Markov-modulated sequentially Markov coalescent, EMS Series of Congress Reports, EMS Press, 2021, pp. 383-408 | DOI
[35] Hidden Markov Models in Population Genomics, Methods in Molecular Biology, Springer New York, New York, NY, 2017, pp. 149-164 | DOI
[36] Determinants of genetic diversity, Nature Reviews Genetics, Volume 17 (2016) no. 7, pp. 422-433 | DOI
[37] A genomic map of the effects of linked selection in Drosophila, PLOS Genetics, Volume 12 (2016) no. 8 | DOI
[38] A Hidden Markov Model approach to variation among sites in rate of evolution, Molecular Biology and Evolution, Volume 13 (1996) no. 1, pp. 93-104 | DOI
[39] Regression diagnostics, Comprehensive Chemometrics, Elsevier, 2009, pp. 33-89 | DOI
[40] Genome-wide patterns and properties of de novo mutations in humans, Nature Genetics, Volume 47 (2015) no. 7, pp. 822-826 | DOI
[41] How much does Ne vary among species?, Genetics, Volume 216 (2020) no. 2, pp. 559-572 | DOI
[42] Fifteen years of genomewide scans for selection: Trends, lessons and unaddressed genetic sources of complication, Molecular Ecology, Volume 25 (2016) no. 1, pp. 5-23 | DOI
[43] Epigenetic modifications affect the rate of spontaneous mutations in a pathogenic fungus, Nature Communications, Volume 12 (2021) no. 1 | DOI
[44] The effect of variation of fitness, The American Naturalist, Volume 71 (1937) no. 735, pp. 337-349 | DOI
[45] SLiM 3: Forward genetic simulations beyond the Wright–Fisher model, Molecular Biology and Evolution, Volume 36 (2018) no. 3, pp. 632-637 | DOI
[46] Mutation rate variation is a primary determinant of the distribution of allele frequencies in humans, PLOS Genetics, Volume 12 (2016) no. 12 | DOI
[47] Population genomics on the fly: Recent advances in Drosophila, Methods in Molecular Biology, Springer US, New York, NY, 2020, pp. 357-396 | DOI
[48] Gene genealogies, variation and evolution: A primer in coalescent theory , Oxford University Press, Oxford, 2004 | DOI
[49] The neutralist, the fly and the selectionist, Trends in Ecology & Evolution, Volume 14 (1999) no. 1, pp. 35-38 | DOI
[50] Variation in the mutation rate across mammalian genomes, Nature Reviews Genetics, Volume 12 (2011) no. 11, pp. 756-766 | DOI
[51] Mapping gene flow between ancient hominins through demography-aware inference of the ancestral recombination graph, PLOS Genetics, Volume 16 (2020) no. 8 | DOI
[52] Properties of a neutral allele model with intragenic recombination, Theoretical Population Biology, Volume 23 (1983) no. 2, pp. 183-201 | DOI
[53] Deleterious background selection with recombination, Genetics, Volume 141 (1995) no. 4, pp. 1605-1617 | DOI
[54] Gene trees with background selection, Non-neutral evolution: Theories and molecular data, Springer US, Boston, MA, 1994, pp. 140-153 | DOI
[55] The coalescent process in models with selection and recombination, Genetics, Volume 120 (1988) no. 3, pp. 831-840 | DOI
[56] Statistical properties of the number of recombination events in the history of a sample of DNA sequences, Genetics, Volume 111 (1985) no. 1, pp. 147-164 | DOI
[57] A common genomic code for chromatin architecture and recombination landscape, PLOS ONE, Volume 14 (2019) no. 3 | DOI
[58] The importance of the Neutral Theory in 1968 and 50 years on: A response to Kern and Hahn 2018, Evolution, Volume 73 (2019) no. 1, pp. 111-114 | DOI
[59] Toward an evolutionarily appropriate null model: Jointly inferring demography and purifying selection, Genetics, Volume 215 (2020) no. 1, pp. 173-192 | DOI
[60] Multiple transmissions of de novo mutations in families, Nature Genetics, Volume 50 (2018) no. 12, pp. 1674-1680 | DOI
[61] Efficient coalescent simulation and genealogical analysis for large sample sizes, PLOS Computational Biology, Volume 12 (2016) no. 5 | DOI
[62] Efficient pedigree recording for fast population genetics simulation, PLOS Computational Biology, Volume 14 (2018) no. 11 | DOI
[63] The neutral theory in light of natural selection, Molecular Biology and Evolution, Volume 35 (2018) no. 6, pp. 1366-1371 | DOI
[64] Mutational signatures: From methods to mechanisms, Annual Review of Biomedical Data Science, Volume 4 (2021) no. 1, pp. 189-206 | DOI
[65] Evolutionary rate at the molecular level, Nature, Volume 217 (1968) no. 5129, pp. 624-626 | DOI
[66] On the genealogy of large populations, Journal of Applied Probability, Volume 19 (1982) no. A, pp. 27-43 | DOI
[67] The Drosophila genome nexus: A population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population, Genetics, Volume 199 (2015) no. 4, pp. 1229-1241 | DOI
[68] Strong purifying selection at synonymous sites in D. melanogaster, PLoS Genetics, Volume 9 (2013) no. 5 | DOI
[69] The genetic basis of evolutionary change, Columbia University Press, 1974
[70] Inference of human population history from individual whole-genome sequences, Nature, Volume 475 (2011) no. 7357, pp. 493-496 | DOI
[71] Rate, molecular spectrum, and consequences of human mutation, Proceedings of the National Academy of Sciences, Volume 107 (2010) no. 3, pp. 961-968 | DOI
[72] The cellular, developmental and population-genetic determinants of mutation-rate evolution, Genetics, Volume 180 (2008) no. 2, pp. 933-943 | DOI
[73] Genetic drift, selection and the evolution of the mutation rate, Nature Reviews Genetics, Volume 17 (2016) no. 11, pp. 704-714 | DOI
[74] Pervasive strong selection at the level of codon usage bias in Drosophila melanogaster, Genetics, Volume 214 (2020) no. 2, pp. 511-528 | DOI
[75] A genomic history of Aboriginal Australia, Nature, Volume 538 (2016) no. 7624, pp. 207-214 | DOI
[76] Fast "coalescent" simulation, BMC Genetics, Volume 7 (2006) no. 1 | DOI
[77] Non-random mutation: The evolution of targeted hypermutation and hypomutation, BioEssays, Volume 35 (2013) no. 2, pp. 123-130 | DOI
[78] Unlinked background selection reduces neutral diversity more than linked background selection, bioRxiv (2023) | DOI
[79] Recombination modulates how selection affects linked sites in Drosophila, PLoS Biology, Volume 10 (2012) no. 11 | DOI
[80] The structure of linkage disequilibrium around a selective sweep, Genetics, Volume 175 (2007) no. 3, pp. 1395-1406 | DOI
[81] Approximating the coalescent with recombination, Philosophical Transactions of the Royal Society B: Biological Sciences, Volume 360 (2005) no. 1459, pp. 1387-1393 | DOI
[82] Widespread genomic signatures of natural selection in Hominid evolution, PLoS Genetics, Volume 5 (2009) no. 5 | DOI
[83] Recent loss of the Dim2 DNA methyltransferase decreases mutation rate in repeats and changes evolutionary trajectory in a fungal pathogen, PLOS Genetics, Volume 17 (2021) no. 3 | DOI
[84] Mutation bias reflects natural selection in Arabidopsis thaliana, Nature, Volume 602 (2022) no. 7895, pp. 101-105 | DOI
[85] The impact of protein architecture on adaptive evolution, Molecular Biology and Evolution, Volume 36 (2019) no. 9, pp. 2013-2028 | DOI
[86] Broad-scale variation in human genetic diversity levels is predicted by purifying selection on coding and non-coding elements, eLife, Volume 11 (2022) | DOI
[87] Mutation-driven evolution, Oxford University Press, OUP Oxford, 2013
[88] The effect of recombination on background selection, Genetical Research, Volume 67 (1996) no. 2, pp. 159-174 | DOI
[89] The nearly neutral theory of molecular evolution, Annual Review of Ecology and Systematics, Volume 23 (1992) no. 1, pp. 263-286 | DOI
[90] How sequence context-dependent mutability drives mutation rate variation in the genome, Genome Biology and Evolution, Volume 14 (2022) no. 3 | DOI
[91] High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability, Nature Genetics, Volume 50 (2018) no. 9, pp. 1311-1317 | DOI
[92] The book of why: The new science of cause and effect, 1st ed. , Basic Books, Inc., United States of America, 2018
[93] Determining the effect of natural selection on linked neutral divergence across species, PLOS Genetics, Volume 12 (2016) no. 8 | DOI
[94] Towards an improved understanding of molecular evolution: the relative roles of selection, drift, and everything in between, Peer Community Journal, Volume 1 (2021) | DOI
[95] An efficient method for finding the minimum of a function of several variables without calculating derivatives, The Computer Journal, Volume 7 (1964) no. 2, pp. 155-162 | DOI
[96] An unusual suspect: the mutation landscape as a determinant of local variation in nucleotide diversity (Recommendation), PCI Evolutionary Biology, 2023 | DOI
[97] Genome-wide inference of ancestral recombination graphs, PLoS Genetics, Volume 10 (2014) no. 5 | DOI
[98] Genealogical trees, coalescent theory and the analysis of genetic polymorphisms, Nature Reviews Genetics, Volume 3 (2002) no. 5, pp. 380-390 | DOI
[99] Natural selection shapes variation in genome-wide recombination rate in Drosophila pseudoobscura, Current Biology, Volume 30 (2020) no. 8 | DOI
[100] Inferring human population size and separation history from multiple genome sequences, Nature Genetics, Volume 46 (2014) no. 8, pp. 919-925 | DOI
[101] MSMC and MSMC2: The multiple sequentially Markovian coalescent, Methods in Molecular Biology, Springer US, New York, NY, 2020, pp. 147-166 | DOI
[102] The impact of genetic surfing on neutral genomic diversity, Molecular Biology and Evolution, Volume 39 (2022) no. 11 | DOI
[103] Inference of past demography, dormancy and self-fertilization rates from whole genome sequence data, PLOS Genetics, Volume 16 (2020) no. 4 | DOI
[104] Linkage disequilibrium — understanding the evolutionary past and mapping the medical future, Nature Reviews Genetics, Volume 9 (2008) no. 6, pp. 477-485 | DOI
[105] The hitch-hiking effect of a favourable gene, Genetics Research, Volume 23 (1974), pp. 23-35 | DOI
[106] Large scale variation in the rate of germ-line de novo mutation, base composition, divergence and diversity in humans, PLOS Genetics, Volume 14 (2018) no. 3 | DOI
[107] Inference of population history using coalescent HMMs: Review and outlook, Current Opinion in Genetics & Development, Volume 53 (2018), pp. 70-76 | DOI
[108] scrm: Efficiently simulating long sequences using the approximated coalescent with recombination, Bioinformatics, Volume 31 (2015) no. 10, pp. 1680-1682 | DOI
[109] Widespread selection and gene flow shape the genomic landscape during a radiation of monkeyflowers, PLOS Biology, Volume 17 (2019) no. 7 | DOI
[110] The effect of strongly selected substitutions on neutral polymorphism: Analytical results based on diffusion theory, Theoretical Population Biology, Volume 41 (1992) no. 2, pp. 237-254 | DOI
[111] An approximate full-likelihood method for inferring selection and allele frequency trajectories from DNA sequence data, PLOS Genetics, Volume 15 (2019) no. 9 | DOI
[112] Drift-barrier hypothesis and mutation-rate evolution, Proceedings of the National Academy of Sciences, Volume 109 (2012) no. 45, pp. 18488-18492 | DOI
[113] Robust and scalable inference of population history from hundreds of unphased whole genomes, Nature Genetics, Volume 49 (2017) no. 2, pp. 303-309 | DOI
[114] Multinucleotide mutations cause false inferences of lineage-specific positive selection, Nature Ecology & Evolution, Volume 2 (2018) no. 8, pp. 1280-1288 | DOI
[115] Analysis of a genetic hitchhiking model, and its application to DNA polymorphism data from Drosophila melanogaster., Molecular Biology and Evolution, Volume 10 (1993) no. 4, pp. 842-854 | DOI
[116] Recombination as a point process along sequences, Theoretical Population Biology, Volume 55 (1999) no. 3, pp. 248-259 | DOI
[117] The joint effects of background selection and genetic recombination on local gene genealogies, Genetics, Volume 189 (2011) no. 1, pp. 251-266 | DOI
[118] Methods for estimating demography and detecting between-locus differences in the effective population size and mutation rate, Molecular Biology and Evolution, Volume 36 (2018) no. 2, pp. 423-433 | DOI
Cited by Sources: