Section: Evolutionary Biology
Topic:
Evolution,
Computer sciences
Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting
Corresponding author(s): Lafond, Manuel (manuel.lafond@USherbrooke.ca); Scornavacca, Celine (celine.scornavacca@umontpellier.fr)
10.24072/pcjournal.674 - Peer Community Journal, Volume 6 (2026), article no. e7
Get full text PDF Peer reviewed and recommended by PCIGene trees play an important role in various areas of phylogenomics. However, their reconstruction often relies on limited-length sequences and may not account for complex evolutionary events, such as gene duplications, losses, or incomplete lineage sorting (ILS), which are not modeled by standard phylogenetic methods. To address these challenges, it is common to first infer gene trees using fast algorithms for conventional models, then refine them through species tree-aware correction methods. Recently, it has been argued that such corrections can lead to overfitting and force gene trees to resemble the species tree, thereby obscuring genuine gene-level variation caused by ILS. In this paper, we challenge and refute this hypothesis, and we demonstrate that, when applied carefully, correction methods can offer significant benefits, even in the presence of ILS.
Type: Research article
Lafond, Manuel  1 ; Scornavacca, Celine  2
CC-BY 4.0
@article{10_24072_pcjournal_674,
author = {Lafond, Manuel and Scornavacca, Celine},
title = {Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting
},
journal = {Peer Community Journal},
eid = {e7},
year = {2026},
publisher = {Peer Community In},
volume = {6},
doi = {10.24072/pcjournal.674},
language = {en},
url = {https://peercommunityjournal.org/articles/10.24072/pcjournal.674/}
}
TY - JOUR AU - Lafond, Manuel AU - Scornavacca, Celine TI - Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting JO - Peer Community Journal PY - 2026 VL - 6 PB - Peer Community In UR - https://peercommunityjournal.org/articles/10.24072/pcjournal.674/ DO - 10.24072/pcjournal.674 LA - en ID - 10_24072_pcjournal_674 ER -
%0 Journal Article %A Lafond, Manuel %A Scornavacca, Celine %T Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting %J Peer Community Journal %D 2026 %V 6 %I Peer Community In %U https://peercommunityjournal.org/articles/10.24072/pcjournal.674/ %R 10.24072/pcjournal.674 %G en %F 10_24072_pcjournal_674
Lafond, M.; Scornavacca, C. Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting. Peer Community Journal, Volume 6 (2026), article no. e7. https://doi.org/10.24072/pcjournal.674
PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.evolbiol.100872
Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.
[1] Genome-scale coestimation of species and gene trees, Genome research, Volume 23 (2013) no. 2, pp. 323-330 (Publisher: Cold Spring Harbor Lab) | DOI
[2] Non-parametric correction of estimated gene trees using TRACTION, Algorithms for Molecular Biology, Volume 15 (2020), pp. 1-18 (Publisher: Springer) | DOI
[3] Gene tree discordance, phylogenetic inference and the multispecies coalescent, Trends in ecology & evolution, Volume 24 (2009) no. 6, pp. 332-340 (Publisher: Elsevier) | DOI
[4] Reassessing gene tree correction under incomplete lineage sorting, Peer Community in Evolutionary Biology (2025), p. 100872 | DOI
[5] Reconciling multiple genes trees via segmental duplications and losses, Algorithms for Molecular Biology, Volume 14 (2019), pp. 1-19 (Publisher: Springer) | DOI
[6] Species tree inference with BPP using genomic sequences and the multispecies coalescent, Molecular biology and evolution, Volume 35 (2018) no. 10, pp. 2585-2593 (Publisher: Oxford University Press) | DOI
[7] Dynamics of gene loss following ancient whole-genome duplication in the cryptic Paramecium complex, Molecular biology and evolution, Volume 40 (2023) no. 5, p. msad107 (Publisher: Oxford University Press US) | DOI
[8] A justification for reporting the majority-rule consensus tree in Bayesian phylogenetics, Systematic biology, Volume 57 (2008) no. 5, pp. 814-821 (Publisher: Taylor & Francis) | DOI
[9] ecceTERA: comprehensive gene tree-species tree reconciliation using parsimony, Bioinformatics, Volume 32 (2016) no. 13, pp. 2056-2058 (Publisher: Oxford University Press) | DOI
[10] Resolution and reconciliation of non-binary gene trees with transfers, duplications and losses, Bioinformatics, Volume 33 (2017) no. 7, pp. 980-987 (Publisher: Oxford University Press) | DOI
[11] A rigorous framework to classify the postduplication fate of paralogous genes, Journal of Computational Biology, Volume 31 (2024) no. 9, pp. 815-833 (Publisher: Mary Ann Liebert, Inc., publishers 140 Huguenot Street, 3rd Floor New …) | DOI
[12] Error detection and correction of gene trees, Models and algorithms for genome evolution, Springer, 2013, pp. 261-285 | DOI
[13] Inferring phylogeny despite incomplete lineage sorting, Systematic biology, Volume 55 (2006) no. 1, pp. 21-30 (Publisher: Oxford University Press) | DOI
[14] Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: a model, Theoretical population biology, Volume 75 (2009) no. 1, pp. 35-45 (Publisher: Elsevier) | DOI
[15] AleRax: a tool for gene and species tree co-estimation and reconciliation under a probabilistic model of gene duplication, transfer, and loss, Bioinformatics, Volume 40 (2024) no. 4, p. btae162 | DOI
[16] IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies, Molecular biology and evolution, Volume 32 (2015) no. 1, pp. 268-274 (Publisher: Oxford University Press) | DOI
[17] Efficient gene tree correction guided by genome evolution, PLoS One, Volume 11 (2016) no. 8, p. e0159559 (Publisher: Public Library of Science San Francisco, CA USA) | DOI
[18] StarBEAST2 brings faster species tree inference and accurate estimates of substitution rates, Molecular biology and evolution, Volume 34 (2017) no. 8, pp. 2101-2114 (Publisher: Oxford University Press) | DOI
[19] Evolution by gene duplication, Springer Science & Business Media, 2013 | DOI
[20] Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees, Bioinformatics, Volume 13 (1997) no. 3, pp. 235-238 (Publisher: Oxford University Press) | DOI
[21] MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space, Systematic biology, Volume 61 (2012) no. 3, pp. 539-542 (Publisher: Oxford University Press) | DOI
[22] Joint amalgamation of most parsimonious reconciled gene trees, Bioinformatics, Volume 31 (2015) no. 6, pp. 841-848 (Publisher: Oxford University Press) | DOI
[23] Rehabilitating the benefits of gene tree correction in the presence of incomplete lineage sorting, zenodo (2025) | DOI
[24] To what extent current limits of phylogenomics can be overcome? In: Phylogenetics in the genomic era, HAL (2020) no. 2.1, pp. 1-34 (https://hal.science/hal-02535366v1)
[25] To what extent current limits of phylogenomics can be overcome?, https://hal.science/hal-02535366v1, 2020, p. 2
[26] Simulating trees with a fixed number of extant species, Systematic biology, Volume 60 (2011) no. 5, pp. 676-684 (Publisher: Oxford University Press) | DOI
[27] TreeFix: statistically informed gene tree error correction using species trees, Systematic biology, Volume 62 (2013) no. 1, pp. 110-120 (Publisher: Oxford University Press) | DOI
[28] Average gene length is highly conserved in prokaryotes and eukaryotes and diverges only between the two kingdoms, Molecular biology and evolution, Volume 23 (2006) no. 6, pp. 1107-1108 (Publisher: Oxford University Press) | DOI
[29] “Correcting” gene trees to be more like species trees frequently increases topological error, Genome Biology and Evolution, Volume 15 (2023) no. 6, p. evad094 (Publisher: Oxford University Press US) | DOI
[30] From gene trees to species trees II: Species tree inference by minimizing deep coalescence events, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Volume 8 (2011) no. 6, pp. 1685-1691 (Publisher: IEEE) | DOI
[31] A linear-time algorithm for reconciliation of non-binary gene tree and binary species tree, Combinatorial Optimization and Applications: 7th International Conference, COCOA 2013, Chengdu, China, December 12-14, 2013, Proceedings, Springer, 2013, pp. 190-201 | DOI
[32] Reconciliation with nonbinary gene trees revisited, Journal of the ACM (JACM), Volume 64 (2017) no. 4, pp. 1-28 (Publisher: ACM New York, NY, USA) | DOI
Cited by Sources: