Section: Mathematical & Computational Biology
Topic:
Agricultural sciences,
Statistics
Bayesian joint-regression analysis of unbalanced series of on-farm trials
Corresponding author(s): Turbet Delof, Michel (michel.turbet_delof@cirad.fr)
10.24072/pcjournal.495 - Peer Community Journal, Volume 5 (2025), article no. e4.
Get full text PDF Peer reviewed and recommended by PCIParticipatory plant breeding (PPB) is aimed at developing varieties adapted to agroecologically-based systems. In PPB, selection is decentralized in the target environments, and relies on collaboration between farmers, farmers' organisations and researchers. By doing so, evaluation of new genotypes takes genotype x environment (GxE) interactions into account to select for specific adaptation. In many cases, there is little overlap among genotypes assessed from farm to farm because the farmers participating in a PPB project choose which ones to assess on their farm. In addition, on-farm trials can often generate more extreme observations than trials carried out on research stations. These features make the estimation of genotype, environment and interaction effects more difficult. This challenge is not unique to PPB, as many breeding programs use sparse testing or incomplete block designs to evaluate more genotypes, however in PPB genotypes are not always assigned randomly to environments. To explore methods of overcoming these challenges, this article tests various data analysis scenarios using a Bayesian approach with different models and a real wheat PPB dataset over 11 years. Four morpho-agronomic traits were studied, representing over 1000 GxE combinations from 189 on-farm trials. This dataset was severely unbalanced with more than 90% of GxE combinations missing. We compared various Bayesian Finlay-Wilkinson models and found that placing hierarchical distributions on model parameters and modelling residuals using a Student's t distribution jointly improved the estimates of main effects and interactions. Environment effects were the most important and explained more than 50% of the variance of observations. This statistical framework allowed us to estimate two indicators of genotype stability (one static and one dynamic) despite the high disequilibrium of the data. We found differences in mean and stability between genotype categories, with registred varieties consistently shorter (-30 cm) and containing less protein (-0.3%) than other types of varieties. The methods developed could be used for evaluation and/or selection within networks of various stakeholders such as farmers, gardeners, plant breeders or managers of genetic resource centres.
Type: Research article
Turbet Delof, Michel 1; Rivière, Pierre 2; Dawson, Julie C 3; Gauffreteau, Arnaud 4; Goldringer, Isabelle 1; van Frank, Gaëlle 1; David, Olivier 5
@article{10_24072_pcjournal_495, author = {Turbet Delof, Michel and Rivi\`ere, Pierre and Dawson, Julie C and Gauffreteau, Arnaud and Goldringer, Isabelle and van Frank, Ga\"elle and David, Olivier}, title = {Bayesian joint-regression analysis of unbalanced series of on-farm trials}, journal = {Peer Community Journal}, eid = {e4}, publisher = {Peer Community In}, volume = {5}, year = {2025}, doi = {10.24072/pcjournal.495}, language = {en}, url = {https://peercommunityjournal.org/articles/10.24072/pcjournal.495/} }
TY - JOUR AU - Turbet Delof, Michel AU - Rivière, Pierre AU - Dawson, Julie C AU - Gauffreteau, Arnaud AU - Goldringer, Isabelle AU - van Frank, Gaëlle AU - David, Olivier TI - Bayesian joint-regression analysis of unbalanced series of on-farm trials JO - Peer Community Journal PY - 2025 VL - 5 PB - Peer Community In UR - https://peercommunityjournal.org/articles/10.24072/pcjournal.495/ DO - 10.24072/pcjournal.495 LA - en ID - 10_24072_pcjournal_495 ER -
%0 Journal Article %A Turbet Delof, Michel %A Rivière, Pierre %A Dawson, Julie C %A Gauffreteau, Arnaud %A Goldringer, Isabelle %A van Frank, Gaëlle %A David, Olivier %T Bayesian joint-regression analysis of unbalanced series of on-farm trials %J Peer Community Journal %D 2025 %V 5 %I Peer Community In %U https://peercommunityjournal.org/articles/10.24072/pcjournal.495/ %R 10.24072/pcjournal.495 %G en %F 10_24072_pcjournal_495
Turbet Delof, Michel; Rivière, Pierre; Dawson, Julie C; Gauffreteau, Arnaud; Goldringer, Isabelle; van Frank, Gaëlle; David, Olivier. Bayesian joint-regression analysis of unbalanced series of on-farm trials. Peer Community Journal, Volume 5 (2025), article no. e4. doi : 10.24072/pcjournal.495. https://peercommunityjournal.org/articles/10.24072/pcjournal.495/
PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.mcb.100272
Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.
[1] Genotype x environment interaction: challenges and opportunities for plant breeding and cultivar recommendations, FAO plant production and protection paper, Food and Agriculture Organization of the United Nations, Rome, 2002 no. 174
[2] Defining Subregions and Estimating Benefits for a Specific-Adaptation Strategy by Breeding Programs: A Case Study, Crop Science, Volume 45 (2005) no. 5, pp. 1741-1749 | DOI
[3] Response of Common Wheat Varieties to Organic and Conventional Production Systems across Italian Locations, and Implications for Selection, Field Crops Research, Volume 116 (2010) no. 3, pp. 230-238 | DOI
[4] Correlations among Some Statistical Measures of Phenotypic Stability, Euphytica, Volume 30 (1981) no. 3, pp. 835-840 | DOI
[5] Stability Analysis in Plant Breeding, Plant breeding, Volume 101 (1988) no. 1, pp. 1-23 | DOI
[6] Root and shoot traits of bread wheat (Triticum aestivum L.) landraces and cultivars, Euphytica, Volume 212 (2016) no. 2, pp. 297-311 | DOI
[7] Bayesian Analysis of Agricultural Field Experiments, Journal of the Royal Statistical Society: Series B (Statistical Methodology), Volume 61 (1999) no. 4, pp. 691-746 | DOI
[8] What are the prospects for citizen science in agriculture? Evidence from three continents on motivation and mobile telephone use of resource-poor farmers, PloS one, Volume 12 (2017) no. 5, p. e0175700 | DOI
[9] Visions Paysannes de La Recherche Dans Le Contexte de La Sélection Participative, Emergence, Pour l'émergence d'une université du Vivant, 2011
[10] Points de repère dans l'analyse de la stabilité et de l'interaction génotype-milieu en amélioration des plantes, Agronomie, Volume 17 (1997) no. 4, pp. 219-246 | DOI
[11] Plant Functional Trait Variability and Trait Syndromes among Wheat Varieties: The Footprint of Artificial Selection, Journal of Experimental Botany, Volume 72 (2021) no. 4, pp. 1166-1180 | DOI
[12] Bayesian inference of spatially correlated random parameters for on-farm experiment, Field Crops Research, Volume 281 (2022), p. 108477 | DOI
[13] Inference for Nonconjugate Bayesian Models Using the Gibbs Sampler, Canadian Journal of Statistics, Volume 19 (1991) no. 4, pp. 399-405 | DOI
[14] Bayesian methods for data analysis, Chapman & Hall/CRC Texts in Statistical Science Series, CRC Press, Boca Raton, Florida ; London, England ; New York, New York, 2009
[15] Farmer Participation in Barley Breeding in Syria, Morocco and Tunisia, Euphytica, Volume 122 (2001) no. 3, pp. 521-536 | DOI
[16] Decentralized-Participatory Plant Breeding: An Example of Demand Driven Research, Euphytica, Volume 155 (2007) no. 3, pp. 349-360 | DOI
[17] Participatory Plant Breeding: Who Did It, Who Does It and Where?, Experimental Agriculture, Volume 56 (2020) no. 1, pp. 1-11 | DOI
[18] Scale Mixtures Distributions in Statistical Modelling, Australian & New Zealand Journal of Statistics, Volume 50 (2008) no. 2, pp. 135-146 | DOI
[19] Scale Mixtures Distributions in Statistical Modelling, Australian & New Zealand Journal of Statistics, Volume 50 (2008) no. 2, pp. 135-146 | DOI
[20] A Bayesian Approach for Assessing the Stability of Genotypes, Crop Science, Volume 46 (2006) no. 6, pp. 2654-2665 | DOI
[21] Eberhart and Russel's Bayesian Method in the Selection of Popcorn Cultivars, Crop Science, Volume 55 (2015) no. 2, pp. 571-577 | DOI
[22] Bayesian Inference of Natural Selection from Spatiotemporal Phenotypic Data, Theoretical Population Biology, Volume 131 (2020), pp. 100-109 | DOI
[23] Decentralized Selection and Participatory Approaches in Plant Breeding for Low-Input Systems, Euphytica, Volume 160 (2008) no. 2, pp. 143-154 | DOI
[24] Collaborative Plant Breeding for Organic Agricultural Systems in Developed Countries, Sustainability, Volume 3 (2011) no. 8, pp. 1206-1223 | DOI
[25] Changes in the Concept of Genotype × Environment Interactions to Fit Agriculture Diversification and Decentralized Participatory Plant Breeding: Pluridisciplinary Point of View, Euphytica, Volume 163 (2008) no. 3, pp. 533-546 | DOI
[26] Handling Data Imbalance and G×E Interactions in On-Farm Trials Using Bayesian Hierarchical Models, Peer Community in Mathematical and Computational Biology, Volume 1 (2024), p. 100272 | DOI
[27] Comparative analysis of performance and stability among composite cross populations, variety mixtures and pure lines of winter wheat in organic and conventional cropping systems, Field Crops Research, Volume 183 (2015), pp. 235-245 | DOI
[28] Introduction to Quantitative Genetics., Oliver & Boyd, Edinburgh/London, 1960
[29] An Overview on the Use of Stability Parameters in Plant Breeding, Biometrics & Biostatistics International Journal, Volume 2 (2015) no. 5, pp. 149-159 | DOI
[30] The Analysis of Adaptation in a Plant-Breeding Programme, Australian Journal of Agricultural Research, Volume 14 (1963) no. 6, pp. 742-754 | DOI
[31] Statistical Analysis of Successive Series of Experiments in Plant Breeding: A Bayesian Approach., Institut National de la Recherche Agronomique, Paris, France, 2001, pp. 49-56
[32] Statistical Analysis of Yield Trials by AMMI and GGE: Further Considerations, Crop Science, Volume 48 (2008) no. 3, pp. 866-889 | DOI
[33] Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper), Bayesian Analysis, Volume 1 (2006) no. 3, pp. 515-534 | DOI
[34] R-squared for Bayesian Regression Models, The American Statistician, Volume 73 (2019) no. 3, pp. 307-309 | DOI
[35] Prediction of complex traits: robust alternatives to best linear unbiased prediction, Frontiers in genetics, Volume 9 (2018), p. 195 | DOI
[36] Average Information REML: An Efficient Algorithm for Variance Parameter Estimation in Linear Mixed Models, Biometrics (1995), pp. 1440-1450 | DOI
[37] Agronomic Evaluation of Bread Wheat Varieties from Participatory Breeding: A Combination of Performance and Robustness, Sustainability, Volume 12 (2020) no. 1, p. 128 | DOI
[38] Robust Statistics: The Approach Based on Influence Functions, John Wiley & Sons, 2011 | DOI
[39] On the Interpretation and Use of R² in Regression Analysis, Biometrics, Volume 43 (1987) no. 1, pp. 61-69 | DOI
[40] Genotype by Environment Interactions in Wheat Quality Breeding Programs in Southeast Europe, Euphytica, Volume 174 (2010), pp. 315-324 | DOI
[41] Robust Statistics, John Wiley & Sons, 1981 | DOI
[42] Model-Based Clustering of Non-Gaussian Panel Data Based on Skew-t Distributions, Journal of Business & Economic Statistics, Volume 28 (2010) no. 1, pp. 52-66 | DOI
[43] Sustaining Sustainability Science: The Role of Established Inter-Disciplines, Ecological Economics, Volume 70 (2011) no. 4, pp. 835-843 | DOI
[44] Are Trait-based Species Rankings Consistent across Data Sets and Spatial Scales?, Journal of Vegetation Science, Volume 25 (2014) no. 1, pp. 235-247 | DOI
[45] Effects of Inter-Varietal Diversity, Biotic Stresses and Environmental Productivity on Grain Yield of Spring Barley Variety Mixtures, Euphytica, Volume 185 (2012), pp. 123-138 | DOI
[46] Quality Traits in Winter Wheat: Comparison of Stability Parameters and Correlations between Traits Regarding Their Stability, Journal of Cereal Science, Volume 77 (2017), pp. 186-193 | DOI
[47] High-Yielding, Drought-Tolerant, Stable Rice Genotypes for the Shallow Rainfed Lowland Drought-Prone Ecosystem, Field crops research, Volume 133 (2012), pp. 37-47 | DOI
[48] Robust Statistical Modeling Using the t Distribution, Journal of the American Statistical Association, Volume 84 (1989) no. 408, pp. 881-896 | DOI
[49] Identifying the Best Approximating Model in Bayesian Phylogenetics: Bayes Factors, Cross-Validation or wAIC?, Systematic Biology, Volume 72 (2023) no. 3, pp. 616-638 | DOI
[50] FW: An R Package for Finlay–Wilkinson Regression That Incorporates Genomic/Pedigree Information and Covariance Structures Between Environments, G3 GenesGen-omesGenetics, Volume 6 (2016) no. 3, pp. 589-597 | DOI
[51] Stability Analysis: Where Do We Stand?, Crop science, Volume 26 (1986) no. 5, pp. 894-900 | DOI
[52] Participatory Research Methods— Implementation, Effectiveness and Institutional Context, Agricultural systems, Volume 55 (1997) no. 2, pp. 195-216 | DOI
[53] Identifying Superior Rainfed Barley Genotypes in Farmers' Fields Using Participatory Varietal Selection, Journal of Crop Science and Biotechnology, Volume 14 (2011), pp. 281-288 | DOI
[54] Evidence of Varietal Adaptation to Organic Farming Systems, Field Crops Research, Volume 102 (2007) no. 3, pp. 172-177 | DOI
[55] Stability of Some Quality Traits in Bread Wheat (Triticum Aestivum) Genotypes, Journal of Environmental Biology, Volume 31 (2010) no. 4, p. 489
[56] Analysis of Series of Trials Where Varieties Differ in Sensitivity to Locations, Journal of Agricultural, Biological, and Environmental Statistics, Volume 4 (1999) no. 3, pp. 310-325 | DOI
[57] Bayesian Segmented Regression Model for Adaptability and Stability Evaluation of Cotton Genotypes, Euphytica, Volume 216 (2020) no. 2, pp. 1-10 | DOI
[58] MCMC Using Hamiltonian Dynamics, Handbook of Markov Chain Monte Carlo, Chapman and Hall/CRC, 2011, pp. 113-162 | DOI
[59] Joint-Regression Analysis for Incomplete Two-way Tables, Australian & New Zealand Journal of Statistics, Volume 43 (2001) no. 2, pp. 201-206 | DOI
[60] Analysis of series of variety trials, Statistical methods for plant variety evaluation, Chapman & Hall, 1997, pp. 139-161 | DOI
[61] Statutory and Recommended List Trials of Crop Varieties in the United Kingdom, Journal of the Royal Statistical Society: Series A (General), Volume 143 (1980) no. 3, pp. 219-240 | DOI
[62] 'Green Revolution' Genes Encode Mutant Gibberellin Response Modulators, Nature, Volume 400 (1999) no. 6741, pp. 256-261 | DOI
[63] Robustness of Joint Regression Analysis, Biometrical Letters, Volume 44 (2007) no. 2, pp. 105-128 | DOI
[64] Environmental and genotype-environmental components of variability III. Multiple lines and crosses, Heredity, Volume 23 (1968) no. 3, pp. 339-356 | DOI
[65] Stability analysis using the SAS system, Agronomy Journal, Volume 91 (1999) no. 1, pp. 154-160 | DOI
[66] An algorithm for a letter-based representation of all-pairwise comparisons, Journal of Computational and Graphical Statistics, Volume 13 (2004) no. 2, pp. 456-466 | DOI
[67] Extending Finlay–Wilkinson regression with environmental covariates, Plant Breeding, Volume 142 (2023) no. 5, pp. 621-631 | DOI
[68] R: A Language and Environment for Statistical Computing, http://www.R-project.org/, 2024 (Place: Vienna, Austria)
[69] Methods of Yield Stability Analysis in Long-Term Field Experiments. A Review, Agronomy for Sustainable Development, Volume 41 (2021), pp. 1-28 | DOI
[70] Hierarchical Bayesian Modeling for Flexible Experiments in Decentralized Participatory Plant Breeding, Crop Science, Volume 55 (2015) no. 3, pp. 1053-1067 | DOI
[71] Response to Farmer Mass Selection in Early Generation Progeny of Bread Wheat Landrace Crosses, Renewable Agriculture and Food Systems, Volume 30 (2015) no. 02, pp. 190-201 | DOI
[72] The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation, Springer, 2007 | DOI
[73] A Comparison between Joint Regression Analysis and the Additive Main and Multiplicative Interaction Model: The Robustness with Increasing Amounts of Missing Data, Scientia Agricola, Volume 68 (2011), pp. 679-686 | DOI
[74] Robust Linear Mixed Models with Normal/Independent Distributions and Bayesian MCMC Implementation, Biometrical Journal, Volume 45 (2003) no. 5, pp. 573-590 | DOI
[75] Statistical Inference, Chapman and Hall, 1975
[76] Les modèles de base de l'analyse bayésienne, Méthodes Bayésiennes en statistique, Editions Technip, 2002, pp. 103-132 | DOI
[77] The Analysis of Crop Cultivar Breeding and Evaluation Trials: An Overview of Current Mixed Model Approaches, The Journal of Agricultural Science, Volume 143 (2005) no. 6, pp. 449-462 | DOI
[78] Participatory Plant Breeding with Maize in Mexico and Honduras, Euphytica, Volume 122 (2001) no. 3, pp. 551-563 | DOI
[79] Farmer Preferences and Legume Intensification for Low Nutrient Environments, Plant and soil, Volume 245 (2002), pp. 181-192 | DOI
[80] RStan: The R Interface to Stan, http://mc-stan.org/, 2024
[81] Yield variability of crop varieties in the UK, The Journal of Agricultural Science, Volume 102 (1984) no. 2, pp. 315-321 | DOI
[82] A Sparse Implementation of the Average Information Algorithm for Factor Analytic and Reduced Rank Variance Models, Australian & New Zealand Journal of Statistics, Volume 45 (2003) no. 4, pp. 445-459 | DOI
[83] Impacts de l'environnement sur les pratiques de sélection paysanne et le comportement des variétés qui en résultent, Université Paris-Saclay (2024) | DOI
[84] Supplementary Data and models : Bayesian joint-regression analysis of unbalanced series of on-farm trials, 2024 | DOI
[85] Crop variety management for climate adaptation supported by citizen science, Proceedings of the National Academy of Sciences, Volume 116 (2019) no. 10, pp. 4194-4199 | DOI
[86] Practical Bayesian Model Evaluation Using Leave-One-out Cross-Validation and WAIC, Statistics and Computing, Volume 27 (2017) no. 5, pp. 1413-1432 | DOI
[87] Pareto Smoothed Importance Sampling, arXiv:1507.02646 (2021) | DOI
[88] Increasing the Client Orientation of Maize Breeding Using Farmer Participation in Eastern India, Experimental Agriculture, Volume 41 (2005) no. 4, pp. 413-426 | DOI
[89] Identification of parametric models from experimental data, Communications and control engineering, Springer Masson, London, 1997
[90] A comparison of analysis methods for late-stage variety evaluation trials, Australian & New Zealand Journal of Statistics, Volume 52 (2010) no. 2, pp. 125-149 | DOI
[91] Developments in Breeding Cereals for Organic Agriculture, Euphytica, Volume 163 (2008) no. 3, pp. 323-346 | DOI
[92] Estimation of Missing Values Affects Important Aspects of GGE Biplot Analysis, Crop Science, Volume 57 (2017) no. 1, pp. 40-52 | DOI
[93] Uber Eine Methode Zur Erfassung Der Okologischen Streubreite in Feldverzuchen, Z. Pflanzenzuchtg, Volume 47 (1962), pp. 92-96 | DOI
[94] Heritability, Variance Components and Correlations of Yield and Quality Traits in Durum Wheat (Triticum Durum Desf.), Pak. J. Bot, Volume 41 (2009) no. 2, pp. 753-759 | DOI
[95] Biplot Analysis of Incomplete Two-way Data, Crop Science, Volume 53 (2013) no. 1, pp. 48-57 | DOI
[96] The Analysis of Groups of Experiments, The Journal of Agricultural Science, Volume 28 (1938) no. 4, pp. 556-580 | DOI
[97] A weighted AMMI algorithm for nonreplicated data, Pesquisa Agropecuária Brasileira, Volume 53 (2018) no. 5, pp. 557-565 | DOI
[98] Gestion participative de la diversité cultivée et création de mélanges diversifiés de blé tendre à la ferme, Université Paris Saclay (COmUE) (2018) | DOI
[99] Influence of Experimental Design on Decentralized, on-Farm Evaluation of Populations: A Simulation Study, Euphytica, Volume 215 (2019) no. 7, p. 126 | DOI
[100] Genetic Diversity and Stability of Performance of Wheat Population Varieties Developed by Participatory Breeding, Sustainability, Volume 12 (2020) no. 1, p. 384 | DOI
Cited by Sources: