Section: Genomics
Topic: Genetics/Genomics, Plant biology

SNP discovery by exome capture and resequencing in a pea genetic resource collection

10.24072/pcjournal.332 - Peer Community Journal, Volume 3 (2023), article no. e100.

Get full text PDF Peer reviewed and recommended by PCI

Pea is a major pulse crop in temperate regions and a model plant in genetics. Large genetic marker resources are needed to assess the genetic diversity in the species genepool and to provide selection tools for breeders. In this study, we used second-generation sequencing to perform an exome-capture protocol using a diverse pea germplasm collection, and produced a resource of over 2 million Single Nucleotide Polymorphisms. This dataset was then used to characterize the genetic diversity present in the panel and compute phylogenetic and structure analyses. The development of this resource paves the way for Genome-wide association studies and the development of powerful genotyping tools.

Published online:
DOI: 10.24072/pcjournal.332
Type: Research article
Keywords: Pea; Single Nucleotide Polymorphism; Exome capture; diversity
Aubert, Grégoire 1; Kreplak, Jonathan 1; Leveugle, Magalie 2, 3; Duborjal, Hervé 2, 3; Klein, Anthony 1; Boucherot, Karen 1; Vieille, Emilie 1; Chabert-Martinello, Marianne 1; Cruaud, Corinne 4; Bourion, Virginie 1; Lejeune-Hénaut, Isabelle 5; Pilet-Nayel, Marie-Laure 6; Bouchenak-Khelladi, Yanis 1; Francillonne, Nicolas 7, 8; Tayeh, Nadim 1; Pichon, Jean-Philippe 2, 3; Rivière, Nathalie 2, 9; Burstin, Judith 1

1 Agroécologie, INRAE, Institut Agro, Univ. Bourgogne, Univ. Bourgogne Franche-Comté, F-21000 Dijon, France
2 Biogemma, F-63720 Chappes, France
3 Present address: Limagrain F-63720 Chappes, France
4 Genoscope, Institut François Jacob, CEA, Université Paris-Saclay, 2 Rue Gaston Crémieux, 91057 Evry, France
5 BioEcoAgro, INRAE, Univ. Liège, Univ. Lille, Univ. Picardie Jules Verne, 2, Chaussée Brunehaut, F-80203, Estrées-Mons, France
6 IGEPP, INRAE, Institut Agro, Univ Rennes, 35653, Le Rheu, France
7 Université Paris-Saclay, INRAE, URGI, 78026, Versailles, France
8 Université Paris-Saclay, INRAE, BioinfOmics, Plant bioinformatics facility, 78026, Versailles, France
9 Present address: HM-Clause 49800 Loire-Authion
License: CC-BY 4.0
Copyrights: The authors retain unrestricted copyrights and publishing rights
     author = {Aubert, Gr\'egoire and Kreplak, Jonathan and Leveugle, Magalie and Duborjal, Herv\'e and Klein, Anthony and Boucherot, Karen and Vieille, Emilie and Chabert-Martinello, Marianne and Cruaud, Corinne and Bourion, Virginie and Lejeune-H\'enaut, Isabelle and Pilet-Nayel, Marie-Laure and Bouchenak-Khelladi, Yanis and Francillonne, Nicolas and Tayeh, Nadim and Pichon, Jean-Philippe and Rivi\`ere, Nathalie and Burstin, Judith},
     title = {SNP discovery by exome capture and resequencing in a pea genetic resource collection},
     journal = {Peer Community Journal},
     eid = {e100},
     publisher = {Peer Community In},
     volume = {3},
     year = {2023},
     doi = {10.24072/pcjournal.332},
     language = {en},
     url = {}
AU  - Aubert, Grégoire
AU  - Kreplak, Jonathan
AU  - Leveugle, Magalie
AU  - Duborjal, Hervé
AU  - Klein, Anthony
AU  - Boucherot, Karen
AU  - Vieille, Emilie
AU  - Chabert-Martinello, Marianne
AU  - Cruaud, Corinne
AU  - Bourion, Virginie
AU  - Lejeune-Hénaut, Isabelle
AU  - Pilet-Nayel, Marie-Laure
AU  - Bouchenak-Khelladi, Yanis
AU  - Francillonne, Nicolas
AU  - Tayeh, Nadim
AU  - Pichon, Jean-Philippe
AU  - Rivière, Nathalie
AU  - Burstin, Judith
TI  - SNP discovery by exome capture and resequencing in a pea genetic resource collection
JO  - Peer Community Journal
PY  - 2023
VL  - 3
PB  - Peer Community In
UR  -
DO  - 10.24072/pcjournal.332
LA  - en
ID  - 10_24072_pcjournal_332
ER  - 
%0 Journal Article
%A Aubert, Grégoire
%A Kreplak, Jonathan
%A Leveugle, Magalie
%A Duborjal, Hervé
%A Klein, Anthony
%A Boucherot, Karen
%A Vieille, Emilie
%A Chabert-Martinello, Marianne
%A Cruaud, Corinne
%A Bourion, Virginie
%A Lejeune-Hénaut, Isabelle
%A Pilet-Nayel, Marie-Laure
%A Bouchenak-Khelladi, Yanis
%A Francillonne, Nicolas
%A Tayeh, Nadim
%A Pichon, Jean-Philippe
%A Rivière, Nathalie
%A Burstin, Judith
%T SNP discovery by exome capture and resequencing in a pea genetic resource collection
%J Peer Community Journal
%D 2023
%V 3
%I Peer Community In
%R 10.24072/pcjournal.332
%G en
%F 10_24072_pcjournal_332
Aubert, Grégoire; Kreplak, Jonathan; Leveugle, Magalie; Duborjal, Hervé; Klein, Anthony; Boucherot, Karen; Vieille, Emilie; Chabert-Martinello, Marianne; Cruaud, Corinne; Bourion, Virginie; Lejeune-Hénaut, Isabelle; Pilet-Nayel, Marie-Laure; Bouchenak-Khelladi, Yanis; Francillonne, Nicolas; Tayeh, Nadim; Pichon, Jean-Philippe; Rivière, Nathalie; Burstin, Judith. SNP discovery by exome capture and resequencing in a pea genetic resource collection. Peer Community Journal, Volume 3 (2023), article  no. e100. doi : 10.24072/pcjournal.332.

PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.genomics.100237

Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.

[1] Alves-Carvalho, S.; Aubert, G.; Carrère, S.; Cruaud, C.; Brochot, A.-L.; Jacquin, F.; Klein, A.; Martin, C.; Boucherot, K.; Kreplak, J.; da Silva, C.; Moreau, S.; Gamas, P.; Wincker, P.; Gouzy, J.; Burstin, J. Full-length de novo assembly of RNA-seq data in pea ( Pisum sativum L.) provides a gene expression atlas and gives insights into root nodulation in this species, The Plant Journal, Volume 84 (2015), pp. 1-19 | DOI

[2] Bainbridge, M.; Wang, M.; Burgess, D.; Kovar, C.; Rodesch, M.; D’Ascenzo, M.; Kitzman, J.; Wu, Y.; Newsham, I.; Richmond, T.; Jeddeloh, J.; Muzny, D.; Albert, T.; Gibbs, R. Whole exome capture in solution with 3 Gbp of data, Genome Biology, Volume 11 (2010) | DOI

[3] Burstin, J.; Avia, K.; Carillo‐Perdomo, E.; Lecomte, C.; Beji, S.; Hanocq, E.; Aubert, G.; Tayeh, N.; Klein, A.; Geffroy, V.; Le Signor, C.; Pflieger, S.; Dalmais, M.; Desgroux, A.; Lavaud, C.; Quillévéré‐Hamard, A.; Kreplak, J.; Lejeune‐Hénaut, I.; Bourion, V.; Pilet‐Nayel, M.; Leveugle, M.; Pinochet, X.; Thompson, R.; Consortium, P. PeaMUST (Pea MultiStress Tolerance), a multidisciplinary French project uniting researchers, plant breeders, and the food industry, Legume Science, Volume 3 (2021) | DOI

[4] Cingolani, P.; Platts, A.; Wang, L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.; Lu, X.; Ruden, D. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 (Fly), Volume 6, 2012 | DOI

[5] Doležel, J.; Greilhuber, J.; Lucretti, S.; Meister, A.; Lysák, M.; Nardi, L.; Obermayer, R. Plant genome size estimation by flow cytometry: Inter-laboratory comparison, Annals of Botany, Volume 82 (1998) | DOI

[6] Duarte, J.; Rivière, N.; Baranger, A.; Aubert, G.; Burstin, J.; Cornet, L.; Lavaud, C.; Lejeune-Hénaut, I.; Martinant, J.; Pichon, J.; Pilet-Nayel, M.; Boutet, G. Transcriptome sequencing for high throughput SNP development and genetic mapping in Pea, BMC Genomics, Volume 15 (2014) | DOI

[7] Hill, C.; Wong, D.; Tibbits, J.; Forrest, K.; Hayden, M.; Zhang, X.; Westcott, S.; Angessa, T.; Li, C. Targeted enrichment by solution-based hybrid capture to identify genetic sequence variants in barley, Scientific Data, Volume 6 (2019) | DOI

[8] Jombart, T.; Devillard, S.; Balloux, F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations, BMC Genetics, Volume 11 (2010), p. 94 | DOI

[9] Kreplak, J.; Aubert, G.; Leveugle, M.; Duborjal, H.; Pichon, J.-P.; Burstin, J. Exome capture genotyping data on the pea Architecture Multi-Stress collection (Peamust project, Recherche Data Gouv, Volume V4 (2021) | DOI

[10] Kreplak, J.; Madoui, M.; Cápal, P.; Novák, P.; Labadie, K.; Aubert, G.; Bayer, P.; Gali, K.; Syme, R.; Main, D.; Klein, A.; Bérard, A.; Vrbová, I.; Fournier, C.; d’Agata, L.; Belser, C.; Berrabah, W.; Toegelová, H.; Milec, Z.; Vrána, J.; Lee, H.; Kougbeadjo, A.; Térézol, M.; Huneau, C.; Turo, C.; Mohellibi, N.; Neumann, P.; Falque, M.; Gallardo, K.; McGee, R.; Tar’an, B.; Bendahmane, A.; Aury, J.; Batley, J.; Le Paslier, M.; Ellis, N.; Warkentin, T.; Coyne, C.; Salse, J.; Edwards, D.; Lichtenzveig, J.; Macas, J.; Doležel, J.; Wincker, P.; Burstin, J. A reference genome for pea provides insight into legume genome evolution, Nature Genetics, Volume 51 (2019) | DOI

[11] Ku, C.; Wu, M.; Cooper, D.; Naidoo, N.; Pawitan, Y.; Pang, B.; Iacopetta, B.; Soong, R. Exome versus transcriptome sequencing in identifying coding region variants, Expert Review of Molecular Diagnostics, Volume 12 (2012) | DOI

[12] Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. Genome Project Data Processing Subgroup (2009) The Sequence Alignment/Map format and SAMtools, Bioinformatics, Volume 25 (1000), pp. 2078-2079 | DOI

[13] Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads, EMBnet.journal, Volume 17 (2011) | DOI

[14] Mendel, G. Versuche über Pflanzenhybriden, Versuche über Pflanzenhybriden, Vieweg+Teubner Verlag, Wiesbaden, 1970, pp. 21-64 | DOI

[15] Minh, B.; Schmidt, H.; Chernomor, O.; Schrempf, D.; Woodhams, M.; Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era, Molecular Biology and Evolution, Volume 37 (2020), pp. 1530-1534 | DOI

[16] Nawae, W. The value of a large Pisum SNP dataset, Peer Community in Genomics (2023) | DOI

[17] Ng, S.; Buckingham, K.; Lee, C.; Bigham, A.; Tabor, H.; Dent, K.; Huff, C.; Shannon, P.; Jabs, E.; Nickerson, D.; Shendure, J.; Bamshad, M. Exome sequencing identifies the cause of a mendelian disorder, Nature Genetics, Volume 42 (2010) | DOI

[18] Ng, S.; Turner, E.; Robertson, P.; Flygare, S.; Bigham, A.; Lee, C.; Shaffer, T.; Wong, M.; Bhattacharjee, A.; Eichler, E.; Bamshad, M.; Nickerson, D.; Shendure, J. Targeted capture and massively parallel sequencing of 12 human exomes, Nature, Volume 461 (2009) | DOI

[19] Ollivier, R.; Glory, I.; Cloteau, R.; Le Gallic, J.-F.; Denis, G.; Morlière, S.; Miteul, H.; Rivière, J.-P.; Lesné, A.; Klein, A.; Aubert, G.; Kreplak, J.; Burstin, J.; Pilet-Nayel, M.-L.; Simon, J.-C.; Sugio, A. A major-effect genetic locus, ApRVII, controlling resistance against both adapted and non-adapted aphid biotypes in pea, Theoretical and Applied Genetics, Volume 135 (2022) no. 5, pp. 1511-1528 | DOI

[20] Raj, A.; Stephens, M.; Pritchard, J. FastSTRUCTURE: Variational inference of population structure in large SNP data sets, Genetics, Volume 197 (2014) | DOI

[21] Siol, M.; Jacquin, F.; Chabert-Martinello, M.; Smýkal, P.; Le Paslier, M.-C.; Aubert, G.; Burstin, J. Patterns of Genetic Structure and Linkage Disequilibrium in a Large Collection of Pea Germplasm, G3 Genes|Genomes|Genetics, Volume 7 (2017) no. 8, pp. 2461-2471 | DOI

[22] Wang, Y.; Huang, J.; Li, E.; Xu, S.; Zhan, Z.; Zhang, X.; Yang, Z.; Guo, F.; Liu, K.; Liu, D.; Shen, X.; Shang, C.; Zhang, Z. Phylogenomics and Biogeography of Populus Based on Comprehensive Sampling Reveal Deep-Level Relationships and Multiple Intercontinental Dispersals, Frontiers in Plant Science, Volume 13 (2022) | DOI

[23] Yu, G.; Smith, D.; Zhu, H.; Guan, Y.; Lam, T.-Y. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data, Methods in Ecology and Evolution, Volume 8 (2017), pp. 28-36 | DOI

[24] Zhang, W.; Dasmahapatra, K.; Mallet, J.; Moreira, G.; Kronforst, M. Genome-wide introgression among distantly related Heliconius butterfly species, Genome biology, Volume 17 (2016), pp. 1-15 | DOI

Cited by Sources: