Section: Genomics
Topic: Genetics/Genomics
CulebrONT: a streamlined long reads multi-assembler pipeline for prokaryotic and eukaryotic genomes
10.24072/pcjournal.153 - Peer Community Journal, Volume 2 (2022), article no. e46.
Peer reviewed and recommended by PCI
Using long reads provides higher contiguity and better genome assemblies. However, producing such high-quality sequences from raw reads requires chaining a growing set of tools, and determining the best workflow is a complex task.
Type: Software tool
Orjuela, Julie 1, 2, 3; Comte, Aurore 2, 3; Ravel, Sébastien 2, 3; Charriat, Florian 2, 3; Vi, Tram 2, 4; Sabot, François 1, 3; Cunnac, Sébastien 2, 3
Orjuela, Julie; Comte, Aurore; Ravel, Sébastien; Charriat, Florian; Vi, Tram; Sabot, François; Cunnac, Sébastien. CulebrONT: a streamlined long reads multi-assembler pipeline for prokaryotic and eukaryotic genomes. Peer Community Journal, Volume 2 (2022), article no. e46. doi: 10.24072/pcjournal.153. https://peercommunityjournal.org/articles/10.24072/pcjournal.153/
PCI peer reviews and recommendation, and links to data, scripts, code and supplementary information: 10.24072/pci.genomics.100018
Conflict of interest of the recommender and peer reviewers:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.