BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology

Research output: Contribution to journalArticle

Standard

BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology. / Xavier, Basil B.; Ahmed, Mohamed; Bolzan, Mattia; Ribeiro-Gonçalves, Bruno; Alako, Blaise T.F.; Harrison, Peter; Lammens, Christine; Kumar-Singh, Samir; Goossens, Herman; Carriço, João André; Cochrane, Guy R.; Malhotra-kumar, Surbhi.

In: iScience, Vol. 23, No. 100769, 100767, 09.12.2019, p. 1-28.

Research output: Contribution to journalArticle

Harvard

Xavier, BB, Ahmed, M, Bolzan, M, Ribeiro-Gonçalves, B, Alako, BTF, Harrison, P, Lammens, C, Kumar-Singh, S, Goossens, H, Carriço, JA, Cochrane, GR & Malhotra-kumar, S 2019, 'BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology' iScience, vol 23, no. 100769, 100767, pp. 1-28.

APA

Xavier, B. B., Ahmed, M., Bolzan, M., Ribeiro-Gonçalves, B., Alako, B. T. F., Harrison, P., ... Malhotra-kumar, S. (2019). BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology. iScience, 23(100769), 1-28. [100767].

Vancouver

Xavier BB, Ahmed M, Bolzan M, Ribeiro-Gonçalves B, Alako BTF, Harrison P et al. BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology. iScience. 2019 Dec 9;23(100769):1-28. 100767.

Author

Xavier, Basil B.; Ahmed, Mohamed; Bolzan, Mattia; Ribeiro-Gonçalves, Bruno; Alako, Blaise T.F.; Harrison, Peter; Lammens, Christine; Kumar-Singh, Samir; Goossens, Herman; Carriço, João André; Cochrane, Guy R.; Malhotra-kumar, Surbhi / BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology.

In: iScience, Vol. 23, No. 100769, 100767, 09.12.2019, p. 1-28.

Research output: Contribution to journalArticle

Bibtex - Download

@article{7fcc4ac707a349eea219cc7dcdf7dc49,
title = "BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology",
keywords = "Biological Sciences Research Methodologies, Microbiology, Sequence Analysis, BacPipe",
author = "Xavier, {Basil B.} and Mohamed Ahmed and Mattia Bolzan and Bruno Ribeiro-Gonçalves and Alako, {Blaise T.F.} and Peter Harrison and Christine Lammens and Samir Kumar-Singh and Herman Goossens and Carriço, {João André} and Cochrane, {Guy R.} and Surbhi Malhotra-kumar",
note = "Score=10",
year = "2019",
month = "12",
volume = "23",
pages = "1--28",
journal = "iScience",
issn = "2589-0042",
publisher = "Elsevier",
number = "100769",

}

RIS - Download

TY - JOUR

T1 - BacPipe: A rapid, user-friendly whole genome sequencing pipeline for clinical diagnostic bacteriology

AU - Xavier,Basil B.

AU - Ahmed,Mohamed

AU - Bolzan,Mattia

AU - Ribeiro-Gonçalves,Bruno

AU - Alako,Blaise T.F.

AU - Harrison,Peter

AU - Lammens,Christine

AU - Kumar-Singh,Samir

AU - Goossens,Herman

AU - Carriço,João André

AU - Cochrane,Guy R.

AU - Malhotra-kumar,Surbhi

N1 - Score=10

PY - 2019/12/9

Y1 - 2019/12/9

N2 - Objectives: Despite rapid advances in whole genome sequencing (WGS) technologies, their integration into routine microbiological diagnostics has been hampered by the need for standardised downstream bioinformatics analysis. Here we developed a comprehensive and computationally low-resource bioinformatics pipeline (BacPipe) enabling direct analyses of bacterial whole-genome sequences (raw reads or contigs) obtained from second or third-generation sequencing technologies. Methods: Open-access tools for quality verification, de novo assembly (SPAdes), annotation (Prokka), bacterial typing (MLST, emm typing), and for identification of resistance genes (Resfams), plasmids, virulence genes, single nucleotide polymorphisms (SNPs) and core genome phylogeny were integrated into a single Python script. A graphical user interface (GUI) was developed to allow real-time progression of the analysis. The scalability and speed of BacPipe in handling large data-sets was further demonstrated using 4139 Illumina paired-end sequence files of publicly-available bacterial genomes (2.9−5.4 Mb) from the European Nucleotide Archive (ENA). Results: Computational time on Bacpipe, demonstrated on a 8 Gb RAM personal computer, was 21, 25, 28 and 30 minutes for sequencing coverage of 50-, 70-, 100- and 120-folds of a 5.1 Mb bacterial genome, respectively. Compiled results of every individual genome/strain are saved as an Excel file. Up to 56% reduction in analysis time was achieved by a unique parallelization of post-assembly and post-annotation tools in Bacpipe compared to running these tools in succession. On the 4139 Illumina paired-end sequence files, running time was on average 50 minutes/strain. Bacpipe is integrated in EBI-SELECTA, a project-specific portal (H2020 COMPARE), and is also available as an independent docker image that can be used across Windows- and Unix-based systems. Conclusion: BacPipe offers a fully automated ‘one-stop’ bacterial WGS analysis pipeline with a user-friendly GUI which can contribute to overcome the major hurdle of WGS data analysis in hospitals and public-health and for infection-control monitoring.

AB - Objectives: Despite rapid advances in whole genome sequencing (WGS) technologies, their integration into routine microbiological diagnostics has been hampered by the need for standardised downstream bioinformatics analysis. Here we developed a comprehensive and computationally low-resource bioinformatics pipeline (BacPipe) enabling direct analyses of bacterial whole-genome sequences (raw reads or contigs) obtained from second or third-generation sequencing technologies. Methods: Open-access tools for quality verification, de novo assembly (SPAdes), annotation (Prokka), bacterial typing (MLST, emm typing), and for identification of resistance genes (Resfams), plasmids, virulence genes, single nucleotide polymorphisms (SNPs) and core genome phylogeny were integrated into a single Python script. A graphical user interface (GUI) was developed to allow real-time progression of the analysis. The scalability and speed of BacPipe in handling large data-sets was further demonstrated using 4139 Illumina paired-end sequence files of publicly-available bacterial genomes (2.9−5.4 Mb) from the European Nucleotide Archive (ENA). Results: Computational time on Bacpipe, demonstrated on a 8 Gb RAM personal computer, was 21, 25, 28 and 30 minutes for sequencing coverage of 50-, 70-, 100- and 120-folds of a 5.1 Mb bacterial genome, respectively. Compiled results of every individual genome/strain are saved as an Excel file. Up to 56% reduction in analysis time was achieved by a unique parallelization of post-assembly and post-annotation tools in Bacpipe compared to running these tools in succession. On the 4139 Illumina paired-end sequence files, running time was on average 50 minutes/strain. Bacpipe is integrated in EBI-SELECTA, a project-specific portal (H2020 COMPARE), and is also available as an independent docker image that can be used across Windows- and Unix-based systems. Conclusion: BacPipe offers a fully automated ‘one-stop’ bacterial WGS analysis pipeline with a user-friendly GUI which can contribute to overcome the major hurdle of WGS data analysis in hospitals and public-health and for infection-control monitoring.

KW - Biological Sciences Research Methodologies

KW - Microbiology

KW - Sequence Analysis

KW - BacPipe

UR - http://ecm.sckcen.be/OTCS/llisapi.dll/open/36629876

M3 - Article

VL - 23

SP - 1

EP - 28

JO - iScience

T2 - iScience

JF - iScience

SN - 2589-0042

IS - 100769

M1 - 100767

ER -

ID: 5890914