Deep learning predicts tuberculosis drug resistance status from genome sequencing data

Mapping Intimacies ◽

10.1101/275628 ◽

2018 ◽

Cited By ~ 7

Author(s):

Michael L. Chen ◽

Akshith Doddi ◽

Jimmy Royer ◽

Luca Freschi ◽

Marco Schito ◽

...

Keyword(s):

Machine Learning ◽

Drug Resistance ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Predictive Performance ◽

Multidrug Resistant ◽

Whole Genome ◽

Second Line ◽

Sequencing Data ◽

First Line

AbstractBackgroundThe diagnosis of multidrug resistant and extensively drug resistant tuberculosis is a global health priority. Whole genome sequencing of clinicalMycobacteriumtuberculosis isolates promises to circumvent the long wait times and limited scope of conventional phenotypic antimicrobial susceptibility, but gaps remain for predicting phenotype accurately from genotypic data.Methods and FindingsUsing targeted or whole genome sequencing and conventional drug resistance phenotyping data from 3,601Mycobacterium tuberculosisstrains, 1,228 of which were multidrug resistant, we investigated the use of machine learning to predict phenotypic drug resistance to 10 anti-tuberculosis drugs. The final model, a multitask wide and deep neural network (MD-WDNN), achieved improved high predictive performance: the average AUCs were 0.979 for first-line drugs and 0.936 for second-line drugs during repeated cross-validation. On an independent validation set, the MD-WDNN showed average AUCs, sensitivities, and specificities, respectively, of 0.937, 87.9%, and 92.7% for first-line drugs and 0.891, 82.0% and 90.1% for second-line drugs. In addition to being able to learn from samples that have only been partially phenotyped, our proposed multidrug architecture shares information across different anti-tuberculosis drugs and genes to provide a more accurate phenotypic prediction. We uset-distributed Stochastic Neighbor Embedding (t-SNE) visualization and feature importance analyses to examine inter-drug similarities.ConclusionsMachine learning is capable of accurately predicting resistant status using genomic information and holds promise in bringing sequencing technologies closer to the bedside.

Download Full-text

Machine Learning Predicts Accurately Mycobacterium tuberculosis Drug Resistance From Whole Genome Sequencing Data

Frontiers in Genetics ◽

10.3389/fgene.2019.00922 ◽

2019 ◽

Vol 10 ◽

Cited By ~ 7

Author(s):

Wouter Deelder ◽

Sofia Christakoudi ◽

Jody Phelan ◽

Ernest Diez Benavente ◽

Susana Campino ◽

...

Keyword(s):

Machine Learning ◽

Drug Resistance ◽

Mycobacterium Tuberculosis ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Sequencing Data

Download Full-text

Systematic Review of Whole-Genome Sequencing Data To Predict Phenotypic Drug Resistance and Susceptibility in Swedish Mycobacterium tuberculosis Isolates, 2016 to 2018

Antimicrobial Agents and Chemotherapy ◽

10.1128/aac.02550-19 ◽

2020 ◽

Vol 64 (5) ◽

Author(s):

Theresa Enkirch ◽

Jim Werngren ◽

Ramona Groenheit ◽

Erik Alm ◽

Reza Advani ◽

...

Keyword(s):

Drug Resistance ◽

Mycobacterium Tuberculosis ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Drug Susceptibility Testing ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Second Line ◽

Sequencing Data ◽

Phenotypic Drug Resistance

ABSTRACT In this retrospective study, whole-genome sequencing (WGS) data generated on an Ion Torrent platform was used to predict phenotypic drug resistance profiles for first- and second-line drugs among Swedish clinical Mycobacterium tuberculosis isolates from 2016 to 2018. The accuracy was ∼99% for all first-line drugs and 100% for four second-line drugs. Our analysis supports the introduction of WGS into routine diagnostics, which might, at least in Sweden, replace phenotypic drug susceptibility testing in the future.

Download Full-text

Whole genome sequencing data of 1110 Mycobacterium tuberculosis isolates identifies insertions and deletions associated with drug resistance

BMC Genomics ◽

10.1186/s12864-018-4734-6 ◽

2018 ◽

Vol 19 (1) ◽

Cited By ~ 3

Author(s):

Xi Zeng ◽

Jamie Sui-Lam Kwok ◽

Kevin Yi Yang ◽

Kenneth Siu-Sing Leung ◽

Mai Shi ◽

...

Keyword(s):

Drug Resistance ◽

Mycobacterium Tuberculosis ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Sequencing Data ◽

Insertions And Deletions

Download Full-text

Drug resistance profile and clonality of Plasmodium falciparum parasites in Cape Verde: the 2017 malaria outbreak

Malaria Journal ◽

10.1186/s12936-021-03708-z ◽

2021 ◽

Vol 20 (1) ◽

Author(s):

Silvania Da Veiga Leal ◽

Daniel Ward ◽

Susana Campino ◽

Ernest Diez Benavente ◽

Amy Ibrahim ◽

...

Keyword(s):

Drug Resistance ◽

Plasmodium Falciparum ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Family Members ◽

Cape Verde ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Sequencing Data ◽

High Level

Abstract Background Cape Verde is an archipelago located off the West African coast and is in a pre-elimination phase of malaria control. Since 2010, fewer than 20 Plasmodium falciparum malaria cases have been reported annually, except in 2017, when an outbreak in Praia before the rainy season led to 423 autochthonous cases. It is important to understand the genetic diversity of circulating P. falciparum to inform on drug resistance, potential transmission networks and sources of infection, including parasite importation. Methods Enrolled subjects involved malaria patients admitted to Dr Agostinho Neto Hospital at Praia city, Santiago island, Cape Verde, between July and October 2017. Neighbours and family members of enrolled cases were assessed for the presence of anti-P. falciparum antibodies. Sanger sequencing and real-time PCR was used to identify SNPs in genes associated with drug resistance (e.g., pfdhfr, pfdhps, pfmdr1, pfk13, pfcrt), and whole genome sequencing data were generated to investigate the population structure of P. falciparum parasites. Results The study analysed 190 parasite samples, 187 indigenous and 3 from imported infections. Malaria cases were distributed throughout Praia city. There were no cases of severe malaria and all patients had an adequate clinical and parasitological response after treatment. Anti-P. falciparum antibodies were not detected in the 137 neighbours and family members tested. No mutations were detected in pfdhps. The triple mutation S108N/N51I/C59R in pfdhfr and the chloroquine-resistant CVIET haplotype in the pfcrt gene were detected in almost all samples. Variations in pfk13 were identified in only one sample (R645T, E668K). The haplotype NFD for pfmdr1 was detected in the majority of samples (89.7%). Conclusions Polymorphisms in pfk13 associated with artemisinin-based combination therapy (ACT) tolerance in Southeast Asia were not detected, but the majority of the tested samples carried the pfmdr1 haplotype NFD and anti-malarial-associated mutations in the the pfcrt and pfdhfr genes. The first whole genome sequencing (WGS) was performed for Cape Verdean parasites that showed that the samples cluster together, have a very high level of similarity and are close to other parasites populations from West Africa.

Download Full-text

Evaluation of parameters affecting performance and reliability of machine learning-based antibiotic susceptibility testing from whole genome sequencing data

PLoS Computational Biology ◽

10.1371/journal.pcbi.1007349 ◽

2019 ◽

Vol 15 (9) ◽

pp. e1007349 ◽

Cited By ~ 20

Author(s):

Allison L. Hicks ◽

Nicole Wheeler ◽

Leonor Sánchez-Busó ◽

Jennifer L. Rakeman ◽

Simon R. Harris ◽

...

Keyword(s):

Machine Learning ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Antibiotic Susceptibility ◽

Susceptibility Testing ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Antibiotic Susceptibility Testing ◽

Sequencing Data ◽

Evaluation Of Parameters

Download Full-text

Whole Genome Sequencing Results Associated with Minimum Inhibitory Concentrations of 14 Anti-Tuberculosis Drugs among Rifampicin-Resistant Isolates of Mycobacterium Tuberculosis from Iran

Journal of Clinical Medicine ◽

10.3390/jcm9020465 ◽

2020 ◽

Vol 9 (2) ◽

pp. 465 ◽

Cited By ~ 5

Author(s):

Jalil Kardan-Yamchi ◽

Hossein Kazemian ◽

Simone Battaglia ◽

Hamidreza Abtahi ◽

Abbas Rahimi Foroushani ◽

...

Keyword(s):

Drug Resistance ◽

Mycobacterium Tuberculosis ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Drug Susceptibility ◽

Drug Susceptibility Testing ◽

Whole Genome ◽

Second Line ◽

Single Nucleotide Polymorphism Calling ◽

Broth Microdilution Method

Accurate and timely detection of drug resistance can minimize the risk of further resistance development and lead to effective treatment. The aim of this study was to determine the resistance to first/second-line anti-tuberculosis drugs in rifampicin/multidrug-resistant Mycobacterium tuberculosis (RR/MDR-MTB) isolates. Molecular epidemiology of strains was determined using whole genome sequencing (WGS)-based genotyping. A total of 35 RR/MDR-MTB isolates were subjected to drug susceptibility testing against first/second-line drugs using 7H9 Middlebrook in broth microdilution method. Illumina technology was used for paired-end WGS applying a Maxwell 16 Cell DNA Purification kit and the NextSeq platform. Data analysis and single nucleotide polymorphism calling were performed using MTBseq pipeline. The genome-based resistance to each drug among the resistant phenotypes was as follows: rifampicin (97.1%), isoniazid (96.6%), ethambutol (100%), levofloxacin (83.3%), moxifloxacin (83.3%), amikacin (100%), kanamycin (100%), capreomycin (100%), prothionamide (100%), D-cycloserine (11.1%), clofazimine (20%), bedaquiline (0.0%), and delamanid (44.4%). There was no linezolid-resistant phenotype, and a bedaquiline-resistant strain was wild type for related genes. The Beijing, Euro-American, and Delhi-CAS were the most populated lineage/sublineages. Drug resistance-associated mutations were mostly linked to minimum inhibitory concentration results. However, the role of well-known drug-resistant genes for D-cycloserine, clofazimine, bedaquiline, and delamanid was found to be more controversial.

Download Full-text

Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data

mSystems ◽

10.1128/msystems.00656-19 ◽

2020 ◽

Vol 5 (3) ◽

Cited By ~ 3

Author(s):

Nenad Macesic ◽

Oliver J. Bear Don’t Walk ◽

Itsik Pe’er ◽

Nicholas P. Tatonetti ◽

Anton Y. Peleg ◽

...

Keyword(s):

Machine Learning ◽

Klebsiella Pneumoniae ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Genomic Data ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Sequencing Data ◽

Pr Genes ◽

Content Type

ABSTRACT Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics.

Download Full-text

Detection of first and second line drug resistance mutations from multi drug resistantmycobacterium tuberculosisstrains by Ion Torrent whole genome sequencing (WGS)

10.1183/13993003.congress-2016.pa1910 ◽

2016 ◽

Author(s):

Ionut Pavel ◽

Ina Iuliana Macovei ◽

Simona Péter ◽

Daniela Diculencu ◽

Florin Rusu-Cordunean ◽

...

Keyword(s):

Drug Resistance ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

Second Line ◽

Ion Torrent ◽

Resistance Mutations ◽

Drug Resistance Mutations ◽

Line Drug

Download Full-text

1831. Machine Learning Approaches to Predicting Resistance in Pseudomonas aeruginosa

Open Forum Infectious Diseases ◽

10.1093/ofid/ofz359.093 ◽

2019 ◽

Vol 6 (Supplement_2) ◽

pp. S42-S42

Author(s):

David E Greenberg ◽

Jiwoong Kim ◽

Xiaowei Zhan ◽

Samuel A Shelburne ◽

...

Keyword(s):

Machine Learning ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Prediction Models ◽

Validation Cohort ◽

Advisory Board ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Learning Approaches ◽

Sequencing Data

Abstract Background Multi-drug-resistant (MDR) P. aeruginosa (PA) infections continue to cause significant morbidity and mortality in various patient groups including those with malignancies. Predicting antimicrobial resistance (AMR) from whole-genome sequencing data if done rapidly, could aid in providing optimal care to patients. Methods To better understand the connections between DNA variation and phenotypic AMR in PA, we developed a new algorithm, variant mapping and prediction of antibiotic resistance (VAMPr), to build association and machine learning prediction models of AMR based on publicly available whole-genome sequencing and antibiotic susceptibility testing (AST) data. A validation cohort of contemporary PA bloodstream isolates was sequenced and AST was performed. Accuracy of predicting AMR for various PA–drug combinations was calculated. Results VAMPr was built from 3,393 bacterial isolates (83 PA isolates included) from 9 species that contained AST data for 29 antibiotics. 14,615 variant genotypes were identified within the dataset and 93 association and prediction models were built. 120 PA bloodstream isolates from cancer patients were included for analysis in the validation cohort. ~15% of isolates were carbapenem resistant and ~20% were quinolone resistant. For drug-isolate combinations where >100 isolates were available, machine-learning prediction accuracies ranged from 75.6% (PA and ceftazidime; 90/119 correctly predicted) to 98.1% (PA and amikacin; 105/107 correctly predicted). Machine learning accurately identified known variants that strongly predicted resistance to various antibiotic classes. Examples included specific gyrA mutations (T83I; P < 0.00001) and quinolone resistance. Conclusion Machine learning predicted AMR in P. aeruginosa across a number of antibiotics with high accuracy. Given the genomic heterogeneity of PA, increased genomic data for this pathogen will aid in further improving prediction accuracy across all antibiotic classes. Disclosures Samuel L. Aitken, PharmD, Melinta Therapeutoics: Grant/Research Support, Research Grant; Merck, Sharpe, and Dohme: Advisory Board; Shionogi: Advisory Board

Download Full-text

S95 Identifying new hereditary haemorrhagic telangiectasia genes by applying a machine learning approach to screen whole genome sequencing data

10.1136/thorax-2019-btsabstracts2019.101 ◽

2019 ◽

Author(s):

S Xiao ◽

D Brown ◽

IG Mollet ◽

FS Govani ◽

D Patel ◽

...

Keyword(s):

Machine Learning ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome Sequencing Data ◽

Learning Approach ◽

Hereditary Haemorrhagic Telangiectasia ◽

Whole Genome ◽

Sequencing Data ◽

Machine Learning Approach

Download Full-text