Alcock BP, Raphenya AR, Lau TTY, Tsang KK, Bouchard M, Edalatmand A, Huynh W, Nguyen A-LV, Cheng AA, Liu S, Min SY, Miroshnichenko A, Tran H-K, Werfalli RE, Nasir JA, Oloni M, Speicher DJ, Florescu A, Singh B, Faltyn M, Hernandez-Koutoucheva A, Sharma AN, Bordeleau E, Pawlowski AC, Zubyk HL, Dooley D, Griffiths E, Maguire F, Winsor GL, Beiko RG, Brinkman FSL, Hsiao WWL, Van Domselaar G, McArthur AG.
The Comprehensive Antibiotic Resistance Database (CARD; https://card.mcmaster.ca) is a curated resource providing reference DNA and protein sequences, detection models and bioinformatics tools on the molecular basis of bacterial antimicrobial resistance (AMR). CARD focuses on providing high-quality reference data and molecular sequences within a controlled vocabulary, the Antibiotic Resistance Ontology (ARO), designed by the CARD biocuration team to integrate with software development efforts for resistome analysis and prediction, such as CARD’s Resistance Gene Identifier (RGI) software. Since 2017, CARD has expanded through extensive curation of reference sequences, revision of the ontological structure, curation of over 500 new AMR detection models, development of a new classification paradigm and expansion of analytical tools. Most notably, a new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes. By adding these resistance variants to CARD, we are able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants. Here, we describe updates and recent expansions to CARD and its biocuration process, including new resources for community biocuration of AMR molecular reference data.
Glycopeptide antibiotics are produced by Actinobacteria through biosynthetic gene clusters that include genes supporting their regulation, synthesis, export and resistance. The chemical and biosynthetic diversities of glycopeptides are the product of an intricate evolutionary history. Extracting this history from genome sequences is difficult as conservation of the individual components of these gene clusters is variable and each component can have a different trajectory. We show that glycopeptide biosynthesis and resistance in Actinobacteria maps to approximately 150-400 million years ago. Phylogenetic reconciliation reveals that the precursors of glycopeptide biosynthesis are far older than other components, implying that these clusters arose from a pre-existing pool of genes. We find that resistance appeared contemporaneously with biosynthetic genes, raising the possibility that the mechanism of action of glycopeptides was a driver of diversification in these gene clusters. Our results put antibiotic biosynthesis and resistance into an evolutionary context and can guide the future discovery of compounds possessing new mechanisms of action, which are especially needed as the usefulness of the antibiotics available at present is imperilled by human activity.
More details at McMaster’s Brighter World.
Dr. McArthur and PhD student Kara Tsang taught together at the 2019 MacData Institute Summer School, with Dr. McArthur reviewing biocuration and bioinformatics for genomic surviellence of antimicrobial resistance and Kara following up with a lecture on machine learning techniques to predict clinical antimicrobial resistance from raw genomic sequence.
Also congratulations to Kara for being awarded a 2019 Faculty of Health Sciences Graduate Programs Excellence Award!
Updated August 6, 2019: Congratulations to Kara for also winning an Ontario Graduate Scholarship!
The Comprehensive Antibiotic Resistance Database has been updated, http://card.mcmaster.ca
CARD Curation: Expanded MCR, OXA & IMP beta-lactamase, and macrolide phosphotransferase (MPH) sequence curation. Updated nomenclature for MPHs and drug resistant dihydrofolate reductases (dfr). Updated classification of ADC beta-lactamases.
Ontologies: Addition of 518 draft virulence ontology (VIRO) terms.
Prevalence, Resistomes, & Variants: Expansion to 82 pathogens (more Brucella species), 81,000+ resistomes, and 173,000+ AMR allele sequences based on sequence data acquired from NCBI on 28-Feb-2019, analyzed using RGI 4.2.2 (DIAMOND homolog detection) and CARD 3.0.1.
During McMaster Spring Mid-Term Recess (February 18-24), the McArthur lab is pleased to present a series of lectures, demonstrations, and training sessions for the Comprehensive Antibiotic Resistance Database (card.mcmaster.ca) and its associated Resistance Gene Identifier (RGI) software, sponsored by the Michael G. DeGroote Institute for Infectious Disease Research (IIDR).
Questions? Email firstname.lastname@example.org
Workshop & Lecture material will be available here: https://github.com/arpcard/state-of-the-card-2019
Antimicrobial Resistance: Emergence, Transmission, and Ecology (ARETE). R. Beiko (PI; Dalhousie University), F. Brinkman (co-PI, Simon Fraser University), A.G. McArthur (co-Applicant) + 4 additional co-Applicants. Genome Canada Bioinformatics and Computational Biology Competition.
Bioinformatics Tools to Improve Data Sharing and Re-use in Public Health – applications in antimicrobial resistance profiling and source tracking. W. Hsiao (PI; University of British Columbia), A.G. McArthur (co-Applicant) + 8 additional co-Applicants. CIHR Project Grant.
Maguire, F., B. Alcock, F.S. Brinkman, A.G. McArthur, & R.G. Beiko. 2018. AMRtime: Rapid Accurate Identification of Antimicrobial Resistance Determinants from Metagenomic Data. Oral presentation at the Third American Society for Microbiology Meeting on Rapid Applied Microbial Next-Generation Sequencing and Bioinformatics Pipelines, Washington, D.C.
Abstract: Metagenomics, the direct sequencing of the mixture of genomes present in a sample, is an increasingly common workflow within the life sciences. It is frequently used to investigate previously intractable problems such as the functional characterisation of entire microbial environments. One such use-case of global and national public-health importance is analysing the nature and transmission dynamics of antimicrobial resistance (AMR) determinants in human, agri-food and environmental samples. Recently some tools have been developed to profile AMR from metagenomes, however, these are generally limited to profiling at the level of AMR genes clustered by % sequence identity, which may or may not be biologically meaningful. By exploiting the expertly curated ontological structure of the Comprehensive Antibiotic Resistance Database (CARD) and new CARD Prevalence datasets, we have developed an approach using a hierarchical set of machine learning classifiers. This allows us to produce gene-specific AMR profiles to 2386 determinants as well as profiles for higher order, biologically informed, AMR gene family groups. Firstly, DIAMOND based heuristically accelerated homology searches are used to filter out non-AMR related metagenomic reads. This filtering has been optimised to prioritise minimisation of false negatives over minimising false positives. Features generated from these homology searches as well as sequence features are then used to train a random forest classifier to classify filtered reads into one of 227 CARD AMR gene families (e.g. MCR phosphoethanolamine transferase). For each gene family an additional random forest classifier is trained to classify reads into one of the specific AMR determinants belonging to that family (e.g. MCR-1, MCR-2, MCR-3 etc.). This process involves very little computational overhead when classifying beyond the initial homology search. On a fully held out test-set of MiSeq reads simulated from the CARD canonical gene sequences this method resulted in an average precision and recall of 0.993 and 0.987 at the AMR gene family level. Within the 227 AMR families, 70% (158) had an average F1-score greater than 0.99 for classification to specific AMR determinants. A further 10% (24) averaged F1-scores between 0.8 and 0.99. In comparative analyses on the same dataset this outperformed homology searches alone, read mapping and variation graph based methods in terms of average overall accuracy and precision. Further work will aim to improve classification within certain families and expand AMRtime to include variant based AMR models as well as meta-models (e.g. multi-component efflux pump systems).
Welcome #TeamVirulence, left to right: Rachel Tran (Biochem 3R06), Sally Min (BiomedDC 4A15), Anatoly Miroshnichencko (BiomedDC 4A15), and Rafik El Werfalli (BiomedDC 4A15), who are collectively working on development of CARD:Virulence, a new branch of the Comprehensive Antibiotic Resistance Database dedicated to the molecular surveillance of bacterial virulence factors.