Convolutional neural network-based annotation of bacterial type IV secretion system effectors with enhanced accuracy and reduced false discovery

2019 ◽  
Vol 21 (5) ◽  
pp. 1825-1836 ◽  
Author(s):  
Jiajun Hong ◽  
Yongchao Luo ◽  
Minjie Mou ◽  
Jianbo Fu ◽  
Yang Zhang ◽  
...  

Abstract The type IV bacterial secretion system (SS) is reported to be one of the most ubiquitous SSs in nature and can induce serious conditions by secreting type IV SS effectors (T4SEs) into the host cells. Recent studies mainly focus on annotating new T4SE from the huge amount of sequencing data, and various computational tools are therefore developed to accelerate T4SE annotation. However, these tools are reported as heavily dependent on the selected methods and their annotation performance need to be further enhanced. Herein, a convolution neural network (CNN) technique was used to annotate T4SEs by integrating multiple protein encoding strategies. First, the annotation accuracies of nine encoding strategies integrated with CNN were assessed and compared with that of the popular T4SE annotation tools based on independent benchmark. Second, false discovery rates of various models were systematically evaluated by (1) scanning the genome of Legionella pneumophila subsp. ATCC 33152 and (2) predicting the real-world non-T4SEs validated using published experiments. Based on the above analyses, the encoding strategies, (a) position-specific scoring matrix (PSSM), (b) protein secondary structure & solvent accessibility (PSSSA) and (c) one-hot encoding scheme (Onehot), were identified as well-performing when integrated with CNN. Finally, a novel strategy that collectively considers the three well-performing models (CNN-PSSM, CNN-PSSSA and CNN-Onehot) was proposed, and a new tool (CNN-T4SE, https://idrblab.org/cnnt4se/) was constructed to facilitate T4SE annotation. All in all, this study conducted a comprehensive analysis on the performance of a collection of encoding strategies when integrated with CNN, which could facilitate the suppression of T4SS in infection and limit the spread of antimicrobial resistance.

2018 ◽  
Author(s):  
Debnath Ghosal ◽  
Yi-Wei Chang ◽  
Kwang Cheol Jeong ◽  
Joseph P. Vogel ◽  
Grant J. Jensen

AbstractLegionella pneumophilasurvives and replicates inside host cells by secreting ~300 effectors through the Dot/Icm type IVB secretion system (T4BSS). Understanding this machine’s structure is challenging because of its large number of components (27) and integration into all layers of the cell envelope. Previously we overcame this obstacle by imaging the Dot/Icm T4BSS in its native state within intact cells through electron cryotomography. Here we extend our observations by imaging a stabilized mutant that yielded a higher resolution map. We describe for the first time the presence of a well-ordered central channel that opens up into a windowed large (~32 nm wide) secretion chamber with an unusual 13-fold symmetry. We then dissect the complex by matching proteins to densities for many components, including all those with periplasmic domains. The placement of known and predicted structures of individual proteins into the map reveals the architecture of the T4BSS and provides a roadmap for further investigation of this amazing specialized secretion system.


2021 ◽  
Vol 12 ◽  
Author(s):  
Lezheng Yu ◽  
Fengjuan Liu ◽  
Yizhou Li ◽  
Jiesi Luo ◽  
Runyu Jing

Gram-negative bacteria can deliver secreted proteins (also known as secreted effectors) directly into host cells through type III secretion system (T3SS), type IV secretion system (T4SS), and type VI secretion system (T6SS) and cause various diseases. These secreted effectors are heavily involved in the interactions between bacteria and host cells, so their identification is crucial for the discovery and development of novel anti-bacterial drugs. It is currently challenging to accurately distinguish type III secreted effectors (T3SEs) and type IV secreted effectors (T4SEs) because neither T3SEs nor T4SEs contain N-terminal signal peptides, and some of these effectors have similar evolutionary conserved profiles and sequence motifs. To address this challenge, we develop a deep learning (DL) approach called DeepT3_4 to correctly classify T3SEs and T4SEs. We generate amino-acid character dictionary and sequence-based features extracted from effector proteins and subsequently implement these features into a hybrid model that integrates recurrent neural networks (RNNs) and deep neural networks (DNNs). After training the model, the hybrid neural network classifies secreted effectors into two different classes with an accuracy, F-value, and recall of over 80.0%. Our approach stands for the first DL approach for the classification of T3SEs and T4SEs, providing a promising supplementary tool for further secretome studies.


2018 ◽  
Author(s):  
KwangCheol C. Jeong ◽  
Jacob Gyore ◽  
Lin Teng ◽  
Debnath Ghosal ◽  
Grant J. Jensen ◽  
...  

SummaryLegionella pneumophila, the causative agent of Legionnaires’ disease, survives and replicates inside amoebae and macrophages by injecting a large number of protein effectors into the host cells’ cytoplasm via the Dot/Icm type IVB secretion system (T4BSS). Previously, we showed that the Dot/Icm T4BSS is localized to both poles of the bacterium and that polar secretion is necessary for the proper targeting of theLegionellacontaining vacuole (LCV). Here we show that polar targeting of the Dot/Icm core-transmembrane subcomplex (DotC, DotD, DotF, DotG and DotH) is mediated by two Dot/Icm proteins, DotU and IcmF, which are able to localize to the poles ofL. pneumophilaby themselves. Interestingly, DotU and IcmF are homologs of the T6SS components TssL and TssM, which are part of the T6SS membrane complex (MC). We propose thatLegionellaco-opted these T6SS components to a novel function that mediates subcellular localization and assembly of this T4SS. Finally, in depth examination of the biogenesis pathway revealed that polar targeting and assembly of theLegionellaT4BSS apparatus is mediated by an innovative “outside-inside” mechanism.


eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Clarissa L Durie ◽  
Michael J Sheedlo ◽  
Jeong Min Chung ◽  
Brenda G Byrne ◽  
Min Su ◽  
...  

Legionella pneumophila is an opportunistic pathogen that causes the potentially fatal pneumonia Legionnaires’ Disease. This infection and subsequent pathology require the Dot/Icm Type IV Secretion System (T4SS) to deliver effector proteins into host cells. Compared to prototypical T4SSs, the Dot/Icm assembly is much larger, containing ~27 different components including a core complex reported to be composed of five proteins: DotC, DotD, DotF, DotG, and DotH. Using single particle cryo-electron microscopy (cryo-EM), we report reconstructions of the core complex of the Dot/Icm T4SS that includes a symmetry mismatch between distinct structural features of the outer membrane cap (OMC) and periplasmic ring (PR). We present models of known core complex proteins, DotC, DotD, and DotH, and two structurally similar proteins within the core complex, DotK and Lpg0657. This analysis reveals the stoichiometry and contact interfaces between the key proteins of the Dot/Icm T4SS core complex and provides a framework for understanding a complex molecular machine.


2020 ◽  
Author(s):  
Clarissa L. Durie ◽  
Michael J. Sheedlo ◽  
Jeong Min Chung ◽  
Brenda G. Byrne ◽  
Min Su ◽  
...  

AbstractLegionella pneumophila is an opportunistic pathogen that causes the potentially fatal pneumonia Legionnaires’ Disease. This infection and subsequent pathology require the Dot/Icm Type IV Secretion System (T4SS) to deliver effector proteins into host cells. Compared to prototypical T4SSs, the Dot/Icm assembly is much larger, containing ~27 different components including a core complex reported to be composed of five proteins: DotC, DotD, DotF, DotG, and DotH. Using single particle cryo-electron microscopy (cryo-EM), we report reconstructions of the core complex of the Dot/Icm T4SS that includes a symmetry mismatch between distinct structural features of the outer membrane cap (OMC) and periplasmic ring (PR). We present models of known core complex proteins, DotC, DotD, and DotH, and two structurally similar proteins within the core complex, DotK and Lpg0657. This analysis reveals the stoichiometry and contact interfaces between the key proteins of the Dot/Icm T4SS core complex and provides a framework for understanding a complex molecular machine.


2005 ◽  
Vol 73 (11) ◽  
pp. 7578-7587 ◽  
Author(s):  
Gal Yerushalmi ◽  
Tal Zusman ◽  
Gil Segal

ABSTRACT Legionella pneumophila, the causative agent of Legionnaires' disease, utilizes a type IVB secretion system to subvert its host cells and grow intracellularly. This type IV secretion system is composed of 25 icm (or dot) genes that probably constitute parts of a secretion complex as well as more than 30 proteins that are translocated via this system into the host cells. Three of the Icm/Dot proteins (DotD, DotC, and IcmN) contain a lipobox motif at their N terminals and are predicted to be lipoproteins. Two of these lipoproteins (DotD and DotC) were found to be essential for intracellular growth in both HL-60-derived human macrophages and in the protozoan host Acanthamoeba castellanii, while the third lipoprotein (IcmN) was found to be partially required for intracellular growth only in A. castellanii. Mutation analysis of the lipobox cysteine residue, which was shown previously to be indispensable for the lipobox function, indicated that both DotC and DotD are partially functional without this conserved residue. Cysteine mutations in both DotC and DotD or in DotC together with an icmN deletion or in DotD together with an icmN deletion were found to be additive, indicating that each of these lipoproteins performs its function independently from the others. Analysis of the transcriptional regulation of both the dotDC operon and the icmN gene revealed that both had higher levels of expression at stationary phase which were partially dependent on the LetA regulator. Our results indicate that the lipoproteins of the L. pneumophila icm (or dot) system are essential components of the secretion system and that they perform their functions independently.


2008 ◽  
Vol 190 (6) ◽  
pp. 1985-1996 ◽  
Author(s):  
Efrat Altman ◽  
Gil Segal

ABSTRACT Legionella pneumophila has been shown to utilize the icm/dot type IV secretion system for pathogenesis. This system was shown to be composed of icm/dot complex components and accessory proteins, as well as a large number of translocated substrates. Bioinformatic analysis of the regulatory regions of all the genes revealed that several icm/dot genes, as well as two genes encoding icm/dot translocated substrates, contain the conserved CpxR regulatory element, a regulator that has been shown previously to control the expression of the icmR gene. An experimental analysis, which included a comparison of gene expression in a L. pneumophila wild-type strain and gene expression in a cpxR deletion mutant, construction of mutants with mutations in the CpxR conserved regulatory elements, controlled expression studies, and mobility shift assays, demonstrated the direct relationship between the CpxR regulator and the expression of the genes. Furthermore, genomic analysis identified nine additional genes that contain a putative CpxR regulatory element; five of these genes (two legA genes and three ceg genes) were suggested previously to be putative icm/dot translocated substrates. The three ceg genes identified, which were shown previously to contain a putative PmrA regulatory element, were found here to be regulated by both CpxR and PmrA. The other six genes (two legA genes and four new genes products were found to be regulated by CpxR. Moreover, using the CyaA translocation assay, these nine gene products were found to be translocated into host cells in an Icm/Dot-dependent manner. Our results establish that the CpxR regulator is a fundamental regulator of the icm/dot type IV secretion system in L. pneumophila.


2017 ◽  
Vol 114 (51) ◽  
pp. 13543-13548 ◽  
Author(s):  
Jianpo Xu ◽  
Dandan Xu ◽  
Muyang Wan ◽  
Li Yin ◽  
Xiaofei Wang ◽  
...  

The type IVb secretion system (T4BSS) of Legionella pneumophila is a multiple-component apparatus that delivers ∼300 virulent effector proteins into host cells. The injected effectors modulate host cellular processes to promote bacterial infection and proliferation. IcmS and IcmW are two conserved small, acidic adaptor proteins that form a binary complex to interact with many effectors and facilitate their translocation. IcmS and IcmW can also interact with DotL, an ATPase of the type IV coupling protein complex (T4CP). However, how IcmS–IcmW recognizes effectors, and what the roles of IcmS–IcmW are in T4BSSs are unclear. In this study, we found that IcmS and IcmW form a 1:1 heterodimeric complex to bind effector substrates. Both IcmS and IcmW adopt new structural folds and have no structural similarities with known effector chaperones. IcmS has a compact global structure with an α/β fold, while IcmW adopts a fully α-folded, relatively loose architecture. IcmS stabilizes IcmW by binding to its two C-terminal α-helices. Photocrosslinking assays revealed that the IcmS–IcmW complex binds its cognate effectors via an extended hydrophobic surface, which can also interact with the C terminus of DotL. A crystal structure of the DotL–IcmS–IcmW complex reveals extensive and highly stable interactions between DotL and IcmS–IcmW. Moreover, IcmS–IcmW recruits LvgA to DotL and assembles a unique T4CP. These data suggest that IcmS–IcmW also functions as an inseparable integral component of the DotL–T4CP complex in the bacterial inner membrane. This study provides molecular insights into the dual roles of the IcmS–IcmW complex in T4BSSs.


mBio ◽  
2014 ◽  
Vol 5 (4) ◽  
Author(s):  
Aurélie Mousnier ◽  
Gunnar N. Schroeder ◽  
Charlotte A. Stoneham ◽  
Ernest C. So ◽  
James A. Garnett ◽  
...  

ABSTRACTLegionella pneumophila, the causative agent of Legionnaires’ disease, uses the Dot/Icm type IV secretion system (T4SS) to translocate more than 300 effectors into host cells, where they subvert host cell signaling. The function and host cell targets of most effectors remain unknown. PieE is a 69-kDa Dot/Icm effector containing three coiled-coil (CC) regions and 2 transmembrane (TM) helices followed by a fourth CC region. Here, we report that PieE dimerized by an interaction between CC3 and CC4. We found that ectopically expressed PieE localized to the endoplasmic reticulum (ER) and induced the formation of organized smooth ER, while following infection PieE localized to theLegionella-containing vacuole (LCV). To identify the physiological targets of PieE during infection, we established a new purification method for which we created an A549 cell line stably expressing theEscherichia colibiotin ligase BirA and infected the cells withL. pneumophilaexpressing PieE fused to a BirA-specific biotinylation site and a hexahistidine tag. Following tandem Ni2+nitrilotriacetic acid (NTA) and streptavidin affinity chromatography, the effector-target complexes were analyzed by mass spectrometry. This revealed interactions of PieE with multiple host cell proteins, including the Rab GTPases 1a, 1b, 2a, 5c, 6a, 7, and 10. Binding of the Rab GTPases, which was validated by yeast two-hybrid binding assays, was mediated by the PieE CC1 and CC2. In summary, using a novel, highly specific strategy to purify effector complexes from infected cells, which is widely applicable to other pathogens, we identified PieE as a multidomain LCV protein with promiscuous Rab GTPase-binding capacity.IMPORTANCEThe respiratory pathogenLegionella pneumophilauses the Dot/Icm type IV secretion system to translocate more than 300 effector proteins into host cells. The function of most effectors in infection remains unknown. One of the bottlenecks for their characterization is the identification of target proteins. Frequently usedin vitroapproaches are not applicable to all effectors and suffer from high rates of false positives or missed interactions, as they are not performed in the context of an infection. Here, we determine key functional domains of the effector PieE and describe a new method to identify host cell targets under physiological infection conditions. Our approach, which is applicable to other pathogens, uncovered the interaction of PieE with several proteins involved in membrane trafficking, in particular Rab GTPases, revealing new details of theLegionellainfection strategy and demonstrating the potential of this method to greatly advance our understanding of the molecular basis of infection.


2004 ◽  
Vol 72 (10) ◽  
pp. 5983-5992 ◽  
Author(s):  
Jessica A. Sexton ◽  
Jennifer L. Miller ◽  
Aki Yoneda ◽  
Thomas E. Kehl-Fie ◽  
Joseph P. Vogel

ABSTRACT Legionella pneumophila utilizes a type IV secretion system (T4SS) encoded by 26 dot/icm genes to replicate inside host cells and cause disease. In contrast to all other L. pneumophila dot/icm genes, dotU and icmF have homologs in a wide variety of gram-negative bacteria, none of which possess a T4SS. Instead, dotU and icmF orthologs are linked to a locus encoding a conserved cluster of proteins designated IcmF-associated homologous proteins, which has been proposed to constitute a novel cell surface structure. We show here that dotU is partially required for L. pneumophila intracellular growth, similar to the known requirement for icmF. In addition, we show that dotU and icmF are necessary for optimal plasmid transfer and sodium sensitivity, two additional phenotypes associated with a functional Dot/Icm complex. We found that these effects are due to the destabilization of the T4SS at the transition into the stationary phase, the point at which L. pneumophila becomes virulent. Specifically, three Dot proteins (DotH, DotG, and DotF) exhibit decreased stability in a ΔdotU ΔicmF strain. Furthermore, overexpression of just one of these proteins, DotH, is sufficient to suppress the intracellular growth defect of the ΔdotU ΔicmF mutant. This suggests a model where the DotU and IcmF proteins serve to prevent DotH degradation and therefore function to stabilize the L. pneumophila T4SS. Due to their wide distribution among bacterial species and their genetic linkage to known or predicted cell surface structures, we propose that this function in complex stabilization may be broadly conserved.


Sign in / Sign up

Export Citation Format

Share Document