Novel Family of Carbohydrate-Binding Modules Revealed by the Genome Sequence of Spirochaeta thermophila DSM 6192
ABSTRACTSpirochaeta thermophilais a thermophilic, free-living, and cellulolytic anaerobe. The genome sequence data for this organism have revealed a high density of genes encoding enzymes from more than 30 glycoside hydrolase (GH) families and a noncellulosomal enzyme system for (hemi)cellulose degradation. Functional screening of a fosmid library whose inserts were mapped on theS. thermophilagenome sequence allowed the functional annotation of numerous GH open reading frames (ORFs). Seven different GH ORFs from theS. thermophilaDSM 6192 genome, all putative β-glycanase ORFs according to sequence similarity analysis, contained a highly conserved novel GH-associated module of unknown function at their C terminus. Four of these GH enzymes were experimentally verified as xylanase, β-glucanase, β-glucanase/carboxymethylcellulase (CMCase), and CMCase. Binding experiments performed with the recombinantly expressed and purified GH-associated module showed that it represents a new carbohydrate-binding module (CBM) that binds to microcrystalline cellulose and is highly specific for this substrate. In the course of this work, the new CBM type was only detected inSpirochaeta, but recently we found sequences with detectable similarity to the module in the draft genomes ofCytophaga fermentansandMahella australiensis, both of which are phylogenetically very distant fromS. thermophilaand noncellulolytic, yet inhabit similar environments. This suggests a possibly broad distribution of the module in nature.