Covalent and three-dimensional structure of the cyclodextrinase
from
Flavobacterium
sp. no. 92
Hanna B. Fritzsche, Torsten Schwede and Georg E. Schulz
Institut fu
¨
r Organische Chemie und Biochemie, Albert-Ludwigs-Universita
¨
t, Freiburg im Breisgau, Germany
Starting with oligopeptide sequences and using PCR, the
gene of the cyclodextrinase from Flavobacterium sp. no. 92
wasderivedfromthegenomicDNA.Thegenewas
sequenced and expressed in Escherichia coli; the gene pro-
duct was purified and crystallized. An X-ray diffraction
analysis using seleno-methionines with multiwavelength
anomalous diffraction techniques yielded the refined 3D
structure at 2.1 A
˚
resolution. The enzyme hydrolyzes a(1,4)-
glycosidic bonds of cyclodextrins and linear malto-oligo-
saccharides. It belongs to the glycosylhydrolase family no. 13
and has a chain fold similar to that of a-amylases, cyclo-
dextrin glycosyltransferases, and other cyclodextrinases. In
contrast with most family members but in agreement with
other cyclodextrinases, the enzyme contains an additional
characteristic N-terminal domain of about 100 residues. This
domain participates in the formation of a putative D
2
-sym-
metric tetramer but not in cyclodextrin binding at the active
formation. In these dimers, the N-terminal domain of one
subunit contacts the active center of the other subunit and
participates in CD binding. Moreover, it constricts the
active-center pocket, affecting substrate specificity, for
instance, by excluding large molecules such as starch
[11–13]. Beyond the common dimerization, BaCD forms a
hexamer of these dimers, i.e. a dodecamer in solution [9].
Here we investigate the cyclodextrinase from Flavobac-
terium sp. no. 92 (CDase) which hydrolyzes CDs and short
linear malto-oligosaccharides at comparable rates. The
enzyme exhibits only minor hydrolytic activity on the
a(1,4)-linkages of starch [14] and pullulan [15] but shows
considerable transglycosylation activity [16]. The sequence
and 3D structure of the enzyme is presented and compared
with related proteins.
Experimental procedures
Isolation and sequencing of the gene
The enzyme was purified from Flavobacterium sp. no. 92 as
described [4]. The N-terminal amino-acid sequence was
determined to be AAPTAIEHMEPPFW using Edman
degradation in a gas phase sequencer (Applied Biosystems).
Furthermore the enzyme was cleaved with CNBr, and the
sequences of the six resulting peptides were analyzed. One of
the fragments, with the sequence MPDRFANGDPSND,
was selected because it showed 60% amino-acid sequence
identity with several a-amylases and cyclodextrin glycosyl-
transferases (CGTases) in the SWISSPROT Data Bank. On
the basis of these two peptides the following two primers
were constructed (S denotes a C/G mixture and R stands for
Correspondence to G. E. Schulz, Institut fu
Genomic DNA from Flavobacterium sp. no. 92 was
prepared using a slightly modified protocol of Sambrook
et al. [18], partially digested with Sau3A and fractionated by
sucrose gradient centrifugation. Fragments ranging from 7
to 12 kb were used to prepare a genomic library in kZAP
Express DNA (Stratagene). The recombinant phages were
packaged in vitro with Gigapack II Packaging Extract
(Stratagene) and plated on Escherichia coli XL1-Blue MRF¢
(Stratagene) to a final concentration of 5000 pfu per plate
(diameter 15 cm). As determined by blue/white selection,
the library contained 55 000 independent plaques including
10% wild-type phages without inserts. Positive plaques were
identified by in situ hybridization with the radiolabeled
probe. They were subcloned in vivo into pBK-CMV
phagemides (Stratagene) by coinfection with the helper
phage M13 Exassist (Stratagene), and then analyzed with
restriction enzymes. A clone with the complete gene was
isolated. With the use of the dideoxy method [19], the gene
sequence was determined by PAGE and by more advanced
methods (SeqLab, Go
¨
ttingen, Sweden). The complete DNA
sequence has been deposited in the EMBL Nucleotide
Sequence Database under accession code AJ489171.
Expression, purification and crystallization
The CDase gene without the signal sequence was subcloned
into the expression vector pET22b+ (Novagene) using
restriction enzymes EcoRI and NdeI. Thereby, the first
alanine of the mature enzyme was replaced by a methionine.
The CDase gene was then expressed in E. coli strain
)1
protein. The yield of the purified protein was 8 mg per litre
of culture medium. For the crystallization experiments, the
CDase was dialyzed against deionized water.
For phasing the X-ray reflections with the multiwave-
length anomalous diffraction method, all methionines were
replaced with seleno-methionines. For this purpose, the
CDase was expressed at 25 °C in the methionine-auxo-
trophic E. coli strain B834(DE3) using a culture medium
containing seleno-
D
,
L
-methionine at a concentration of
50 mgÆL
)1
[20,21]. The purification procedure was similar to
that of the wild-type enzyme, but 3 m
M
dithiothreitol was
added to all buffers and solutions to avoid oxidation of the
incorporated seleno-methionines. The yield of Se-labeled
CDase was 6 mg per litre of culture medium and thus only
slightly lower than that of the wild-type enzyme.
Crystallization was carried out by the hanging drop
vapor diffusion method using a sparse matrix screen
(Hampton Research, La Jolla, CA, USA)
1
. After optimiza-
tion, the best crystal conditions for the wild-type enzyme
DESY Hamburg) at three different wavelengths, which
were selected on the basis of an X-ray fluorescence spectrum
taken from the same crystal. Data were processed and
scaled with the program suite
HKL
[22] bringing Friedel pairs
to the same scale. The positions of 48 selenium atoms were
determined with program
SHELX
-
D
[23]. Phases were cal-
culated with
SHELX
-
E
[24]andinasecondrunalsowith
program
SHARP
/
AUTOSHARP
[25]. The two resulting density
maps were of equal quality.
The model was built by a combination of
ARP
/
WARP
[26]
and manual operations using program
O
into the periplasm. The DNA sequence agreed with the
independently established amino-acid sequences of seven
peptides. The derived amino-acid sequence is given in Fig. 1
except for the 18-residue signal peptide. The native mature
protein consists of 601 residues with an M
r
of 67 946. On
the basis of sequence similarity, it belongs to the glyco-
sylhydrolase family no. 13 [33]. The four conserved
segments of family no. 13 (Fig. 1) represent the calcium
Ó FEBS 2003 Cyclodextrinase structure (Eur. J. Biochem. 270) 2333
site, Ca-I, and the three invariant catalytic acids, Asp311,
Glu340 and Asp418.
The 102 N-terminal residues of CDase form a domain
that is missing in most other members of family no. 13.
However, it is also present in the other structurally
established CD-degrading enzymes neopullulanase TVA-II
[10], maltogenic amylase ThMA [11], cyclodextrinase BaCD
[9], and a second neopullulanase that resembles TVA-II, but
is not yet available from the Protein Databank [34].
Furthermore, it is present in the a-amylase TVA-I [13],
a trehalohydrolase [35], and an isoamylase [36]. This
N-terminal domain is not present in the CD-producing
CGTases, which, however, contain about 150 additional
C-terminal residues that probably mediate starch binding
[37–39].
3D structure
The crystals of Se-labeled CDase belong to space group
R32 with unit cell dimensions a ¼ b ¼ 181.3 A
˚
peaks are almost exclusively in loop regions.
Following the assignments in related enzymes, CDase
was divided into four domains (Fig. 2). As a member of the
glycosylhydrolase family no. 13, its chain fold is similar to
that of known a-amylases consisting of a central TIM barrel
[40] (domain A, residues 103–516), with a 60-residue insert
after the third strand of the b-barrel (domain B, 223–282)
and a C-terminal domain (domain C, 517–601). In addition,
CDase contains an N-terminal domain (1–102), which
assumes a characteristic b-sandwich structure composed of
the antiparallel strands b1tob8. The N-terminal domain is
connected by an extended 10-residue linker to the TIM
barrel. It contacts the bulk of the molecule at helices a6, a7
and at domain B.
Domain A harbors the active center at the C-termini of the
TIM barrel b-strands. The loops at the C-terminal barrel end
connecting b-strands with the following a-helices are longer
and more complex than the loops at the opposite barrel end.
The lengths of the b-strands in the barrel vary from two (b14)
to seven (b11) residues. As in other enzymes of family no. 13,
the regularity of the CDase TIM barrel is broken by the
a-helices after the sixth b-barrel strand where helix a9
extends in the direction of the preceding strand b14 and only
the next helix a10 runs in the opposite direction (Fig. 2).
For historical reasons the large loop between the third
strand of the TIM barrel (b11) and helix a6 is called domain
B (indicated in Fig. 2). This inserted domain participates in
substrate binding and is rather variable. It is considered to
play a role in determining the enzyme specificity [41]. The
end of the TIM barrel domain A is connected to domain C
a
(%) 99.8 (99.8) 99.9 (99.9) 99.9 (99.9)
R
sym-I
a
(%) 7.7 (21) 5.1 (22) 6.0 (40)
Multiplicity
a
14.5 (14.4) 14.1 (14.1) 7.3 (7.3)
Average I/r
I
a
10.5 (3.4) 11.9 (3.1) 10.4 (1.9)
a
Values in parentheses refer to the outermost shell.
2334 H. B. Fritzsche et al.(Eur. J. Biochem. 270) Ó FEBS 2003
sheet, which supports the proposal that domain C stabilizes
the TIM barrel.
Like most other members of family no. 13, CDase
contains Ca
2+
ions. One of the two Ca
2+
ions in CDase is
at site Ca-I, which is widely conserved forming the first of
the four sequence fingerprints shown in Fig. 1 [42]. The
removal of Ca-I was shown to promote proteolysis [43,44].
Ca-I is co-ordinated by the side chains of Asp280 and
Ser222 at the beginning and end of domain B as well as by
the main-chain oxygens of Tyr315 (domain A) and Thr270
(%) 18.8 (21.9)
R
free
(%) (test set of 1991 reflections) 22.3 (27.8)
Rmsd bond lengths (A
˚
)/angles (°) 0.016/1.34
Ramachandran angles in:
Most favored region (%) 90.2
Allowed [generally allowed] region (%) 9.5 [0.3]
Fig. 2. Stereoview of a ribbon plot of CDase showing the N-terminal domain (red), the TIM-barrel domain A (blue), the inserted domain B (green) and
the C-terminal domain C (orange). The two Ca
2+
ions are represented by black spheres. The active center is indicated by the three invariant catalytic
residues (Fig. 1). All a-helices and b-strands are labeled.
Fig. 3. B-factor plots of the main chains of the two molecules of CDase in the asymmetric unit. The averages of molecules A (solid line) and B (broken
line) are 39 A
˚
2
and 44 A
˚
2
, respectively. The a-helices and b-strands are given for reference. Helices a1 through a14 and strands b9throughb16
comprise the TIM barrel. Domain B is inserted between b11 and a6. The seven 3
10
-helices are not indicated.
Ó FEBS 2003 Cyclodextrinase structure (Eur. J. Biochem. 270) 2335
co-ordinated by the side chains of Asp125, Asp146, Asn119
and Asn124, the main-chain oxygens of Gly144 and
Asp121, and by a water molecule. Ca-II stabilizes a surface
strong dimers. A preliminary size-exclusion chromato-
graphy run (Sephacryl 300S, 100 m
M
Hepes, pH 7.5) at
0.15
M
NaClcomparedwith3.8
M
NaCl in the crystals
showed a dominating dimer mixed with other oligomers.
Similar runs under other conditions have yet to be
performed to determine the detailed oligomerization
pattern in solution.
Fig. 4. D
2
-symmetric tetramer structure of CDase in the crystal together with the symmetry axes. (A) Front view placing the crystallographic twofold
axis horizontally in the paper plane. The crystallographic axis runs through the large interface and the vertical noncrystallographic axis runs
through the small interface between the N-terminal domains. One subunit is given in the colors and in an orientation similar to Fig. 2. A b-CD
(orange) derived from a superposition with the complex between b-CD and the homologous enzyme TVA-II [47] marks the active center. (B) View
from the left side of (A), which is along the crystallographic twofold axis, showing a smooth silhouette.
2336 H. B. Fritzsche et al.(Eur. J. Biochem. 270) Ó FEBS 2003
The chain fold of the N-terminal domain of CDase is
similar to that of the related CD-degrading enzymes TVA-II
[10], ThMA [11] and BaCD [9]. However, the positions of
these domains relative to the respective TIM barrel are
completely at variance as shown in Fig. 5. The N-terminal
domains of TVA-II, ThMA and BaCD attach to the active
center of the other subunit and participate in substrate
selection [11,12]. This dimer interface is not related to either
of the two interfaces in the CDase tetramer. It seems very
hexameric association in BaCD) and have similar catalytic
activities. Therefore, we classify them as the TVA-II group.
When comparing CDase with this group, only 380 of the
600 residues can be structure-aligned, and only 28% of
the aligned residues are identical. This renders CDase an
outlier among the structurally established CD-degrading
enzymes. As for the other enzymes, a-amylase TVA-I
shows considerable structural similarity to the TVA-II
group, although its function differs greatly (Table 3).
Moreover, the data reveal that the CD-degrading enzymes
are more similar to the CD-producing CGTase than to the
a-amylase TAKA.
A more obvious difference between CDase and the others
is the deviating spatial position of its N-terminal domain
Fig. 5. Stereoview of the superposition of
CDase(coloredasinFig.2)withTVA-II
(black, Ca
2+
at Ca-II grey) given as
Ca-backbone plots. The completely different
positions of the N-terminal domains and the
differences in domain B near Ca-I (right) are
clearly visible.
Table 3. Chain-fold comparisons within glycosylhydrolase family no. 13. The upper right triangle shows the number of Ca atoms aligned within the
3A
˚
distance criterion in superpositions of the complete polypeptide chains using program
LSQMAN
(30). The numbers in parentheses are the
percentages of identical residues in the aligned segments. For CDase, CGTase and TAKA, only domains A, B and C could be superimposed with
of CDase, CGTase and TAKA are fixed by Ca-I, which is
absent in the TVA-II group with their small B domains. The
surprisingly large difference between CDase on one hand
and the TVA-II group on the other corresponds to the
different oligomeric structures. CDase uses the long exten-
sion of its B domain to make an intimate contact across the
strong dimer interface with domains A and C of the other
subunit. In contrast, the TVA-II group dimer attaches the B
domain to an N-terminal domain of the other subunit,
which restricts the size of the B domain (Fig. 7).
Active center
The active center of CDase is depicted in Fig. 8, which
includes the superimposed structure of a TVA-II dimer with
bound b-CD [47]. Interestingly, the superposition causes a
Fig. 6. Structural alignment of the N-terminal domain of CDase with those of TVA-II [10], ThMA [11], BaCD [9], TVA-I [13], a trehalohydrolase [35]
and an isoamylase [36], which are the only structurally similar domains within glycosylhydrolase family no. 13. CD-degrading activity has been
reported for the top four enzymes. The secondary structure of CDase is given, and every 10th amino acid residue is marked by a dot. Residues
87–175 of the isoamylase have been omitted (marked #). All residues that superimpose within the 3 A
˚
distance criterion of program
LSQMAN
[30]
are underlined. For reference, strand b9 of the TIM barrel has been included, and the alignments with the TIM barrels are given in bold.
Fig. 7. Superposition of the inserted B domains
of CDase (green, His251 marked by a ball),
TVA-II (red), CGTase (blue) and TAKA (grey)
as aligned on the TIM barrels. TVA-II, ThMa
and BaCD are so similar that only one of them
was drawn out for clarity. As TVA-I varies
only slightly from TVA-II, it was omitted. The
Moreover, the conformation at the scissile bond in a CD
complex with CGTase showed a substantial deviation from
the circular symmetry [48]. These observations indicate that
the observed CD-binding position in TVA-II is most likely
displaced by about 3 A
˚
. For catalysis, the CD molecule has
to be pushed 3 A
˚
deeper into the active-center pocket and
deformed at its scissile bond [48]. Such a CD position has
not yet been observed in any crystal structure. It would
enable Phe274 of CDase (or Phe286 of TVA-II) to rotate
around its Ca–Cb bond and enter the CD hollow, as has
been implied for TVA-II [10] and for Tyr195 of CGTase
[49]. Crystal experiments to clarify the CD position in
CDase are under way.
The required induced fit and deformation of the bound
CD need energy, part of which may be derived from
co-operative effects in the CDase tetramer (or TVA-II
dimer) association. This proposal is consistent with the
observation of a higher rate of CD hydrolysis for dimeric
ThMA than for the monomeric ThMA [12]. Although such
an energy source is conceivable for the TVA-II group in
which the bound CD contacts the N-terminal domain of the
other subunit, it is also possible for CDase in which the
bound CD is very close to the long B-domain extension
(Fig. 7) as well as to A-domain and C-domain residues of
the other subunit (Fig. 4A). In fact, the interface mediating
the strong dimer association would appear to explain the
¨
rke 44, 321–315.
4. Bender, H. (1993) Purification and characterization of a cyclo-
dextrin-degrading enzyme from Flavobacterium sp. Appl. Micro-
biol. Biotechnol. 39, 714–719.
5. Oguma, T., Matsuyama, A., Kikuchi, M. & Nakano, E. (1993)
Cloning and sequence analysis of the cyclodextrinase gene from
Bacillus sphaericus and expression in Escherichia coli cells. Appl.
Microbiol. Biotechnol. 39, 197–203.
Fig. 8. Active-center region in a superposition of CDase (blue with light green domain B) with the TVA-II dimer (grey with dark green domain B and
pink N-terminal domain). The N-terminal domain of the other subunit of the TVA-II dimer is shown in red including Tyr45¢. The bound b-CD
molecule is from a complex with TVA-II [47]. Active-center residues of CDase are given as ball-and-stick models.
Ó FEBS 2003 Cyclodextrinase structure (Eur. J. Biochem. 270) 2339
6. Abe, J., Onitsuka, N., Nakano, T., Shibata, Y., Hizukuri, S. &
Entani, E. (1994) Purification and characterization of periplasmic
alpha-amylase from Xanthomonas campestris K-11151. J. Bacter-
iol. 176, 3584–3588.
7. Bender, H. (1995) Purification and characterisation of a soluble,
cytoplasmic decycling maltodextrinase from Lactobacillus sp.
strain 26X, isolated from kitchen waste water. Appl. Microbiol.
Biotechnol. 43, 838–843.
8. Feederle, R., Pajatsch, M., Kremmer, E. & Bo
¨
ck, A. (1996)
Metabolism of cyclodextrins by Klebsielle oxytoca m5a1: puri-
fication and characterisation of a cytoplasmatically located
cyclodextrinase. Arch. Microbiol. 165, 206–212.
9. Lee, H S., Kim, M S., Cho, H S., Kim, J I., Kim, T J., Choi,
J H., Park, C., Lee, H S., Oh, B H. & Park, K H. (2002)
Cyclomaltodextrinase, neopullulanase and maltogenic amylase
of the decycling maltodextrinase from Flavobacterium sp, 92.
Carbohydr. Res. 263, 137–147.
15. Bender, H. (1994) Studies of the degradation of pullulan by the
decycling maltodextrinase of Flavobacterium sp, 92. Carbohydr.
Res. 260, 119–130.
16. Bender, H. (1994) Studies of the transglycosylation reaction
catalysed by the decycling maltodextrinase of Flavobacterium sp,
92 with malto-oligosaccharides and cyclodextrins. Carbohydr. Res.
263, 123–135.
17. Feinberg, A.P. & Vogelstein, B. (1983) A technique for radio-
labeling DNA restriction endonuclease fragments to high specific
activity. Anal. Biochem. 132, 6–13.
18. Sambrook, J., Fritsch, E.F. & Maniatis, T. (1989) Molecular
Cloning: A Laboratory Manual, 2nd edn. Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY.
19. Sanger, F. (1981) Determination of nucleotide sequences in DNA.
Science 214, 1205–1210.
20. LeMaster, D.M. & Richards, F.M. (1985) 1H)15N heteronuclear
NMR studies of Escherichia coli thioredoxin in samples isotopi-
cally labeled by residue type. Biochemistry 24, 7263–7268.
21. Hendrickson, W.A., Horton, J.R. & Le Master, D.M. (1990)
Selenomethionyl proteins produced for analysis by multi-
wavelength anomalous diffraction (MAD): a vehicle for direct
determination of three-dimensional structure. EMBO J. 9, 1665–
1672.
22. Otwinowski, Z. & Minor, W. (1997) Processing of X-ray diffrac-
tion data collected in oscillation mode. Methods Enzymol. 276,
307–326.
23. Uson, I. & Sheldrick, G.M. (1999) Advances in direct methods for
protein crystallography. Curr. Opin. Struct. Biol. 9, 642–648.
suite: programs for protein crystallography. Acta Crystallogr.
Sect. D 50, 760–763.
31. Kraulis, P.J. (1991)
MOLSCRIPT
: a program to produce both
detailed and schematic plots of protein structures. J. Appl. Crys-
tallogr. 24, 946–950.
32. Merritt, E.A. & Bacon, D.J. (1997) Raster3D photorealistic
molecular graphics. Methods Enzymol. 277, 505–524.
33. Henrissat, B. & Bairoch, A. (1996) Updating the sequence-based
classification of glycosyl hydrolases. Biochem. J. 316, 695–696.
34. Hondoh, H., Kuriki, T. & Matsuura, Y. (2002) Three-dimensional
structure of Bacillus stearothermophilus neopullulanase. Biologia
(Bratisl.), 57, 77–82.
35. Feese, M.D., Kato, Y., Tamada, T., Kato, M., Komeda, T.,
Miura, Y., Hirose, M., Hondo, K., Kobayashi, K. & Kuroki, R.
(2000) Crystal structure of glycosyltrehalose trehalohydrolase
from the hyperthermophilic archaeum Sulfolobus solfataricus.
J. Mol. Biol. 301, 451–464.
36. Katsuya, Y., Mezaki, Y., Kubota, M. & Matsuura, Y. (1998)
Three-dimensional structure of Pseudomonas isoamylase at 2.2 A
˚
resolution. J. Mol. Biol. 281, 885–897.
37. Hofmann, B.E., Bender, H. & Schulz, G.E. (1989) Three-dimen-
sional structure of cyclodextrin glycosyltransferase from Bacillus
circulans at 3.4 A
˚
resolution. J. Mol. Biol. 209, 793–800.
38. Klein, C. & Schulz, G.E. (1991) Structure of cyclodextrin glyco-
syltransferase refined at 2.0 A
44. Machius, M., Wiegand, G. & Huber, R. (1995) Crystal structure
of calcium-depleted Bacillus licheniformis a-amylase at 2.2 A
˚
resolution. J. Mol. Biol. 246, 545–559.
45. Kadziola, A., Abe, J., Svensson, B. & Haser, R. (1994) Crystal
and molecular structure of barley a-amylase. J. Mol. Biol. 239,
104–121.
46. Matsuura, Y., Kusunoki, M., Harada, W. & Kakudo, M. (1984)
Structure and possible catalytic residues of Taka-amylase A.
J. Biochem. (Tokyo) 95, 697–702.
47. Kondo, S., Ohtaki, A., Tonozuka, T., Sakano, Y. & Kamitori, S.
(2001) Studies on the hydrolyzing mechanism for cyclodextrins of
Thermoactinomyces vulgaris R 47 a-amylase 2 (TVA II). X-ray
structure of the mutant E354A complexed with b-cyclodextrin,
and kinetic analyses on cyclodextrins. J. Biochem. (Tokyo) 129,
423–428.
48. Schmidt,A.K.,Cottaz,S.,Driguez,H.&Schulz,G.E.(1998)
Structure of cyclodextrin glycosyltransferase complexed with a
derivative of its main product b-cyclodextrin. Biochemistry 37,
5909–5915.
49. Parsiegla, G., Schmidt, A.K. & Schulz, G.E. (1998) Substrate
binding to a cyclodextrin glycosyltransferase and mutations
increasing the cyclodextrin production. Eur. J. Biochem. 255,
710–717.
Ó FEBS 2003 Cyclodextrinase structure (Eur. J. Biochem. 270) 2341