De novo computational design of retro-aldol enzymes
|
Using new algorithms that employ hashing techniques to construct active sites for multi-step reactions, we designed retro-aldolases that employ four different catalytic motifs to catalyze the breaking of a carbon-carbon bond in a non-natural substrate. Designs that utilized an explicit water molecule to mediate proton shuffling were significantly more successful, with rate accelerations of up to four orders of magnitude and multiple turnovers, than those involving charged sidechain networks. The atomic accuracy of the design process was confirmed by the X-ray crystal structure of active designs embedded in two protein scaffolds, both of which were nearly superimposable on the design model.
|
 |
Structures of artificial retro-aldol enzymes
(A, B, C). Examples of design models for active designs highlighting groups important for catalysis. The nucleophilic imine-forming lysine is in orange, the transition state model is in yellow, hydrogen bonding groups are in light green, and the catalytic water is shown explicitly. The designed hydrophobic binding site for the aromatic portion of the transition state model is indicated by the gray mesh. (D, E). Overlay of design model (purple) on X-ray crystal structure (green).
|
Other related papers:
Zanghellini A, Jiang L, Wollacott AM, Cheng G, Meiler J, Althoff EA, Rothlisberger D, Baker D (2006). New algorithms and an in silico benchmark for computational enzyme design. Protein Sci. 15, 2785-94.
|
Reference:
Jiang L, Althoff EA, Clemente FR, Doyle L, Röthlisberger D, Zanghellini A, Gallaher JL, Betker JL, Tanaka F, Barbas CF, Hilvert D, Houk KN, Stoddard BL, Baker D (2008).De novo computational design of retro-aldol enzymes.Science 319, 1387-91.
High Resolution Protein Structure Refinement
|
A longstanding problem in computational biology is the refinement of low resolution protein structure models to more atomic-level accurate structures. A related challenge is refining low-resolution NMR models to the quality of high-resolution structures. NMR is a valuable tool for determining protein structures, particularly because it does not require crystals. But some NMR structures, especially those determined from insufficient restraints or misinterpreted data, can be incorrect. Also, the core of an NMR structure can tend to be under-packed, possibly due to overlapping spectra. To tackle both challenges, comparative model refinement and NMR structure refinement, we have been developing the Rosetta high-resolution refinement protocol. This protocol involves focusing sampling on regions of the structure that are most likely to contain errors while allowing the whole structure to relax in a physically realistic all-atom forcefield.
|
 |
A stringent test of accuracy of protein structure models is the molecular replacement test. Molecular replacement solves the crystallographic phase problem by estimating the phases based on model. However, to be successful, the model has to be very close to the structure being solved (typically < 1.5 A). Comparative models used successfully for molecular replacement generally come from templates that share a sequence identity of > 50% with the native sequence. We have shown that models made by the Rosetta high resolution refinement protocols starting from comparative models ( < 20% sequence identity ) and NMR structures consistently provide good molecular replacement solutions.
|
Reference:
Computational Design of Protein-DNA Cleavage Specificity in a Homing Endonuclease
 |
High-resolution modeling of protein-DNA interactions has granted us the ability to estimate the specificities of real and hypothetical interfaces. This approach may be useful to design novel sequence-specific endonucleases for biotechnology and medicine.
The figure at left depicts the (crystallographically and biochemically validated) model of a strong switch in the basepair specificity of the endonuclease I-MsoI at a single symmetric position in the recognition region. Generalization of this method may allow us to broadly engineer novel DNA cleavage specificities using computational design.
|
Reference:
Design of a novel globular protein fold with atomic level accuracy
|
A major challenge of computational protein design is the creation of novel proteins with arbitrarily chosen three-dimensional structures. Here, we
used a general computional strategy that iterates between sequence design and structure prediction to design a 93-residue alpha/beta protein called Top7 with a novel sequence and topology. Top7 was found experimentally to be folded and extremely stable, and the x-ray crystal structure of Top7
is similar (root mean square deviation equals 1.2 angstroms) to the design model. The ability to design a new protein fold makes possible the
exploration of the large regions of the protein universe not yet observed in nature.
|
Superimposition of the Top7 computational model and x-ray structure shows the remarkable atomic-level accuracy of the design (1.2� RMSD).
The backbones are respresented as ribbons (computational model : helices - dark blue, strands - red; x-ray structure : helices - light blue,
strands - yellow), and selected amino-acid sidechains in the protein core are represented as sticks.
Reference:
Blind ab initio structure prediction of
CASP3 targets
|
Try our decoy structure test: Pick the structure made
by ROSETTA
that is closest to the native structure.
|
| Abstract |
To generate structures consistent with both the local and non-local interactions responsible for protein stability, 3 and 9 residue fragments of known structures with local sequences similar to the target sequence were assembled into complete tertiary structures using a Monte Carlo simulated annealing procedure (Simons, K.T. et al., J. Mol. Biol., 268:209-25, 1997 [Full Text PDF]). The scoring function used in the simulated annealing procedure consists of sequence dependent terms representing hydrophobic burial and specific pair interactions such as electrostatics and disulfide bonding, and sequence independent terms representing hard sphere packing, alpha-helix and beta-strand packing and the collection of beta-strands in beta-sheets [Simons, K.T. et al., Proteins, 34:82-95, 1999 ]. For each of 21 small, ab initio targets, 1200 final structures were constructed, each the result of 100,000 attempted fragment substitutions. The five structures submitted for the CASP III experiment were chosen from the ~25 structures with the lowest scores in the broadest minima (assessed through the number of structural neighbors, Shortle, D., et al., PNAS, 95:1158-62, 1998 [Full Text PDF]). The results were encouraging: highlights of the predictions include a 99 residue segment (target 79) for MarA with an rmsd of 6.4 angstroms to the native structure, a 95 residue (full length) prediction for the EH2 domain of EPS15 (target 74) with an rmsd of 6.0 angstroms, a 75 residue segment of DNAB helicase (target 56) with an rmsd of 4.7 angstroms, and a 67 residue segment of ribosomal protein L30 (target 77) with an rmsd of 3.8 angstroms. These results suggest that ab initio methods may soon become useful for low resolution structure prediction for proteins which lack a close homologue of known structure.
|
|
Simons, K.T., Bonneau, R., Ruczinski, I., Baker, D. (1999).
Ab initio Protein Structure Prediction of CASP III Targets Using ROSETTA.
Proteins 3, 171-176.
Experiment and theory highlight role of native state topology in SH3 folding
|
We use a combination of experiments, computer simulations and simple model calculations to characterize, first, the folding transition state ensemble of the src SH3 domain, and second, the features of the protein that determine its folding mechanism. Kinetic analysis of mutations at 52 of the 57 residues in the src SH3 domain revealed that the transition state ensemble is even more polarized than suspected earlier: no single alanine substitution in the N-terminal 15 residues or the C-terminal 9 residues has more than a two-fold effect on the folding rate, while such substitutions at 15 sites in the central three-stranded beta-sheet cause significant decreases in the folding rate. Molecular dynamics (MD) unfolding simulations and ab initio folding simulations on the src SH3 domain exhibit a hierarchy of folding similar to that observed in the experiments. The similarity in folding mechanism of different SH3 domains and the similar hierarchy of structure formation observed in the experiments and the simulations can be largely accounted for by a simple native state topology-based model of protein folding energy landscapes. |
|
Riddle, D.S., Grantcharova, V.P., Santiago, J.V., Alm, E., Ruczinski, I. and Baker, D. (1999).
Experiment and theory highlight role
of native state topology in SH3 folding . Nat Struct Biol 6, 1016-1024 [Full Text PDF]
Other related papers:
 |
Goldenberg, D. P. (1999). Finding the right fold. Nat Struct Biol 6, 987-990.
|
 |
Chiti, F., Taddei, N., White, P. M., Bucciantini, M., Magherini, F., Stefani, M., and Dobson, C. M. (1999).
Mutational analysis of acylphosphatase suggests the importance of topology and contact order in protein
folding. Nat Struct Biol 6, 1005-1009.
|
 |
Martinez, J. C. and Serrano, L. (1999). The folding transition state between SH3 domains is conformationally
restricted and evolutionarily conserved. Nat Struct Biol 6, 1010-1016.
|
A correlation between folding rate and contact order
Plaxco, K. W., Simons, K. T., and Baker, D. (1998). Contact order, transition state placement, and the refolding rates of single domain proteins.
J. Mol. Biol. 277, 985-994. [Full Text PDF]
The I-sites library of sequence-structure motifs
Bystroff, C. and Baker, D. (1998).
Local structure prediction using a library of sequence-structure motifs.
J. Mol. Biol. 281, 565-77.
[Full Text PDF]
Sequences of small proteins are not optimized for rapid folding
|
Distributions of free energies of unfolding and refolding rates
in randomized protein L variants compared to wild type. For more information you may view the
[Full Text PDF]
|
|
Kim, D. E., Gu, H., and Baker, D. (1998).
The sequences of small proteins are not extensively optimized for rapid folding by natural selection.
Proc. Natl. Acad. Sci. 95, 4982-4986. [Full Text PDF]
A folded, functional SH3 domain built largely from a five letter amino acid alphabet
Diagram showing the positions of simplified residues (I, K, E, A, G) in red for FP2 in the wild type SH3 structure. Side chains of residues involved in ligand binding are displayed and residues where simplification was not attempted are in light blue. The peptide ligand is shown in orange. Residues which did not tolerate simplification are in black.
More Information |
|
Riddle, D., Santiago, J., Grantcharova, V. and Baker, D. (1997).
Functional rapidly folding proteins from simplified amino acid sequences.
Nature Structural Biology 4, 805-809.
Last updated
Thu Mar 13 12:18:12 2008