Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins

TitleImproved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins
Publication TypeJournal Article
Year of Publication1999
AuthorsSimons, K. T., Ruczinski I., Kooperberg C., Fox B. A., Bystroff C., & Baker D.
JournalProteins
Volume34
Issue1
Pagination82-95
Date Published1999 Jan 1
ISSN0887-3585
KeywordsModels, Statistical, Primary Publication, Protein Conformation, Protein Structure, Secondary, Protein Structure, Tertiary
Abstract

We describe the development of a scoring function based on the decomposition P(structure/sequence) proportional to P(sequence/structure) *P(structure), which outperforms previous scoring functions in correctly identifying native-like protein structures in large ensembles of compact decoys. The first term captures sequence-dependent features of protein structures, such as the burial of hydrophobic residues in the core, the second term, universal sequence-independent features, such as the assembly of beta-strands into beta-sheets. The efficacies of a wide variety of sequence-dependent and sequence-independent features of protein structures for recognizing native-like structures were systematically evaluated using ensembles of approximately 30,000 compact conformations with fixed secondary structure for each of 17 small protein domains. The best results were obtained using a core scoring function with P(sequence/structure) parameterized similarly to our previous work (Simons et al., J Mol Biol 1997;268:209-225] and P(structure) focused on secondary structure packing preferences; while several additional features had some discriminatory power on their own, they did not provide any additional discriminatory power when combined with the core scoring function. Our results, on both the training set and the independent decoy set of Park and Levitt (J Mol Biol 1996;258:367-392), suggest that this scoring function should contribute to the prediction of tertiary structure from knowledge of sequence and secondary structure.

Alternate JournalProteins
AttachmentSize
simons98A.pdf313.04 KB