Glossary
Computational Protein Folding
Predicting the three-dimensional structure a protein adopts from its amino acid sequence alone — using computational methods to solve a problem that took decades of experimental effort per protein.
Computational protein folding is the problem of predicting how a protein will fold — what three-dimensional shape it will adopt — from its amino acid sequence alone. This is one of the foundational challenges in structural biology, because the shape of a protein determines its function, and understanding function is the gateway to understanding disease and designing drugs.
Proteins are chains of amino acids that fold into complex three-dimensional structures determined by the sequence of amino acids and the physical chemistry of molecular interactions. The sequence contains all the information needed for folding, but extracting that information computationally was, for decades, intractable for most proteins of biological interest.
The protein folding problem
The difficulty of protein folding prediction has several sources:
Conformational space: A protein with n residues can in principle adopt an astronomical number of distinct conformations. Brute-force search is computationally impossible.
Energy landscape complexity: The energy landscape governing folding has many local minima; finding the global minimum (the native fold) requires navigating a complex, high-dimensional surface.
Multiple stable states: Many proteins exist in multiple stable conformations that they switch between during function. Predicting the native fold is insufficient; predicting the functional conformational ensemble requires going further.
Validation: Comparing predictions to experiment requires crystallography, cryo-EM, or NMR data — each with its own limitations and artifacts.
Beyond static structure
Modern research increasingly focuses on dynamics rather than static structure:
Conformational ensembles: Predicting the distribution of structures a protein samples at physiological temperature, not just the single lowest-energy structure.
Molecular dynamics acceleration: Using AI to speed up physics-based simulation of how proteins move over time, enabling the study of processes that occur on timescales inaccessible to conventional simulation.
Allosteric communication: How perturbations at one site propagate through a protein to affect activity at distant sites — critical for drug design but invisible in static structure.
How Webbeon approaches Computational Protein Folding
Webbeon's biophysics research extends structure prediction toward dynamic function:
- 78% ensemble accuracy — predicting conformational ensembles rather than single structures
- 320x molecular dynamics speedup through adaptive coarse-graining
- 91.3% EC number accuracy — predicting enzymatic function from structure
- Systems-scale protein-protein interaction network modeling
Key facts
- Static protein structure prediction has largely been solved by deep learning methods; the frontier is now dynamics and function
- Webbeon's 320x MD speedup allows simulation of processes on millisecond timescales that were previously accessible only on timescales of microseconds
- Biophysics research and medicine research are tightly coupled at Webbeon: protein folding insights feed directly into drug discovery compound design
- 0.87 kinase selectivity AUC — predicting which kinase a compound will bind selectively, critical for avoiding off-target toxicity