John Capra
Last active: 3/3/2020

Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

Capra JA, Laskowski RA, Thornton JM, Singh M, Funkhouser TA
PLoS Comput Biol. 2009 5 (12): e1000585

PMID: 19997483 · PMCID: PMC2777313 · DOI:10.1371/journal.pcbi.1000585

Identifying a protein's functional sites is an important step towards characterizing its molecular function. Numerous structure- and sequence-based methods have been developed for this problem. Here we introduce ConCavity, a small molecule binding site prediction algorithm that integrates evolutionary sequence conservation estimates with structure-based methods for identifying protein surface cavities. In large-scale testing on a diverse set of single- and multi-chain protein structures, we show that ConCavity substantially outperforms existing methods for identifying both 3D ligand binding pockets and individual ligand binding residues. As part of our testing, we perform one of the first direct comparisons of conservation-based and structure-based methods. We find that the two approaches provide largely complementary information, which can be combined to improve upon either approach alone. We also demonstrate that ConCavity has state-of-the-art performance in predicting catalytic sites and drug binding pockets. Overall, the algorithms and analysis presented here significantly improve our ability to identify ligand binding sites and further advance our understanding of the relationship between evolutionary sequence conservation and structural and functional attributes of proteins. Data, source code, and prediction visualizations are available on the ConCavity web site (

MeSH Terms (14)

Algorithms Apoenzymes Area Under Curve Binding Sites Catalytic Domain Computational Biology Conserved Sequence Enzymes Evolution, Molecular Holoenzymes Models, Molecular Protein Binding Protein Conformation Proteins

Connections (1)

This publication is referenced by other Labnodes entities: