Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.

Teixeira PL, Mendenhall JL, Heinze S, Weiner B, Skwark MJ, Meiler J
PLoS One. 2017 12 (5): e0177866

PMID: 28542325 · PMCID: PMC5443516 · DOI:10.1371/journal.pone.0177866

De novo membrane protein structure prediction is limited to small proteins due to the conformational search space quickly expanding with length. Long-range contacts (24+ amino acid separation)-residue positions distant in sequence, but in close proximity in the structure, are arguably the most effective way to restrict this conformational space. Inverse methods for co-evolutionary analysis predict a global set of position-pair couplings that best explain the observed amino acid co-occurrences, thus distinguishing between evolutionarily explained co-variances and these arising from spurious transitive effects. Here, we show that applying machine learning approaches and custom descriptors improves evolutionary contact prediction accuracy, resulting in improvement of average precision by 6 percentage points for the top 1L non-local contacts. Further, we demonstrate that predicted contacts improve protein folding with BCL::Fold. The mean RMSD100 metric for the top 10 models folded was reduced by an average of 2 Å for a benchmark of 25 membrane proteins.

MeSH Terms (8)

Algorithms Amino Acid Sequence Humans Machine Learning Membrane Proteins Models, Molecular Protein Folding Protein Structure, Secondary

Connections (1)

This publication is referenced by other Labnodes entities: