TY - GEN
T1 - Deep learning reveals many more inter-protein residue-residue contacts than direct coupling analysis
AU - Zhou, Tian Ming
AU - Wang, Sheng
AU - Xu, Jinbo
N1 - KAUST Repository Item: Exported on 2021-04-19
PY - 2017/12/30
Y1 - 2017/12/30
N2 - AbstractIntra-protein residue-level contact prediction has drawn a lot of attentions in recent years and made very good progress, but much fewer methods are dedicated to inter-protein contact prediction, which are important for understanding how proteins interact at structure and residue level. Direct coupling analysis (DCA) is popular for intra-protein contact prediction, but extending it to inter-protein contact prediction is challenging since it requires too many interlogs (i.e., interacting homologs) to be effective, which cannot be easily fulfilled especially for a putative interacting protein pair in eukaryotes. We show that deep learning, even trained by only intra-protein contact maps, works much better than DCA for inter-protein contact prediction. We also show that a phylogeny-based method can generate a better multiple sequence alignment for eukaryotes than existing genome-based methods and thus, lead to better inter-protein contact prediction. Our method shall be useful for protein docking, protein interaction prediction and protein interaction network construction.
AB - AbstractIntra-protein residue-level contact prediction has drawn a lot of attentions in recent years and made very good progress, but much fewer methods are dedicated to inter-protein contact prediction, which are important for understanding how proteins interact at structure and residue level. Direct coupling analysis (DCA) is popular for intra-protein contact prediction, but extending it to inter-protein contact prediction is challenging since it requires too many interlogs (i.e., interacting homologs) to be effective, which cannot be easily fulfilled especially for a putative interacting protein pair in eukaryotes. We show that deep learning, even trained by only intra-protein contact maps, works much better than DCA for inter-protein contact prediction. We also show that a phylogeny-based method can generate a better multiple sequence alignment for eukaryotes than existing genome-based methods and thus, lead to better inter-protein contact prediction. Our method shall be useful for protein docking, protein interaction prediction and protein interaction network construction.
UR - http://hdl.handle.net/10754/668806
UR - http://biorxiv.org/lookup/doi/10.1101/240754
UR - http://www.scopus.com/inward/record.url?scp=85046153644&partnerID=8YFLogxK
U2 - 10.1101/240754
DO - 10.1101/240754
M3 - Conference contribution
SN - 9783319899282
SP - 295
EP - 296
BT - 22nd International Conference on Research in Computational Molecular Biology, RECOMB 2018
PB - Cold Spring Harbor Laboratory
ER -