Title |
Identification of residue pairing in interacting β-strands from a predicted residue contact map
|
---|---|
Published in |
BMC Bioinformatics, April 2018
|
DOI | 10.1186/s12859-018-2150-1 |
Pubmed ID | |
Authors |
Wenzhi Mao, Tong Wang, Wenxuan Zhang, Haipeng Gong |
Abstract |
Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. Our algorithm RDb2C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb2C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb2C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb2C. Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C . |
Twitter Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 1 | 50% |
Unknown | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 1 | 50% |
Scientists | 1 | 50% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Unknown | 6 | 100% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 2 | 33% |
Researcher | 2 | 33% |
Professor > Associate Professor | 1 | 17% |
Student > Postgraduate | 1 | 17% |
Readers by discipline | Count | As % |
---|---|---|
Computer Science | 1 | 17% |
Agricultural and Biological Sciences | 1 | 17% |
Chemistry | 1 | 17% |
Unknown | 3 | 50% |