Genomics-aided structure prediction
Sulkowska, Joanna I.; Morcos, Faruck; Weigt, Martin; Hwa, Terence; Onuchic, José N.
We introduce a theoretical framework that exploits the ever-increasing genomic sequence information for protein structure prediction. Structure-based models are modified to incorporate constraints from a large number of non-local contacts estimated by direct coupling analysis (DCA) of co-evolving genomic sequences. A simple hybrid method, called DCA-fold, that integrates DCA contacts with accurate knowledge of local information (e.g., the local secondary structure) is sufficient to fold proteins to a resolution of 1–3 Å.
protein folding; residue contact prediction; contact map estimation; residue-residue coevolution; statistical potentials
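
The abstract's central step, turning pairwise coupling scores into non-local contact constraints, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the minimum sequence separation of 5, the 8 Å cutoff, and the step-function contact term are all assumptions chosen for illustration, and the score matrix stands in for whatever a DCA calculation would produce.

```python
def select_nonlocal_contacts(scores, n_contacts, min_separation=5):
    """Pick the n_contacts highest-scoring residue pairs (i, j) with j - i >= min_separation.

    scores is a square list of lists of pairwise coupling scores; only the
    upper triangle is read. min_separation is an assumed threshold for
    what counts as "non-local".
    """
    length = len(scores)
    candidates = [
        (scores[i][j], i, j)
        for i in range(length)
        for j in range(i + min_separation, length)
    ]
    # Highest score first; ties are broken by residue indices for determinism.
    candidates.sort(key=lambda t: (-t[0], t[1], t[2]))
    return [(i, j) for _, i, j in candidates[:n_contacts]]


def contact_energy(distances, contacts, cutoff=8.0, epsilon=1.0):
    """Toy contact term: -epsilon for every selected pair closer than cutoff (in Å).

    A hypothetical stand-in for the contact potential of a structure-based
    model, not the functional form used in the paper.
    """
    satisfied = sum(1 for i, j in contacts if distances[i][j] <= cutoff)
    return -epsilon * satisfied
```

Under these assumptions, a conformation that satisfies more of the selected contacts receives a lower energy, which is the sense in which predicted contacts act as constraints on folding.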