Table 7. Total number of possible folding conformations for ten
proteins. The protein names and related information were obtained from
the UniProt database. Total number: the number of folding conformations
obtained according to the PFVM. The PFVMs for these ten proteins are
listed in the supplemental file.
Prediction of Most Possible Conformation and 3D Structure
The most possible conformation and 3D structure of a protein can be
predicted from its PFVM. For a given protein sequence, the local folding
variations are collected in the PFVM. For example, the PFVM of
SUMO1_HUMAN is shown in Table 1, the PFVM of P53_HUMAN in Table 3, and
the PFVMs of K4GSD6_9SAUR, C4IXC1_9TELE, A0A851ZE52_9AVES and EP3B_HUMAN
in the supplementary document. The alphabetic PFSC string in the top row
of the PFVM, named PFVM-01, represents the most possible folding
conformation. The PFVM-01 strings of these proteins are listed in
Table 8. In PFVM-01, each PFSC letter represents the folding shape of 5
consecutive amino acids in the sequence, and two adjacent PFSC letters
share 4 amino acids, so each PFVM-01 is a PFSC string that describes the
folding conformation from the N-terminus to the C-terminus. Because the
PFSC letters in PFVM-01 are the folding shapes in the top row of the
PFVM, PFVM-01 represents the most possible conformation of a protein.
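
The relationship between PFSC letters and residues described above can
be illustrated with a minimal Python sketch. It is not the authors'
implementation. The function names, the column-wise list representation
of a PFVM, the toy sequence and the toy letters are all assumptions made
for illustration; only the window length of 5 and the overlap of 4 come
from the text. Any special handling of the chain termini is not
modelled.

```python
from typing import List

WINDOW = 5  # residues covered by one PFSC letter (stated in the text)


def residue_windows(sequence: str, window: int = WINDOW) -> List[str]:
    """Return the successive 5-residue fragments, one per PFSC letter.

    Neighbouring fragments are shifted by one residue, so they share
    window - 1 = 4 residues.
    """
    if len(sequence) < window:
        return []
    return [sequence[i:i + window] for i in range(len(sequence) - window + 1)]


def top_row(pfvm_columns: List[List[str]]) -> str:
    """Read PFVM-01 as the first (top) letter of every PFVM column.

    Assumption: each column lists the local folding variations for one
    fragment, with the top-row letter first.
    """
    return "".join(column[0] for column in pfvm_columns)


# Hypothetical toy data, not taken from any protein in the paper.
toy_sequence = "MSDQEAKPS"
toy_pfvm = [["A", "B"], ["A"], ["V", "J", "H"], ["B", "A"], ["A"]]

windows = residue_windows(toy_sequence)
pfvm_01 = top_row(toy_pfvm)

# Pair each PFVM-01 letter with the 5-residue fragment it describes.
for letter, fragment in zip(pfvm_01, windows):
    print(letter, fragment)
```

Under this windowing assumption, a sequence of n residues yields n - 4
fragments, so the toy sequence of 9 residues is described by 5 letters.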