Table 7. Total number of possible folding conformations for ten proteins respectively. The protein names with other information were obtained from UniProt database. Total number: the number of folding conformations was obtained according PFVM. The PFVM for these ten proteins are listed in the supplemental file.
Prediction of Most Possible Conformation and 3D Structure
The most possible conformation and 3D structure for protein can be predicted from its PFVM. With a protein sequence, the local folding variations are collected in PFVM. For examples, the PFVM of SUMO1_HUMAN is at Table 1; the PFVM of P53_HUMAN at Table 3 and the PFVM of K4GSD6_9SAUR, C4IXC1_9TELE, A0A851ZE52_9AVES and EP3B_HUMAN in supplementary document. The alphabetic PFSC string on top of PFVM, which is named as PFVM-01, represents the most possible folding conformation. Their PFVM-01 are listed on Table 8, which are the most possible folding conformations for proteins. In PFVM-01, each of PFSC letter represents the folding shape of 5 amino acids in sequence, two PFSC letters next each other share 4 amino acids, and then each PFVM-01 is a PFSC string for folding conformation from N-terminus to C-terminus. As the PFSC letters in PFVM-01 are on top folding shapes in PFVM, the PFVM-01 represents the most possible conformation for a protein.