文摘
Physicochemical and functional properties of proteins were modeled as a function of the contributionsof each of the 20 coded amino acids to three (z-scores) or five (extended z-scores) amino acid principalproperties using partial least squares regression. The five term models were in all cases stronger inboth fit and prediction than the three term models, indicating that useful information is contained inthe fourth and fifth property scores. Models predicting protein hydrophobicity (R = 0.932), viscosity(R = 0.737), and foam capacity (R = 0.880) from amino acid composition rather than sequencewere obtained. It is likely that additional functional and physicochemical properties of proteins canbe modeled in this way.Keywords: QSAR; functional properties; principal components analysis; partial least squares regression