文摘
It is shown that fuzzy search and data mining techniques of supersecondary structure homology for subunits of proteins using conformational code patterns of 伪-helix-type (3尾5伪4尾) and 尾-sheet-type (6伪4尾4尾) fragments can be used to extract correlations between fragments of MHC class I molecules and the light chain of immunoglobulins. The new method of conformational pattern analysis with fuzzy search of structural code homology reflects well the shape of main chain rather than secondary structure in comparison with the DSSP method. Further, the data mining technique using the combination of h- and s-fragment patterns can quantify the supersecondary structure homology between any subunits of proteins with different amino acid sequences. Characteristic fragment patterns (string 鈥渟hhshss鈥?, which were sandwiched between two identical amino acid sequences His and Pro, were found in light chains of various types of immunogloblins, 伪-chain and 尾-2 microglobulin of MHC class I and 伪-chain and 尾-chain of MHC class II, but not in heavy chains of Fab immunoglobulin fragments and T cell receptors (TCR). Leukocyte immunoglobulin-like receptors (LILR) are related by the conformational fragment (string 鈥渟hhshss鈥? to 尾-2 microglobulins as a type of pair forms (string 鈥渟ohsss鈥?. Further, human IgM rheumatoid factor, one of the immunogloblins, did not strongly exhibit the conformational fragment pattern. Nonclassic MHC class I molecules CD1D, MIC-A, and MIC-B, which have functions to activate NKT, NK, and T cells, did not also clearly show the patterns. These code-driven mining techniques can be utilized as a metadata-generating tool for systems biology to elucidate the biological function of such conformational fragments of MHC I and II molecules, which come in contact with various signal ligands on the surface of T cells and natural killer cells.