动态图像序列建模与分类及其在人体运动分析中的应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
动态图像序列是由一系列具有相对次序的帧图像组成。除了具有与图像一样的空间特性外,动态图像序列还具有时序特性,即运动信息。动态图像序列建模对序列中的运动模式进行分析和识别,并用自然语言加以描述,是计算机视觉领域的重要研究方向之一,在智能监控、人机交互、动作分析等领域有非常重要的应用。
     有效的检测和描述图像序列中的运动信息是动态图像序列建模的核心问题。时间序列分析方法是一种成熟的解决时间序列问题的统计工具,但用于分析图像序列则会出现很多问题。本文以图模型理论,特别是分析时间序列的图模型——动态贝叶斯网络为基础,对隐马尔可夫模型和状态空间模型这两种最常用的处理时间序列的方法在处理图像序列时遇到的问题进行了讨论,在此基础上对这两个模型的学习方法做了改进,并提出了更适合描述动态图像序列的模型,通过人体运动分析对我们的工作做了检验和评估。
     为克服单个特征的片面性,对图像序列进行更全面、准确和可靠的描述,引入了因子隐马尔可夫模型作为特征级融合方法和并联隐马尔可夫模型作为决策级融合方法,并根据算法的评估结果及特征间的相关性,深入探讨了影响融合性能的因素。当选择特征间识别性能差异小和单个特征识别性能好的特征组合时,两种融合方法会有效的提高识别性能。在此基础上,特征间的相关性越小越好。
     针对动态纹理模型不适合描述二值图像序列的问题,提出了两种改进动态纹理模型。第一种是二值动态纹理模型,该模型利用二值图像服从贝努利分布的特点,采用二值主成分分析的方法学习模型参数;第二种是张量子空间动态纹理模型,该方法采用张量子空间分析的方法将二值图像序列转换为低维灰度图像序列,然后使用动态纹理模型进行描述。实验结果表明,这两种改进的动态纹理模型能更准确的描述二值图像序列。
     隐马尔可夫模型及其扩展形式在描述动态图像序列时的最大局限在于难以准确的选择隐状态,而改进的动态纹理模型作为线性模型,无法对图像序列进行全面的描述。为解决上述不足,本文提出了分层时间序列模型。该模型是一个两层统计模型,第一层采用分层线性来逼近非线性,将图像序列分割为几段,每段用动态纹理模型或改进动态纹理模型来描述;第二层将这些模型看作隐状态,并在此基础上建立隐马尔可夫模型来刻画它们之间的关系,其中隐马尔可夫模型的观测概率是观测与各模型的对应合成观测之间距离的函数。两层模型参数的组合即为分层时间序列模型的参数。实验结果表明,该模型克服了隐马尔可夫模型和动态纹理模型的缺陷,保留了动态纹理模型的合成及预测能力和隐马尔可夫模型对过程的描述能力,是一种很好的描述和识别动态图像序列的方法。
     针对步态数据库中由于去除背景等原因带来的人肢体信息缺失问题,本文还提出了帧差主体图的表征方法。该表征方法通过主体图表现每类的共性,用前一帧与当前帧的帧差图中值为正的部分来表示运动中带来的变化。通过结合动态和静态信息,该方法既能有效地弥补图像信息缺失的不足,又能体现人行走过程中的形态变化,提高识别率。
     此外,本文引入了McNemar检验的统计方法对不同算法进行评估,该方法对不同算法的多次实验的结果进行统计,计算得到不同算法间差异的一阶统计显著性,从而实现对算法性能的定量比较。与其它方法相比,这种方法对算法的评估更准确、更可靠。
Dynamic image sequence is composed of a series of image frames with comparatively given order. Besides the spatial characteristic as a single image, dynamic image sequence also possesses of temporal characteristic, which is motion information. Dynamic image sequence modeling analyses and identifies the movement patterns of the sequence and describes them with natural language. It is one of the most promising research topics of computer vision and has important application on smart surveillance, man-machine interactive, motion analysis and so on.
     Effective detecting and describing the dynamic information in the image sequence is the core issue of dynamic image sequence modeling. Time series analysis methods are perfect statistical tools for analyzing the time series. However, many problems appear when applying them to modeling dynamic image sequence. Based on the theory of graphical model, especially the special graphical model for time sequence - Dynamic Bayesian Networks, we discuss the problems arising during the application of the two most popular time series models (hidden Markov model and state-space model) to the dynamic image sequence representation and improve the learning method of the models. We also bring forward a new time series model which is more suitable for the image sequences and evaluate our work in human movement analysis.
     In order to complement the deficiency of single feature and describe the image sequence more comprehensive, exact and credible, we introduce the factorial hidden Markov model as a feature-level fusion method and the parallel hidden Markov model as a decision-level fusion method. According to the experimental results and the correlation between features, we analyse the factors impacting the fusion performance in depth. When choosing features with small performance diversity and good performance, the two fusion algorithms improve the recognition performance efficiently. Basing on this foundation, the lower is the features’relativity, the better.
     Because dynamic texture model is not applicable to represent binary image sequence, we propose two improved dynamic texture models. The first is the binary dynamic texture model, which considers that binary image submits to Bernoulli distribution and adopts binary principal component analysis to learn the model parameters. The second, tensor subspace dynamic texture model, employs tensor subspace analysis to transform binary image sequence to low dimensional gray image sequence and then use dynamic texture to describe it. The experimental results show that the two improved models can describe the image sequence more exactly.
     Hidden Markov model and its extension have great limitation on choosing the hidden states exactly. Improved dynamic texture models are linear and have difficulties in describing image sequence comprehensively. We present a layered time series model to resolve the aforementioned deficiencies. The proposed model is a two-level statistical model. In the first level, we employ segmental linearity to approximate nonlinearity. The image sequence is divided into several clusters and each cluster is described by the dynamic texture model or improved dynamic texture model. In the second level, these models are considered as the hidden states and the hidden Markov model is built to characterize the relationship among them. The observation probability of the hidden Markov model is a function of the distance between the observation and the corresponding synthesized observation of these models. The combined parameters of the two levels are the parameters of the hierarchical time series model. The experiments results show that this model overcomes the limitation of hidden Markov model and dynamic texture model and remains the synthesis and prediction ability of the dynamic texture model and the description capability of hidden Markov model for dynamic process. It is an excellent model for describing and identifying the image sequences.
     We propose a new representation method named frame difference energy image to depress the influence of silhouette incompleteness caused by background subtraction in the gait database. This representation preserves the static information of each cluster and the positive portion of the frame difference between the former frame and the current frame. Combining static and dynamic information, not only does this method efficiently make up the deficiency of missing image information, but also embodies the change of human shape during walking and improves the recognition rate.
     In addition, the McNemar’s test is introduced to evaluate different algorithms. This method needs many experimental results and a first order check on the statistical significance of an observed difference between algorithms is calculated to quantitatively compare their performance. Compared with other methods, this method is more accurate and credible for algorithm evaluation.
引文
[1] Ji Q.. 3D Face pose estimation and tracking from a monocular camera. Image and vision computing. 2002, 20(7): 499-511.
    [2] Hild M.. Estimation of 3D motion trajectory and velocity from monocular image sequences in the context of human gait recognition. in Proceedings of the 17th International Conference on Pattern Recognition. 2004.
    [3] Agarwal A., Triggs B.. Recovering 3D human pose from monocular images. IEEE Transactions on Pattern Analysis and Machine Intelligence.2006, 28(1): 44-58.
    [4] Howe N.R., Silhouette lookup for monocular 3d pose tracking. Image and Vision Computing. 2007, 25(3): 331-341.
    [5] Aggarwa J.K. and Park S.. Human motion: modeling and recognition of actions and interactions. in Proceedings of 2nd International Symposium on 3D data Processing, Visualization and Transmission. 2004.
    [6] Boulgouris N.V. and Chi Z.X.. Gait Recognition Based on Human Body Components. in Proceedings of IEEE International Conference on Image Processing, 2007.
    [7] Whittaker J.. Graphical models in applied multivariate statistics. Wiley New York. 1990.
    [8] Murphy K.P.. Dynamic Bayesian Networks: Representation, Inference and Learning. Doctor's paper of University of California. 2002.
    [9] Retal C.. A system for video surveillance and monitoring: VSAM final report. Technical Report: CMU-RI-TR-00-12. 2000.
    [10] Michael I.J.. Learning in graphical models. MIT Press.. 1999.
    [11] Malfait M. and Roose D.. Wavelet-based image denoising using a Markov random field a priorimodel. IEEE Transactions on Image Processing. 1997, 6(4): 549-565.
    [12] Chellappa R. and Chatterjee S.. Classification of textures using Gaussian Markov random fields. IEEE Transactions on Acoustics, Speech, and Signal Processing. 1985, 33(4): 959-963.
    [13] Murphy K., An introduction to graphical models. Rap. tech, 2001.
    [14] Norvig P. and Russell S.J.. Artificial Intelligence: A Modern Approach. Hall, Englewoods. 1995.
    [15] Pearl J.. Causality: Models, Reasoning, and Inference. Cambridge University Press. 2000.
    [16] Dempster A.P., Laird N.M., and Rubin D.B.. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. 1977. 39(1): 1-38.
    [17] Friedman N., Murphy K., and Russell S.. Learning the structure of dynamic probabilistic networks. in Proceedings of 14th Conf. on Uncertainty in Artificial Intelligence. 1998.
    [18] Bobick A.F. and Davis J.W.. The Recognition of Human Movement Using Temporal Templates. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001. 23: 257-267.
    [19] Carlsson S. and Sullivan J.. Action recognition by shape matching to key frames. in Workshop on Models versus Exemplars in Computer Vision. 2001.
    [20] Masoud O. and Papanikolopoulos N.. A Method For Human Action Recognition. Image and Vision Computing. 2003, 21(8): 729-743.
    [21] Wang L. and Suter D.. Informative Shape Representations for Human Action Recognition. in Proceedings of 18th International Conference on Pattern Recognition. 2006.
    [22] Blank M., Gorelick L., Shechtman, E.,et al., Actions as Space-Time Shapes. Transactions on Pattern Analysis and Machine Intelligence. 2007. 29(12): 2247-2253.
    [23] Fathi A. and Mori G.. Action Recognition by Learning Mid-level Motion Features. in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2008.
    [24] Bregler C.. Learning and recognizing human dynamics in video sequences. in Proceedins of IEEE Conference on Computer Vision and Pattern Recognition. 1997.
    [25] Madabhushi A. and Aggarwal J.K.. A Bayesian approach to human activity recognition. in Second IEEE Workshop on Visual Surveillance. 1999.
    [26] Fanti C., Zelnik-Manor L., and Perona P.. Hybrid models for human motion recognition.in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2005.
    [27] Wang Y., Sabzmeydani P., and Mori G.. Semi-Latent Dirichlet Allocation: A Hierarchical Model for Human Action Recognition. Lecture Notes in Computer Science. 2007. 4814: 240.
    [28] Wang Y., Mori G.. Learning a Discriminative Hidden Part Model for Human Action Recognition. in Proceedings of Advances in Neural Information Processing Systems. 2008.
    [29] Niyogi S.A. and Adelson E.H.. Analyzing and recognizing walking figures in XYT. in IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1994.
    [30] Murase, H. and Sakai R.. Moving object recognition in eigenspace representation: gait analysis and lip reading. Pattern Recognition Letters. 1996, 17(2): 155-162.
    [31] Cunado D., Nixon M.S., and Carter J.N.. Using gait as a biometric, via phase-weighted magnitude spectra. in Proceedings of Audio-and video-based biometric person authentication: first International Conference. 1997.
    [32] Little J. and Boyd J.. Recognizing people by their gait: the shape of motion. Videre: Journal of Computer Vision Research. 1998, 1(2): 1-32.
    [33] Huang P.S., Harris C.J., and Nixon M.S.. Visual surveillance and tracking of humans by face and gait recognition. in Proceedings of IFAC Symp. Artificial Intelligence in Real-Time Control. 1999.
    [34] Johnson, A.Y. and A.F. Bobick, A Multi-view Method for Gait Recognition Using Static Body Parameters. Lecture Notes in Computer Science. 2001: 301-311.
    [35] BenAbdelkader C., Cutler R. , and Davis L.. Stride and cadence as a biometric in automatic person identification and verification. in Proceedings of IEEE Conference on Face and Gesture Recognition. 2002.
    [36] Lee L. and Grimson W.E.L.. Gait analysis for recognition and classification. in Proceedings of IEEE Conference on Face and Gesture Recognition. 2002.
    [37] Yam C.Y., Nixon M.S., and Carter J.N.. On the relationship of human walking and running: automatic person identification by gait. in Proceeding of International Conference on Pattern Recognition. 2002.
    [38] Cunado D., Nixon M.S., and Carter J.N.. Automatic extraction and description of human gait models for recognition purposes. Computer Vision and Image Understanding. 2003, 90(1): 1-41.
    [39] Zhang R., Vogler C., and Metaxas D.. Human Gait Recognition. in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2004.
    [40] Huang P.S.. Automatic gait recognition via statistical approaches for extended template features. IEEE Transactions on Systems, Man, and Cybernetics, Part B. 2001, 31(5): 818-824.
    [41] Liu Y., Collins R.T., and Tsin Y.. Gait Sequence Analysis Using Frieze Patterns. Lecture Notes in Computer Science. 2002: 657-671.
    [42] Phillips P.J., Sarkar, S. Robledo, I., et al.. Baseline results for the challenge problem of human id using gait analysis. in Proceedings of IEEE Conference on Face and Gesture Recognition. 2002.
    [43] Wang L., Tan T., Hu W., et al., Automatic gait recognition based on statistical shape analysis. IEEE Transactions on Image Processing. 2003, 12(9): 1120-1131.
    [44] Mowbray S.D. and Nixon M.S.. Automatic Gait Recognition via Fourier descriptors of deformable objects. Lecture Notes in Computer Science. 2003: 566-573.
    [45] Kale A., Sundareasan A., Rajagopalan A. N., et al., Identification of humans using gait. IEEE Transactions on Image Processing. 2004, 13(9): 1163-1173.
    [46] Baum L.E. and Eagon J.A.. An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology. Bull. Amer. Math. Soc. 1967, 73: 360-363.
    [47] Bahl L. and Jelinek F.. Decoding for channels with insertions, deletions, and substitutions with applications to speech recognition. IEEE Transactions on Information Theory. 1975, 21(4): 404-411.
    [48] Jelinek F.. Continuous speech recognition by statistical methods. Proceedings of the IEEE USSR Joint Workshop on Information Theory. 1976, 64(4): 532-556.
    [49] Chen C., Liang J., Zhao H., et al. Gait recognition using Hidden Markov model. Lecture Notes in Computer Science. 2006.
    [50] Schwarz G.. Estimating the dimension of a model. Annals of Statistics. 1978, 6(2): 461-464.
    [51] Rissanen J.. Universal coding, information, prediction, and estimation. IEEE Transactions on Information Theory. 1984, 30(4): 629-636.
    [52] Zhang J., Huang Z. and Wang X.. Selection and analysis of HMM's state-number in speech recognition. in Proceedings of Fourth International Conference on Signal Processing. 1998: 641-645
    [53] Zimmermann M. and Bunke H.. Hidden Markov model length optimization for handwriting recognition systems. 2002.
    [54] Bakis, R., Continuous speech recognition via centisecond acoustic states. The Journal of the Acoustical Society of America. 1976, 59: S97.
    [55] Gunter S. and Bunke H.. Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer. in Proceedings of 7th International Conference on Document Analysis and Recogntion. 2003.
    [56] Knoblauch D.. Data Driven Number-of-States Selection in HMM Topologies. in Proceedings of Eighth International Conference on Spoken Language Processing. 2004: 665-668.
    [57] Celeux G. and Durand J.-B.. Selecting hidden Markov model state number with cross-validated likelihood. Computational Statistics. 2008. 23(4): 541-564.
    [58] Flexer A., Pampalk E., and Widmer G.. Hidden Markov models for spectral similarity of songs. in Proceedings of the 8th International Conference on Digital Audio Effects. 2005.
    [59] Hartigan J.A. and Wong M.A.. A K-means clustering algorithm. JR Stat. Soc. Ser. C-Appl. Stat. 1979, 28: 100-108.
    [60] Gross R. and Shi J.. The cmu motion of body (mobo) database. in Technical Report CMU-RI-TR-01- 18. 2001, Robotics Institute, Carnegie Mellon University.
    [61] Kale] A., Cuntoor N., and Chellappa R.. A framework for activity-specific human recognition. in Proceedings of the International Conference on Acoustics, Speech and Signal Processing. 2002.
    [62] Zhang R., Vogler C., and Metaxas D.. Human gait recognition. in Proceedings of Confererence on Computer Vision and Pattern Recognition Workshop. 2004.
    [63] Grimson W.E.L.. Gait Analysis for Recognition and Classification. in Proceedings of IEEE Conference on Face and Gesture Recognition. 2002: 155-161.
    [64] Wagg D.K. and Nixon M.S.. On automated model-based extraction and analysis of gait. in Proceedings of IEEE Conference on Automatic Face and Gesture Recognition. 2004.
    [65] Han J. and Bhanu B.. Statistical feature fusion for gait-based human recognition. in IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2004.
    [66] Tao D., Li X., Wu X., et al., General Tensor Discriminant Analysis and Gabor Features for Gait Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2007. 29(10): 1700-1715.
    [67] Yang X., Zhou Y., Zhang T., et al., Gait recognition based on dynamic region analysis. Signal Processing. 2008, 88(9): 2350-2356.
    [68] Lam T.H.W. and Lee R.S.T.. A new representation for human gait recognition: Motion Silhouettes Image (MSI). Lecture Notes in Computer Science. 2006, 3832:612.
    [69] Lam T.H.W., Lee R.S.T., and Zhang D.. Human gait recognition by the fusion of motion and static spatio-temporal templates. Pattern Recognition. 2007, 40(9): 2563-2573.
    [70] Liu J. and Zheng N.. Gait History Image: A Novel Temporal Template for Gait Recognition. in Proceedings of IEEE International Conference on Multimedia and Expo. 2007.
    [71] Ma Q., Wang S., Nie D., et al. Recognizing Humans Based on Gait Moment Image. in Proceedings of International Conference on Software Engineering. 2007.
    [72] Lee S., Liu Y., and Collins R.. Shape variation-based frieze pattern for robust gait recognition. in IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2007.
    [73] Ioannidis D., Tzovaras D., Damousis I.G., et al., Gait recognition using compact feature extraction transforms and depth information. IEEE Transactions on Information Forensics and Security. 2007, 2(3): 623-630.
    [74] Liu Z., Malave L., and Sarkar S.. Studies on silhouette quality and gait recognition. in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition. 2004: 704-711.
    [75] Liu Z. and Sarkar S.. Effect of silhouette quality on hard problems in gait recognition. IEEE Trans. Systems Man and Cybernetics. part B: Cybernetics. 2005, 35(2): 170-183.
    [76] Yu S., Tan D., Huang K., et al. Reducing the Effect of Noise on Human Contour in Gait Recognition. in Proceeding of International Conference of Biometrics. 2007.
    [77] Casia Gait Database, in http://www.cbsr.ia.ac.cn/english/Gait%20Databases.asp.
    [78] Wang L., Ning H., Tan T., et al., Fusion of static and dynamic body biometrics for gait recognition. IEEE Transactions on Circuits and Systems for Video Technology. 2004, 14(2):149-158.
    [79] Bazin A.I., Middleton L., and Nixon M.S.. Probabilistic Fusion of Gait Features for Biometric Verification. in Proceedings of International Conference on Information Fusion. 2005: 1211-1217.
    [80] Veres G.V., Nixon M.S., Middleton L., et al. Fusion of dynamic and static features for gait recognition over time. in Proceedings of International Conference on Information Fusion. 2005.
    [81] Veres G.V., Gordon L., Carter J N, et al. What image information is important in silhouette-based gait recognition? in Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004.
    [82] Chai Y., Wang Q., Jia J., et al. A novel gait recognition method via fusing shape and kinematics features. Lecture Notes in Computer Science. 2006, 4291.
    [83] Ekinci M.. Human identification using gait. Turkish Journal of Electrical Engineering and Computer Sciences. 2006, 14(2): 267-291.
    [84] Lu J. and Zhang E.. Gait recognition for human identification based on ICA and fuzzy SVM through multiple views fusion. Pattern Recognition Letters. 2007, 28(16): 2401-2411.
    [85] Tyagi A., Davis J.W., and Keck M.. Multiview fusion for canonical view generation based on homography constraints. in Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks. 2006.
    [86] Wang Y., Yu S., and Tan T.. Gait Recognition Based on Fusion of Multi-view Gait Sequences. Lecture Notes on Computer Science. 2005, 3832.
    [87] Huang G. and Wang Y.. Gender classification based on fusion of multi-view gait sequences. Lecture Notes on Computer Science. 2007, 4843.
    [88] Kale A., Roychowdhury A.K., and Chellappa R.. Fusion of gait and face for human identification. in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. 2004.
    [89] Lee T.K.M., Ranganath S., and Sanei S.. Fusion of chaotic measure into a new hybrid face-gait system for human recognition. in Proceedings of International Conference on Pattern Recognition. 2006.
    [90] Liu Z. and Sarkar S.. Outdoor recognition at a distance by fusing gait and face. Image and Vision Computing. 2007, 25(6): 817-832.
    [91] Geng X., Wang L., Li M., et al. Adaptive fusion of gait and face for human identification in video. in Proceedings of IEEE Workshop on Applications of Computer Vision. 2008.
    [92] Zhou X., Bhanu B., and Han. J. Human recognition at a distance in video by integrating face profile and gait. Lecture Notes on Computer Science. 2005, 3546.
    [93] Zhou X. and Bhanu B.. Feature fusion of face and gait for human recognition at a distance in video. in Proceedings of International Conference on Pattern Recognition. 2006.
    [94] Zhou X. and Bhanu B.. Integrating face and gait for human recognition. in Proceedings of Conference on Computer Vision and Pattern Recognition Workshop. 2006.
    [95] Zhou X. and Bhanu B.. Feature fusion of side face and gait for video-based human identification. Pattern Recognition. 2008, 41(3): p. 778-795.
    [96] Zhang T., Li X., Tao D., et al., Multimodal biometrics using geometry preserving projections. Pattern Recognition. 2008, 41(3): p. 805-813.
    [97] Ghahramani Z. and Jordan M.I.. Factorial hidden Markov models. Machine learning. 1997.29(2): 245-273.
    [98] Teague M.R.. Image analysis via the general theory of moments. Journal of the Optical Society of America. 1979, 70(8): 920-930.
    [99] Logan B. and Moreno P.J.. Factorial hidden Markov models for speech recognition: preliminary experiments. Cambridge Research Laboratories Technical Report. 1997, 7.
    [100] Gonzalez I., Dejean S., Martin P.G.P., et al., CCA: an R package to extend canonical correlation analysis. Journal of Statistical software. 2007, 23(12): p. 1-14.
    [101] Leurgans S.E., Moyeed R.A., and Silverman B.W.. Canonical correlation analysis when the data are curves. Journal of the Royal Statistical Society B. 1993, 55(3): 725-740.
    [102] Anderson T.W. and Krishnaiah P.R.. Multivariate Analysis. Academic Press. 1976, 25.
    [103] Clark A.F. and Clark C.. Performance characterization in computer vision a tutorial. 2004 Available from: http://peipa.esser.ac.uk/benchmark/tutorial/esser/tutorial.pdf.
    [104] Chan A. B. and Vasconcelos N.. Dynamic textures. in Proceeding of IEEE Conference of Computer Vision and Pattern Recognition. 2003.
    [105] Chan A.B. and Vasconcelos N.. Classifying video with kernel dynamic textures. in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition. 2007.
    [106] Schein A.I., Saul L.K., and Ungar L.H.. A generalized linear model for principal component analysis of binary data. in Proceedings of the 9 th International Workshop on Artificial Intelligence and Statistics. 2003.
    [107] He X., Cai D., and Niyogi P.. Tensor subspace analysis. in Proceeding of Ninth Annual Conference on Neural Information Processing Systems. 2005.
    [108] Martin R.J.. A metric for ARMA processes. IEEE Transactions on Signal Processing. 2000, 48(4): 1164-1170.
    [109] De Cock K. and De Moor B.. Subspace angles between linear stochastic models. in Proceeding of IEEE Conference on Decision and Control. 2000.
    [110] Rabiner L.R. A tutorial on hidden Markov models and selected applications in speech recognition. Proceeding of the IEEE. 1989, 77(2): 257-286.
    [111] Juang B.H. and Rabiner L.R.. Hidden Markov models for speech recognition. Technometrics. 1991: 251-272.
    [112] Cutting, D., et al. A practical part-of-speech tagger. in Proceedings of the Third Conference on Applied Natural Language Processing. 1992.
    [113] Hutchins W.J. and Somers H.L.. An introduction to machine translation. Academic Press New York. 1992.
    [114] Hu J., Brown M.K., and Turin W.. HMM based on-line handwriting recognition. IEEE Transactions on pattern analysis and machine learning. 1996,18(10): 1039-1045.
    [115] Samaria F.S.. Face recognition using hidden Markov models. University of Cambridge. 1994.
    [116] Pavlovic V., Rehg J. M., Cham T. J., et al. A dynamic Bayesian network approach to figure tracking usinglearned dynamic models. in Proceeding of International Conference on Computer Vision. 1999.
    [117] North B., Blake A., Isard M., et al., Learning and classification of complex dynamics. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2000, 22(9): 1016-1034.
    [118] Pavlovic V. and Rehg J.M.. Impact of dynamic model learning on classification of human motion. in Proceeding of Computer Vision and Pattern Recognition. 2000.
    [119] Zoubin G. and Geoffrey E.H.. Switching state-space model. Toronto M5S 3H5, 1996.
    [120] Antti H.. Nonlinear Switching State-Space Models. Hel Sinki University Of Technology. 2001.
    [121] Li Y., Wang T., and Shum H.Y.. Motion texture: a two-level statistical model for character motion synthesis. ACM Transactions on Graphics. 2002: 465-472.
    [122] Sangun K. andSangchul W.. Model predictive tracking control for Wiener model using dynamic output feedback. in Proceedings of SICE 2004 Annual Conference. 2004.
    [123] DiCarlo J.J. andCox D.D.. Untangling invariant object recognition. Trends in Cognitive Sciences. 2007, 11(8): 333-341.
    [124] Jhuang H.. A biologically inspired system for action recognition. in Proceedings of IEEE 11th International Conference on Computer Vision. 2007: 1-8.
    [125] Meyers E. and Wolf L.. Using biologically inspired features for face processing. International Journal of Computer Vision. 2008, 76(1): 93-104.
    [126] Felzenszwalb P.F. and Huttenlocher D.P.. Efficient matching of pictorial structures. in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2000.
    [127] Lawrence N.D. and Moore A.J.. Hierarchical Gaussian process latent variable models. in Proceedings of International Conference in Machine Learning. 2007: 481-488.
    [128] Nilsson J., Sha F., and Jordan M.I.. Regression on manifolds using kernel dimension reduction. in Proceedings of the 24th international conference on Machine Learning. 2007: 697-704.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700