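Layered Transmission of Video Streams and Recognition of Text in Streams (视频流分层传输与流中文本识别)

English title: Layer Transmission of Video Stream and Recognition of Text Pattern in Stream
Author: Cong Jian (丛键)
Thesis level: Doctoral
Discipline: Communication and Information Systems
Chinese keywords: 粗骨架 ; 字符识别 ; 信息重建 ; 分层编码 ; 多分辨率分析
English keywords: rough skeleton ; character recognition ; information reconstruction ; layer coding ; multi-resolution analysis
Degree year: 2001
Advisor: Li Zaiming (李在铭)
Discipline code: 081001
Degree-granting institution: University of Electronic Science and Technology of China (电子科技大学)
Thesis submission date: 2001-10-01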

Abstract

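As network technology and resources develop, network-based video applications are becoming ever more common and place higher demands on network video transmission technology. Improving the transmission quality of real-time video services across different types of network has therefore become one of the current research hotspots. Meanwhile, as the amount of information carried by video grows rapidly, how to index video by content accurately and effectively has become a problem in urgent need of a solution, and describing video content through the text that appears in the video stream is currently a promising approach.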
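     The research in this thesis has two parts. First, we propose an algorithm for reconstructing data lost in video and images and, on that basis, study schemes for real-time video transmission over networks, proposing a layered coding scheme at the source, a scheme for organizing the data stream, and a post-processing scheme at the receiver. Second, we study the detection and recognition of text in video streams and propose the corresponding theoretical models and implementation methods, which gave good results when integrated into a practical application system.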
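     For network-based real-time video transmission, we carried out the following work. At the receiver, taking transform coding as the setting, we analyzed in depth the problem of reconstructing the information of a damaged image sub-block and built a model that reconstructs the lost information from the boundary signal of the sub-block. We also analyzed the space of sub-block boundary components of the transform basis signals, proposed a method for constructing an orthonormal basis of that space, and on this basis derived a fast algorithm that reconstructs the transform coefficients of a sub-block from its boundary signal. We further describe the successful application of this technique to removing the blocking effect of transform coding. Building on receiver-side reconstruction of lost information, we analyzed the transmission characteristics of ATM networks and packet-switched networks, designed source-side layered coding schemes and video stream organization schemes for VBR services in ATM networks and for packet-switched networks respectively, and proposed corresponding schemes at the receiver for reconstructing the information lost with dropped packets and cells.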
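The boundary-based reconstruction idea can be illustrated with a small sketch. The abstract does not give the algorithm, so everything below is an assumption made for illustration: the transform is taken to be an 8x8 DCT-II, the "boundary signal" is taken to be the outer ring of pixels of the block (in practice it would have to be estimated from adjacent pixels of neighbouring blocks), only six low-frequency coefficients are recovered, and the names `dct_basis`, `boundary`, `orthonormalize`, `reconstruct_coeffs`, and `synthesize` are hypothetical. The sketch orthonormalizes the boundary components of the basis images once, then recovers the coefficients of a block by projection and back-substitution.

```python
import math

def dct_basis(u, v, n=8):
    """Orthonormal 2-D DCT-II basis image for frequency (u, v)."""
    def scale(k):
        return math.sqrt((1.0 if k == 0 else 2.0) / n)
    return [[scale(u) * scale(v)
             * math.cos((2 * x + 1) * u * math.pi / (2 * n))
             * math.cos((2 * y + 1) * v * math.pi / (2 * n))
             for y in range(n)] for x in range(n)]

def boundary(block):
    """Outer ring of a block as a flat vector (assumed 'boundary signal')."""
    n = len(block)
    return [block[x][y] for x in range(n) for y in range(n)
            if x in (0, n - 1) or y in (0, n - 1)]

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def orthonormalize(vectors):
    """Modified Gram-Schmidt, so that vectors[j] == sum_i r[i][j] * q[i]."""
    k = len(vectors)
    q, r = [], [[0.0] * k for _ in range(k)]
    for j, v in enumerate(vectors):
        w = list(v)
        for i, qi in enumerate(q):
            r[i][j] = dot(qi, w)
            w = [a - r[i][j] * b for a, b in zip(w, qi)]
        r[j][j] = math.sqrt(dot(w, w))  # zero would mean dependent boundary vectors
        q.append([a / r[j][j] for a in w])
    return q, r

def reconstruct_coeffs(boundary_signal, q, r):
    """Project onto the orthonormal boundary basis, then back-substitute."""
    k = len(q)
    proj = [dot(qi, boundary_signal) for qi in q]
    coeffs = [0.0] * k
    for i in reversed(range(k)):
        rest = sum(r[i][j] * coeffs[j] for j in range(i + 1, k))
        coeffs[i] = (proj[i] - rest) / r[i][i]
    return coeffs

def synthesize(coeffs, freqs, n=8):
    """Inverse transform restricted to the chosen frequencies."""
    block = [[0.0] * n for _ in range(n)]
    for c, (u, v) in zip(coeffs, freqs):
        basis = dct_basis(u, v, n)
        for x in range(n):
            for y in range(n):
                block[x][y] += c * basis[x][y]
    return block

# Low-frequency terms whose boundary restrictions are linearly independent.
FREQS = [(0, 0), (0, 1), (1, 0), (1, 1), (0, 2), (2, 0)]
# Depends only on the transform, so it is computed once for all blocks.
Q, R = orthonormalize([boundary(dct_basis(u, v)) for u, v in FREQS])

true_coeffs = [120.0, -8.0, 5.0, 2.0, -3.0, 1.5]   # toy data
lost_block = synthesize(true_coeffs, FREQS)
estimate = reconstruct_coeffs(boundary(lost_block), Q, R)
```

Because `Q` and `R` are fixed for a given transform and coefficient set, the per-block cost in this sketch is six inner products over the 28 boundary pixels plus a small triangular solve, which is the sense in which an orthonormal boundary basis can make the reconstruction fast. The frequency set matters: some combinations, such as (0,0), (0,2), (2,0), and (2,2) together, have linearly dependent boundary components and cannot be separated from the ring alone.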
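     Detecting and recognizing text in video streams involves three tasks: detecting and locating text regions in the video stream; detecting and extracting character objects within a text region; and recognizing the character objects. This thesis contributes to all three. First, by analyzing the signal features of text in general video images, we built a detection model for text regions in video streams and proposed a text region detection technique that uses multi-scale blurring of the image together with the multi-resolution analysis of wavelet theory and combines the overall features of a region with its texture features. For character object detection, we built a model for detecting, within a general set of objects, the subset whose features follow a given regularity, and proposed a detection method based on the spatial distribution of character objects in a text region; for its implementation we introduced the concept of a distance generation matrix and a fast implementation technique based on it. Recognizing a character object consists of two steps, extracting recognition features and recognition itself. For feature extraction, we proposed the concept of the rough skeleton of a character and a corresponding skeleton feature extraction technique that does not rely on thinning: the character is decomposed into components, local skeletons are extracted, and they are connected into an overall skeleton, giving a skeleton-based description of the character's geometric shape at a given scale. From the skeleton features, using the theory and methods of graph theory, we proposed a technique for extracting recognition features that include both stroke features and stroke-structure features. For recognition, we use the stroke features of the character and introduce fuzzy recognition theory to obtain a fast character recognition technique with good resistance to interference.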
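As a rough illustration of how multi-resolution analysis can expose text-like texture, the sketch below computes the detail energy of a Haar decomposition at successive scales and thresholds it. This is not the detection model of the thesis, which the abstract does not specify: the choice of the Haar wavelet, the energy measure, the threshold, and the names `haar_level`, `texture_pyramid`, and `candidate_mask` are all assumptions.

```python
def haar_level(img):
    """One level of a 2-D Haar decomposition of an image with even sides.

    Returns (approximation, detail_energy), both at half resolution.
    """
    approx, energy = [], []
    for r in range(0, len(img), 2):
        a_row, e_row = [], []
        for c in range(0, len(img[0]), 2):
            p, q = img[r][c], img[r][c + 1]
            s, t = img[r + 1][c], img[r + 1][c + 1]
            a = (p + q + s + t) / 4.0    # local average (low-pass)
            dh = (p - q + s - t) / 4.0   # change across columns
            dv = (p + q - s - t) / 4.0   # change across rows
            dd = (p - q - s + t) / 4.0   # diagonal change
            a_row.append(a)
            e_row.append(dh * dh + dv * dv + dd * dd)
        approx.append(a_row)
        energy.append(e_row)
    return approx, energy

def texture_pyramid(img, levels):
    """Detail-energy maps from fine to coarse scale."""
    maps = []
    for _ in range(levels):
        img, energy = haar_level(img)
        maps.append(energy)
    return maps

def candidate_mask(energy, threshold):
    """Cells whose detail energy exceeds an (assumed) threshold."""
    return [[e > threshold for e in row] for row in energy]

# Toy frame: flat background on the left, fine vertical strokes on the right.
image = [[0] * 4 + [0, 255, 0, 255] for _ in range(8)]
levels = texture_pyramid(image, 2)
mask = candidate_mask(levels[0], 1000.0)
```

In this toy frame the striped half produces large detail energy at the finest scale and none at the next one, while the flat half produces none at either scale. A practical detector would combine such per-scale texture responses with region-level features, as the abstract describes.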
Abstract (author's English version)

Video applications over networks have become increasingly common as network technology and resources have improved, and they place higher demands on network video transmission technology. Improving the transmission quality of real-time video over networks has therefore become an active research field. At the same time, as the volume of video-based information grows rapidly, indexing video content accurately and efficiently has become a pressing research problem, and using the text that appears in video streams to describe the content is a promising approach.
    This thesis covers two lines of work. First, an algorithm for reconstructing lost data in videos and images is proposed. Building on this reconstruction technique and an analysis of real-time video transmission over networks, we develop a layered coding scheme at the sender, a structure for the data stream, and a post-processing scheme at the receiver. The successful application of this technique to reducing blocking effects in transform coding is also described. Second, we study the detection and recognition of text in video streams and propose corresponding implementation methods, which performed well when used in a practical application system.
    For real-time video transmission, we made the following progress. At the receiver, we analyzed the problem of reconstructing information lost from an image sub-block under transform coding, and we propose a reconstruction technique, together with a fast algorithm, that rebuilds the sub-block from its boundary information. We also applied this method successfully to reducing blocking effects in transform coding. Building on this reconstruction technique, we analyzed the transmission characteristics of ATM and packet-switched networks and propose a sender-side layered coding scheme and a video stream format for each. At the receiver, we propose a method for reconstructing the information lost when packets or cells are dropped in transmission.
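One common way to layer a transform-coded stream is to split each block's coefficients by frequency, sending the low-frequency part as a base layer and the rest as an enhancement layer. The abstract does not say how the thesis forms its layers, so the sketch below is only an assumed example of the general idea; the zig-zag scan, the split point, and the names `zigzag_order`, `split_layers`, and `merge_layers` are illustrative choices.

```python
def zigzag_order(n=8):
    """Positions of an n x n coefficient block from low to high frequency."""
    def key(rc):
        r, c = rc
        s = r + c
        # Odd diagonals run top to bottom, even diagonals bottom to top.
        return (s, r if s % 2 else c)
    return sorted(((r, c) for r in range(n) for c in range(n)), key=key)

def split_layers(coeffs, base_count):
    """Base layer: first base_count zig-zag coefficients. Enhancement: the rest."""
    scan = [coeffs[r][c] for r, c in zigzag_order(len(coeffs))]
    return scan[:base_count], scan[base_count:]

def merge_layers(base, enhancement, n=8):
    """Rebuild the block; a lost enhancement layer (None) is treated as zeros."""
    if enhancement is None:
        enhancement = [0] * (n * n - len(base))
    block = [[0] * n for _ in range(n)]
    for value, (r, c) in zip(base + enhancement, zigzag_order(n)):
        block[r][c] = value
    return block

# Toy coefficient block, larger values at low frequencies.
coeffs = [[(8 - r) * (8 - c) for c in range(8)] for r in range(8)]
base, enh = split_layers(coeffs, base_count=6)
full = merge_layers(base, enh)          # both layers arrived
degraded = merge_layers(base, None)     # enhancement layer lost
```

When both layers arrive the block is rebuilt exactly; when the enhancement layer is lost the receiver still has a coarse version of the block, which a receiver-side reconstruction step could then refine.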
    The work on detecting and recognizing text in video streams comprises three parts: (1) detecting and locating text regions in the video stream; (2) detecting and segmenting character objects within a text region; and (3) recognizing the text. This thesis contributes to all three. From a study of the signal features of text in general video and images, we built a detection model for text regions and an implementation that combines the global and local texture characteristics of a text region, based on wavelet multi-resolution analysis and multi-scale blurring of the image. For character object detection, we developed a model for detecting, within a general set of objects, the subset that follows some regularity, and we propose a technique that detects character objects in the image from the regularity of their spatial distribution. A fast algorithm for character object detection that uses the distance generation matrix is introduced. For character recognition, we propose the concept of a 'rough skeleton' and a skeleton extraction technique that avoids thinning and proceeds in three steps: decomposing the character into components, extracting local skeletons, and connecting them into a global skeleton. These skeletons are then used, with the methods of graph theory, to form a description consisting of stroke features and stroke-structure features, and fuzzy recognition is introduced to achieve character recognition with excellent robustness.

