基于大字符集脱机手写体汉字识别方法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
脱机手写汉字识别在中文字符自动化处理和智能输入方面有着广泛的应用前景。由于手写汉字具有随意性、相似字多和字体形式多变等特点,使得脱机手写汉字识别成为字符识别领域的一个难点和热点。
     本文主要以脱机手写文本图像作为研究对象,研究了文本图像的二值化、汉字字符的分割和基于多特征多分类器融合的识别方法三个方面,寻求一个针对大字符集能较好区分相似特征的脱机手写体汉字识别方案。论文研究内容如下:
     (1)针对光照不均对文本图像二值化的影响,提出了一种基于边缘轮廓的自适应文档图像二值化方法。该方法基于log边缘轮廓生长的阈值化方法估计文本前景区域,有效的减少笔划丢失和断笔现象,同时解决前景估计时产生大块噪音的问题。其次,以局部区域背景灰度平均值和前景区域平均灰度值与当前位置像素的灰度差值为度量标准,引入抑制噪音的参数变量来改进阈值公式,进一步对噪音进行抑制。实验表明该方法有效地抑制了噪音,较好的保留了汉字结构的完整性。
     (2)针对手写体汉字中粘连或交叉字符的分割问题,本文提出一种基于最小加权分割路径的脱机手写汉字多步分割方法。该方法继承了以往粗分割和细分割相结合的思想,首先采用投影方法进行粗分割,将文本汉字分为粘连字符和非粘连字符两类;在细分割阶段,抛弃常用的串行模式分割思想,直接利用粗分割后的统计信息,来设置初始分割路径。并基于最短分割路径的思想,在初始分割路径的局部邻域内采用基于最小权值的算法搜索并修改分割路径,从而获得最佳的加权分割路径。实验证明该方法较好地解决了字符分割不足和多处粘连字符的分割问题,有效的提高了分割的准确率,且算法的时间复杂度较低。
     (3)为进一步提高大字符集汉字的识别率,本文将能反映上下文关系的基于词的级联隐马尔可夫训练模型用于解决相似字识别问题,并给出了一种与其相应的级联识别方法,尝试从识别分类器的角度提高相似字的识别率。然后利用不同分类器的优点,设计了一种结合词级联HMM的多特征多分类器集成方案,该方案使相似字和非相似字能自适应地选择合适的方法进行针对性识别,有效的提高了整体识别率。
The off-line handwritten Chinese character recognition has wide rang of applications, which is widely used in automated processing and intelligent input of Chinese characters. Handwritten Chinese characters are characterized with arbitrary writing, lots of similar characters and serious irregular variations of shapes. It makes that the research of off-line handwritten Chinese characters recognition is popular and difficult in the region of Chinese characters recognition.
     The document image is used as the object of research in this paper. We take a preliminary research on the document image binarization, handwritten Chinese characters segmentation and the recognition method based on multi-classifier fusion with multi-feature. The paper will attempt to find an off-line handwritten Chinese character recognition program with high accuracy.
     Firstly, for the binarization result is the impact of uneven object intensity and the BG algorithm is inefficient, an adaptive document image binarization method based on contour is put forward in this paper. The pixel growth method based on contour with Log’s operator is used to estimate foreground region of text image. It reduces the number of broken strokes and solves the problem of local noise. The new parameter variable is added to the threshold formula to suppress niose further. The experimental results show that the method is effective on suppressing noise and keeping the integrity of Chinese character structure.
     Secondly, a multi-step segmentation method was put forward in this paper to segment connected or overlapped Chinese character in ancient document. It inherited the fuzzy approach of the rough segmentation and fine segmentation. Firstly, the project profile histogram method was employed to obtain the no touching or overlapping characters from the separated blocks of the characters string. Then, for the touching characters in the wide blocks, the segmentation is performed by searching and modifying the segmentation path in the local neighborhood of initial segmentation path with minimum weight segmentation path algorithm, and the initial segmentation path was obtained according to the statistical data of rough segmentation. Experimental results show that the proposed method can solve the problem of insufficient segmentation characters and multiple touching character segmentation, the proposed method can improve the accuracy of handwritten Chinese character segmentation effectively, and the algorithm has a lower time complexity.
     Thirdly, in order to improve the large-set recognition rate, the cascaded HMM training algorithm is put forward in this paper to solve the similar character recognition. And cascaded recognition which is used to recognize similar character is also put forward to improve the similar character’s recognition rate. Then, according to the characteristics of different classifiers, a multi-classifier fusion method with multi-feature based on cascaded HMM is put forward to select appropriate recognition algorithm by character feature adaptively. Experimental results show that the proposed method increases effectively recognition rate effectively.
引文
[1]陈友斌,丁晓青.非特定人脱机手写汉字识别[J].中国人工智能网, http://ccce.gucas.ac.cn/news.php?id=919, 2005-3-5.
    [2] Hong-Wei Hao, Xu-Hong Xiao, Ru-Wel Dai. Handwritten Chinese Character Recognition by Metasynthetic APProach [J]. Pattern Recognition, 1997, 30(8): 1321-1328.
    [3]朱小燕,史一凡,马少平.手写体字符识别研究[J].模式识别与人工智能, 2000, 13(2): 174-180.
    [4]高彦宇,杨扬.脱机手写体汉字识别研究综述[J].计算机工程与应用, 2004, 4(20): 74-77.
    [5] Gao Yangyu, Yang Yang. Survey of Unconstrained Handwritten Chinese Character Segmention [J]. Computer Engineering, 2004, 30(5):144-146.
    [6]何志国.脱机手写体汉字识别综述[J].计算机工程, 2008, 15(34): 201-204.
    [7]孙华,张航.汉字识别方法综述[J].计算机工程, 2010, 36(20): 194-197.
    [8] Kato N, Suzuki M, Omachi S. A Handwritten Character Recognition System Using Directional Element Feature and Asymmetric Mahalannobis Distance [J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 1999, 21(3): 258-262.
    [9]刘伟,朱宁波,何浩智.基于弹性网格模糊特征的手写体汉字识别方法[J].中文信息学报, 2007, 21(3):117-121.
    [10]封筠,杨扬,朴春慧.一种手写相似汉字特征选择方法的研究[J].计算机工程, 2005, 12(13): 33-41.
    [11]喻莹,杨扬,董才林.基于动态特征选择的手写体相似汉字的识别[J].计算机工程, 2006, 17(32):10-12.
    [12] Cheng Lin Lin, Hiromichi Fujisawa. Classification and Learning Methods for Character Recognition: Advances and Remaining Problems [A]. Studies in Computational Intelligence: Machine Learning in Document Analysis and Recognition[C]. Springer Verlag, Berlin, Heidellerg, 2008: 139-1612.
    [13] Hee-seon Park, Seong-whan Lee. Off-line Recognition of Large Set Handwritten Character with Multiple Hidden Markov Models [J]. Pattern Recognition, 1996, 29(2): 231~244.
    [14]赵继印,郑蕊蕊,吴宝春,李敏.脱机手写体汉字识别综述[J].电子学报, 2010, 2(38): 405-415.
    [15]陈友斌,丁晓青,吴佑寿.非特定人脱机手写汉字识别[OL].中国计算机报, 1997.
    [16]徐磊,肖柏华,戴汝为,王春恒.一种面向大类别集的快速分类方法[J].计算机研究与发展, 2008, 45(4): 588-595.
    [17]应自炉,李景文,张有为.基于融合的多类支持向量机[J].计算机工程, 2009, 35(19): 187-188.
    [18] Liu Chenglin, Katsumi Marukawa. Pseudo Two-dimensional Shape Normalization Methods for Handwritten Chinese Character Recognition [J]. Pattern Recognition, 2005, 38(12): 2242-2255.
    [19]丁晓青.汉字识别研究的回顾[J].电子学报, 2002, 9(30): 1364-1368.
    [20]侯艳平.脱机手写体汉字识别研究[D].硕士论文,扬州大学, 2008.
    [21]哈明虎,田大增.视觉文档图像识别预处理[M].保定:河北大学出版社, 2007.
    [22] N.Otsu. A Threshold Selection Method from Gray Level Histograms [J]. IEEE Trans. Systems Man Cybernet, 1979, 9(1): 62-66.
    [23] Rais, N.B Hanif, M.S Tai. Adaptive Thresholding Technique for Document Image Analysis [C]. In: Proceedings of the 7th International Multitopic Conference. 2004: 61-66.
    [24] W.Niblack. An Introduction to Digital Image Processing [M]. Prentice Hall, Englewood Cliffs, NJ, 1986:115-116.
    [25] J.Sauvola, M.Pietikainen. Adaptive Document Image Binarization [J]. Pattern Recognition, 2000, 33: 255-236.
    [26] J. Bernsen, Dynamic Thresholding of Grey-level Images [C]. In: Proceedings of the Eighth International Conference on Pattern Recognition, 1986: 1251-1255.
    [27]龚声蓉,刘纯平,王强.数字图像处理与分析[M].北京:清华大学出版社, 2006.
    [28] B.Gatos, I.Pratikakis, S.J.Perantonis. Adaptive Degraded Document Image Binarzation [J]. Pattern Recognition, 2006, 39(10):317-327.
    [29] A. Jain, Fundamentals of Digital Image Processing [M]. Prentice-Hall, Englewood Cliffs, NJ, 1989.
    [30] Lu Y, Shridhar M. Character Segmentation in Handwritten Words: An Overview [J]. Pattern Recognition, 1996, 29(1): 77-96.
    [31] Casey R, Lecolinet E. A Survey of Method and Strategies in Character Segmentation [J]. IEEE Trans PAMI, 1996, 18(7): 690-706.
    [32] Tseng Lin Yu, Chen Rung Chins. Segmenting Handwritten Chinese Character Based on Heuristic Merging of Stroke Bounding Boxes and Dynamic Programming [J]. Pattern Recognition, 1998, 19(10): 963-973.
    [33]王嵘,丁晓青,刘长松.基于笔划合并的手写体信函地址汉字切分识别[J].清华大学学报(自然科学版), 2004, 44(4): 498-502.
    [34]赵宇明,江兴智,施鹏飞.基于笔划提取和合并的离线手写体汉字字符切分算法[J].红外与激光工程, 2002, 32 (1): 23-27.
    [35] Zhao S Y, Chi Z R, Shi P F. Two-Stage Segmentation of Unconstrained Handwritten Chinese Characters [J]. Pattern Recognition, 2003, 36(1): 145-156.
    [36]马瑞,杨静宇.一种有效的手写汉字多步分割方法[J].中国图象图形学报, 2007, 11(12): 2062-2067.
    [37] Xianghui Wei, Shaoping Ma, Yijiang Jin. Segmentation of Connected Chinese Characters Based on Genetic Algorithm [C]. In: Proceedings of the 8th International Conference on Document Analysis and Recognition, 2005: 645-649.
    [38] Wang, H.F., Liu, C.Y., Song, X.L., et al.: Parameters Self-adaptive Fuzzy Controller Based on Genetic Algorithm [C]. In: Proceedings of 2007 IEEE International Conference on Grey Systems and Intelligent Services, 2007: 952–956.
    [39] Tahera, K, Ibrahim, R.N., Lochert, P.B. Development of a Self-adaptive Genetic Algorithm [C]. In: Proceedings of the 7th International Conference on Intelligent Systems Design and Applications, 2007: 883-888.
    [40] Rui-rui Zheng, Ji-yin Zhao, Bao-chun Wu. Multi-step Off-Line Handwritten ChineseCharacters Segmentation with GA [J]. Fuzzy Information and Engineering, 2009, 2(62): 1-11
    [41]冯兵,丁晓青,吴佑寿. HMM方法识别脱机手写汉字[J].模式识别与人工智能, 2002, 1(15): 84-88.
    [42]童学锋,邓刚,柴佩琪.隐马尔可夫模型在脱机手写体汉字识别中的应用[J].计算机应用, 2002, 10 (22): 1-3.
    [43] Liu Wen-zhuang Li Jun-li. A Method for Off-line Handwritten Chinese Character Recognition Based on Hidden Markov Model [C]. In: Proceedings of the 11th Conference on System Simulation Technology and Applications, 2009: 821-827.
    [44] Jie Li, Jiaxin Wang, Yannon Zhao, Zehong Yang. A New Approach for Off-line Handwritten Chinese Character Recognition Using Self-adaptive HMM [J]. Intelligent Control and Automation, 2004, 5: 4165-4168.
    [45]文风春,金人超,肖枝洪.训练隐马尔可夫模型的BW-GA方法[J].计算机工程与应用, 2010, 46(17): 236-238.
    [46]尤庆成,朱俊株,邓昌葛,杜宏伟.改进的HMM-SVM混合模型手写汉字签名认证方法[J].小型微型计算机系统, 2010, 9(31): 1869-1872.
    [47]赵巍,刘家峰,唐降龙.基于部件HMM级联的联机手写体汉字识别方法[J].哈尔滨工业大学学报, 2004, 5(36): 570-573.
    [48]赵巍,刘家峰,唐降龙,吴锐.连续字符识别的级联HMM训练算法[J].计算机学报, 2007, 12(30): 2142-2149.
    [49]蔺志青,郭军.贝叶斯分类器在手写汉字识别中的应用[J].电子学报, 2002, 12(30): 1804-1807.
    [50] Fu Qiang, Ding Xiaoqing, Li Tongzhi, Liu Changsong. An Effective and Practical Classifier Fusion Strategy for Improving Hand Written Character Recognition [C]. In: Proceedings of the 9th International Conference on Document Analysis and Recognition, 2007: 1038-1042.
    [51]朱敏觉,朱宁波,袁异.一种多分类器集成的手写体汉字识别方法[J].计算机工程与科学, 2009, 4(31): 36-39.