用户名: 密码: 验证码:
联机手写维吾尔文字母与单词识别研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
联机手写维吾尔文字识别是模式识别领域的难题之一,也是一门综合性技术。近年来,随着智能手机、平板电脑、电子白板等移动终端设备的普及,联机手写输入做为一种自然、方便的输入方法,已经得到了高度重视,并在日常生活中得到了广泛应用。但是,新疆少数民族地区广泛使用的官方语言——维吾尔语的文字识别研究相对滞后,对联机维吾尔文手写识别技术的研究也非常少。维吾尔文字手写输入系统的研究不仅对其他民族文字手写输入系统的研究也有一定的参考价值,而且促进少数民族地区的信息技术发展。
     维吾尔语文字是一种拼音文字,其书写方式与汉文和西文有很大不同。维吾尔文是从右到左写,每个词中所有字母连着写,且每一个字母在一个词的词首、词中和词尾所取的字形不一样。这些特点给维吾尔语文字的识别带来很大的困难。本文在深入调研国内外联机手写识别技术的研究动态,并通过分析维吾尔文字母与单词自身的结构和书写特点,在联机手写维吾尔文字母识别、维吾尔文单词切分、维吾尔文单词识别等三方面做了一些有益的探索。主要工作包括以下几个方面:
     一、首次分别建立了手写维吾尔文字母样本库和手写单词样本库。为了支持维吾尔文手写识别方面的研究,收集整理了联机手写数据库。维吾尔文字母样本库包括32个维吾尔文字母的四种不同书写类型(128类),由612个人书写,总共包含78336个样本。手写单词样本库包含常用的1460的单词,由400个人书写,总共有584000个维吾尔文单词。该数据库可用于手写字符识别、文档检索和笔迹鉴别等多方面的研究。
     二、提出了一种联机手写维吾尔文字母识别方案。选择在手写汉字识别技术中所提出来的归一化、特征提取、以及常用的分类方法,从中找出了最佳的技术选择。在实验对比中,本文采用了线性归一化(LN)、矩归一化(MN)、二分矩归一化(BMN)、中心边界对准(CBA)、改进的中心边界对准(MCBA)以及几种相应的伪二维归一化方法等八种不同的归一化预处理方法,基于坐标归一化的特征提取(NCFE)方法,以及MQDF(改进的二次分类函数)、DLQDF(判别学习型二次判别函数)、LVQ(学习矢量量化)、SVM(支持向量机)四种分类器。同时,再考虑字符在文档中的空间几何特征,进一步提高了识别性能。在128个维吾尔文字母类别、12,800个测试样本的实验中,正确识别率最高达到了89.08%,为进一步研究面向维吾尔文字母特性的识别技术奠定了重要基础。
     三、设计了一个基于DTW的联机手写维吾尔文字母识别系统框架。对预处理、特征提取、聚类分析和匹配识别等模块进行了较系统地理论和实验研究。在预处理中,采用了维吾尔文字母的线性归一化和基于点密度的一化的方法;特征提取中,采用了结合结构特征和统计特征的方法并提取了均匀采样特征、方向链表特征、网格方向特征、投影特征等;聚类分析中中采用基于最小生成树(MST)的动态聚类算法,分类器采用最近邻分类方法。测试结果表明,独立式、前连式、后连式、双连式字母的识别率分别达到74.67%,70.43%,63.33%,72.02%。出现在5个候选字母中的识别率达到94.34%,94.19%,93.15%,95.86%;
     四、提出了基于多分类器集成维吾尔文字母识别方法。多分类器组合能够在一定程度上弥补单个分类器的缺陷,因此它在模式识别中得到了广泛的应用。本文利用5种不同的特征提取方法构造了5个独立的分类识别器,采用了等权投票和不等权投票等两种策略将5种分类识别器进行了有效组合。其中,单分类器采用了基于动态时间弯折匹配距离的最近邻分类方法。实验结果表明,本文提出的集成策略的识别率明显高于单分类器的识别率,而且为特征的综合集成提供了多种有效途。
     五、通过对手写维吾尔文字中的字母连接特点深入研究,提出了一种有效的基于动态规划的联机手写单词分别方案。首先,对单词进行过切分。去掉单词中的附件部分后,通过分析主要笔划书写轨迹的形状,找出潜在的过分割点并合并被切分成的基本块与对应它的附加部分,得到基本字母片段序列;然后,再利用单字母分类信息和基于切分块的几何信息进行融合,采用动态规划算法来进行评价分析,从而寻找出最优的分割路径。对于联机单词样本进行的实验证明,该文所提出的算法对于维吾尔文单词的分割有很好的效果。
     六、提出了基于多步分割的维吾尔文单词切分方法。本文采用了先把单词分割成连体段,然后把连体段切分成字母的两级切分方案,并针对连体段分割问题和字母切分问题分别提出了连体段分割算法和字母切分算法。
     七、提出了基于词典驱动的、集成切分与识别的联机手写维吾尔文单词识别框架和方法。系统中把单词识别问题转化为一个词典中的词条与手写单词图像匹配的优化问题。维吾尔文单词识别的主要难点是字母在被识别之前不能准确地切分。解决方法是把字母分割与识别集成起来,通过组合搜索得到最优的切分和识别结果。第一步,利用过切分算法将单词进行切分,对相邻的基本片段进行组合形成切分候选网格。第二步,采用词典驱动的方法,将字母识别信息、几何信息和词典信息一起加入到单词识别系统的路径匹配过程。本文采用用置信度转换的方法,将分类器的输出转换成概率的形式,使参数调整更为方便。动态规划算法应用于单词识别过程中的最优路径匹配过程,得到最优识别结果。词典中的词条分别为100、500、1000、10000的情况下,单词识别实验结果分别为84%、78%、68%和47%。
On-line Handwritten Character Recognition is one of challenging important topic in the field of pattern recognition, but also a comprehensive technology. In recent years, with the increasing use of mobile devices such as cell phone, tablet computers and digital pen, on-line handwriting input as a natural, convenient has been attached great importance to and has been widely used in daily life. Uyghur language is an official language and very popular in the Xinjiang province of China. However, the research about the technology for Uyghur handwritten recognition is lagging much behind and little work has been done in this area. The research of recognition techniques for online handwritten Uyghur characters are not only has great reference values for starting the research of other ethnic group's handwritten scripture recognition, but also has a far-reaching meaning about developing the information technology and national culture of specific ethnic group.
     Uyghur words are formed by concatenation of the characters, which has a very special written structure different from Chinese and English characters. It is written from right to left; every letter may have different shapes in different positions. All these characteristics bring many difficulties to recognition. In this paper, through in-depth study the research trend of on-line handwriting recognition technology in domestic and abroad, based on analysis of the unique shape and writing styles of Uyghur characters, proposes an approach for online handwritten Uyghur character and word recognition. The major contributions of this dissertation are as follows:
     1. A handwritten Uyghur character database and a handwritten word database are established for the first time respectively. In order to support the research of Uyghur handwritten recognition, we collected the samples of online Uyghur handwriting. The datasets of Uyghur characters contain78,336samples of128classes (including four different types of32characters set), are handwritten by612volunteers includes students and teachers. The datasets of Uyghur words contain584,000samples of commonly used1460words, individually are handwritten by400volunteers also includes students and teachers. The database can be used for typical research tasks of handwritten document analysis such as handwritten recognition, handwritten document retrieval and writer identification etc.
     2. An online handwritten Uyghur characters recognition framework have been presented. We evaluate various techniques of normalization, feature extraction and classification that have been successfully applied in handwritten Chinese character recognition. Specifically, we use eight normalization techniques such as liner normalization (LN), moment normalization (MN), bi-moment normalization (BMN), Centroid-boudary alignment (CBA) and several corresponding pseudo2D normalization methods. We use the normalization cooperated feature extraction (NCFE) method with different settings. For classification, we use four classifiers, namely, the modified quadratic discriminant function (MQDF), the discriminative learning quadratic discriminant function (DLQDF), the learning vector quantization (LVQ) classifier, and the support vector classifier with RBF kernel (SVC-rbf). Furthermore, the geometric features which characterizing the spatial context in handwritten documents are extracted for enhance the recognition performance. In experiments on38,400test samples of128classes, the proposed approach achieved an accuracy of89.08%.
     3. we designed a framework for online handwritten Uyghur characters recognition system based on DTW and carried out a more systematic theoretical and experimental research on its'module, such as pre-processing, feature extraction, Cluster analysis and classifier. In the pre-processing, in order to obtain the structure information of characters, according to handwritten Uyghur character's feature, we use linear normalization and nonlinear normalization based on dot density method. Taking into account the more similar characters in the Uyghur language, use the feature extraction method of combined with the structural features and statistical features, such as uniform sampling feature, direction feature, grid direction density feature, two directional projection feature. Cluster analysis use the dynamic clustering algorithm based on the minimum spanning tree (MST), and classifier use the nearest neighbor matching classification. The experimental testing has been carried out and the results show that over-all recognition rate for four different character shapes is respectively74.67%,70.42%,63.33%,72.02%; the recognition rate for the handwritten characters which are recognized as in one of the two candidate characters are respectively86.85%,86.09%,80.43%,88.41%, and one of the five candidate characters are94.34%,94.19%,93.15%,95.86%respectively.
     4. And an online handwritten Uyghur characters recognition method based on the integration of multiple classifiers have been presented. Combination of multiple classifiers, a certain extent, compensate for defects of a single classifier, so it has been widely applied in pattern recognition. In our research, we applies five different feature extraction methods to construct five separate classifier and using voting strategy of ranging from rights to effective combined five kinds of classifier. Each classifier use the nearest neighbor classification method based on dynamic time bending matching distance. Experimental results show that the recognition rate based on integration strategy is significantly higher than the recognition rate of separate classifier, and it also provide a variety of effective ways for the comprehensive integration of features.
     5. To be enabled to separate the many connected characters in cursive Uyghur handwriting, we present a novel character segmentation method using dynamic programming. Firstly, after removing delayed strokes from the handwritten words, potential breakpoints are detected from concavities and ligatures by temporal and shape analysis of the stroke trajectory. Reconstruct delayed strokes and obtained a sequence of primitive segments. Then, by merging the neighboring blocks, create candidate segmentation paths. Then paths were evaluated by the character recognition and geometric information, and a dynamic programming method is applied to find the best segmentation point for each character. Our preliminary experiments on an online Uyghur word dataset demonstrate that the proposed method can achieve good performance in segmenting cursive handwritten Uyghur characters.
     6. On the issues of characters segmentation, we adopted the two level segmentation scheme in which the word segmented into conjoined section firstly, and then the conjoined section cut into characters in the next steps. We put forward conjoined section segmentation algorithm and characters segmentation algorithm for conjoined section segmentation problem includes characters segmentation problem.
     7. The online handwritten Uyghur word recognition approaches based on a lexicon-driven, integrated segmentation and recognition have been presented. Word recognition problem is transformed into matching optimization problems between the dictionary entry and the handwritten word image. There are many connected characters in cursive Uyghur writing, which makes the segmentation and recognition of Uyghur words very difficult. The solution is using of integrating the segmentation and recognition method to obtain the optimal segmentation and recognition results came from combined search. The first step, using of the over-segmentation algorithm to word separation, formed the segmentation candidate grids by combining adjacent fragments. In the second step, using lexicon-driven approach, combined with character recognition information, geometric information and dictionary information into path matching procedure in the word recognition system. Our preliminary experiments on an online Uyghur word dataset demonstrate that the proposed method can give high recall rate of segmentation point detection. Then using the confidence transformation method convert the similarity scores into probabilities, such that the tuning of weighting parameters becomes easier. Dynamic matching between characters in the lexicon entry and segment(s) of the input word image is used to ranking the lexicon entries in order to get best match. As the result the performance for lexicons of size100,500,1000,10000are84%、78%、68%and 47%respectively.
引文
[1]R. Plamondon, N.Srihari. On-line and off-line handwriting recognition. a comprehensive survey. IEEE transaction on pattern analysis and machine intelligence.2000,22(1):63-85
    [2]C.-L. Liu, S. Jaeger, M. Nakagawa, Online recognition of Chinese characters:the state-of-the-art, IEEE Trans. Pattern Anal. Mach. Intell.,2004,26(2):198-213.
    [3]C.C. Tappert, C.Y. Suen, T. Wakahara. The state of the art in on-line handwriting recognition [J]. IEEE Trans Pattern Anal Machine Intell.,1990,12(8):787-808.
    [4]M. Cheriet, N. Kharma, C.-L. Liu, and C.Y. Suen. Character Recognition Systems:A Guide for Students and Practitioners., JOHN WILEY & SONS, INC.,2007.
    [5]K.Hinckley, K.Yatani, M.Pahud, N.Coddington, J.Rodenhouse, A.Wilson, H. Benko, and B. Buxton. Pen + touch = new tools. In Proc. of the 23nd Annual ACM Symposium on User Interface Software and Technology(UIST),2010:27-36,.
    [6]买买提·沙地克.基础维吾尔语[M].乌鲁木齐:新疆人民出版社,1992
    [7]哈力木拉提,阿孜古丽.多字体印刷维吾尔文字符识别系统的研究与开发.计算机学报,2004,27(11):1480-1484
    [8]万芳.联机手写维吾尔文字识别技术的研究与实现.新疆大学硕士学位论文,2007:45-49
    [9]Yoshida K, Sakoe H. Online handwritten character recognition for a personal computer system. IEEE Trans on Consumer Electronics,1982, CE-28(3):202-208
    [10]K.C. Santosh, C. Nattee,:A Comprehensive Survey on Online Handwriting Recognition Technology and Its Real Application to the Natural Handwriting. Kathmandu University Journal of Science, Engineering and Technology,2009
    [11]R.G. Casey, E. Lecolinet, A survey of methods and strategies in character segmentation, IEEE Trans. Pattern Analysis and Machine Intelligence,1996,18(7):690-706.
    [12]邹明福,钮兴昱,刘昌平,白洪亮.联机手写英文识别.计算机研究与发展,2006,43(1):138-144
    [13]Fujisawa H, Nakano Y, Kurino K. Segmentation methods for character recognition:from segmentation to documents tructure analysis.Proc.IEEE,1992,80(7):1079-1092.
    [14]M. Pastor, A. Toselli, E. Vidal, Writing speed normalization for on-line handwritten text recognition, ICDAR'05, Seoul, Korea,2005,2:1131-1135.
    [15]C.-L. Liu. Handwritten Chinese character recognition:effects of shape normalization and feature extraction. SACH06, Maryland, USA,2006.
    [16]C.-L. Liu, Marukawa. Global shape normalization for handwritten Chinese character recognition: A new method. Ninth International Workshop on Frontiers in Handwriting Recognition (IWFHR'04), Tokyo, Japan,2004, pp.300-305
    [17]C.-L. Liu, M. Koga, H. Sako, H. Fujisawa, Aspect ratio adaptive normalization for handwritten character recognition, in: T. Tan, Y. Shi, W. Gao, Editors, Advances in Multimodal Interfaces—ICMI 2000, Lecture Notes in Computer Science, Vol.1948, Springer, Berlin,2000, pp.418-425.
    [18]C.-L. Liu, X.-D. Zhou, Online Japanese character recognition using trajectory-based normalization and direction feature extraction, IWFHR'06, La Baule, France,2006.
    [19]O.D. Trier, A.K. Jain, T. Taxt, Feature extraction methods for character recognition - a survey, Pattern Recognition,1996,29 (4):641-662.
    [20]C.-L. Liu, Y. -J. Liu, R. -W. Dai, Processing and statistical /structural feature extraction for handwritten numeral recognition, in:A. C. Downton, S. Impedovo (Eds.), Progress of Handwriting Recognition, World Scientific, Singapore, Pattern Recognition,1997, pp.161-168
    [21]M. Hamanaka, K. Yamada, J. Tsukumo, Normalization-cooperated feature extraction method for handprinted Kanji character recognition, Proceedings of the Third International Workshop on Frontiers of Handwriting Recognition, Buffalo, NY,1993, pp.343-348.
    [22]C.-L. Liu, K. Nakashima, H. Sako, H. Fujisawa. Handwritten digit recognition:investigation of normalization and feature extraction techniques. Pattern Recognition,2004,37(2):265-279.
    [23]F. Kimura, K. Takashina, S. Tsuruoka, Y. Miyake, Modified quadratic discriminant functions and the application to Chinese character recognition, IEEE Trans. Pattern Analysis and Machine Intelligence, 1987,9(1):149-153.
    [24]C.-L. Liu, H. Sako, H. Fujisawa. Discriminative learning quadratic discriminant function for handwriting recognition. IEEE Trans. Neural Networks,2004,15(2):430-434.
    [25]Mangasarian, O.-L., Musicant, D.-R., Successive overrelaxation for support vector machines, IEEE Trans. Neural Networks,1999,10(5):1032-1037.
    [26]Jin, X., Liu, C.-L., Hou, X., Regularized margin-based conditional loglikelihood loss for prototype learning, Pattern Recognition,2010:2428-2438.
    [27]李元祥,丁晓青,吴佑寿.一种基于字词结合的汉字识别上下文处理新方法.计算机研究与发展,2002,39(7):832-837
    [28]谢旭东,丁晓青,彭良瑞,刘长松.一个基于混合语言模型的日文识别后处理系统.计算机工程应用,38:014,2002,68-72
    [29]F. Yin, Q.-F. Wang, C.-L. Liu, Integrating geometric context for text alignment of handwritten Chinese documents, Proc. 12th ICFHR,2010, pp.7-12.
    [30]Xiang-Dong Zhou, Jin-Lun Yu, Cheng-Lin Liu, etal. Online handwritten Japanese character string recognition incorporating geometric context. Proc. Ninth Int'l Conf. Document Analysis and Recognition,Curitiba,Brazil,2007.
    [31]吴佑寿,丁晓青.汉字识别原理方法与实践.北京:高等教育出版社,1993
    [32 S.Marukatat, T.Artieres,B.Dorizzi, P.Gallinari. Sentence Recognition Through Hybrid Neuro-Markovian Modeling. Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Seattle, Washington, US.2001:731-735
    [33]T.Artieres, P.Gallinari, H.Li, S.Marukatat, B.Dorizzi. From Character to Sentences:A Hybrid Neuro-Markovian System for On-line Handwriting Recognition, In "Hybrid Methods in Pattern Recognition," H. Bunke, A. Kandel (eds), World Scientific Publ. Co. Parution courant 2001:1-27
    [34]Rohini k. Srihari, Charlotte M. Baltus. Incorporating Syntactic Constraints in Recognizing Handwritten Sentences. Proceedings of International Joint Conference on Artificial Intelligence (IJCAI-93), Chambery, France, August 1993:1262-1267
    [35]Alessandro L. Koerich, Robert Sabourin, Ching Y. Suen. Fast Two-level Viterbi Search Algorithm for Unconstrained Handwriting Recognition. ICASSP, Orlando, USA, May,2002:3537-3540
    [36]Fadi Biadsy, Jihad EL-Sana, Nizar Habash. Online Arabic Handwriting Recognition Using Hidden Markov Models. DB/OL. http://www.neuroscript.net,2006
    [37]Cheng-Lin Liu, Stefan Jaeger, Masaki Nakagawa. Online Recognition of Chinese Characters:The State-of-the-Art. IEEE Transactions on Pattern Analysis and Machine Intelligence,Vol.26, No.2, February 2004,198-213
    [38]Cheng-Lin Liu, Kazuki Nakashima, Hiroshi Sako, et al. Handwritten digit recognition:benchmarking of state-of-the-art techniques. Pattern Recognition 36,2003,2271-2285
    [39]M. Nakagawa et al. Robust and Highly Customizable Recognition of On-Line Handwritten Japanese Characters. Proc.13th Int'l Conf. Pattern Recognition, vol.3,1996,269-273
    [40]Fadi Biadsy, Jihad EL-Sana, Nizar Habash. Online Arabic Handwriting Recognition Using Hidden Markov Models. DB/OL. http://www.neuroscript.net,2006
    [41]M.S.Khorsheed. Recognition handwritten Arabic manuscript using a single hidden Markov model [J]. Pattern Recognition Letters.2003,24(3):2235-2242
    [42]Abdolah Chalechale, Golshah Naghdy, Prashan Pramaratne. Arabic/Persian Cursive Signature Recognition and Verification Using Line Segment Distribution [J]. IEEE 2004,04(2):475-476
    [43]A. Ennaji, M. Sellami, A. Benouareth. Arabic Handwritten Word Recognition Using HMMs with explicit State Duration [J]. Journal on Advances in Signal Processing.2008,10(11):1155-1168
    [44]K.C. Santosh, C. Nattee,:A Comprehensive Survey on Online Handwriting Recognition Technology and Its Real Application to the Natural Handwriting. Kathmandu University Journal of Science, Engineering and Technology,2009
    [45]S. Jakob, M. Jonas, A. Jonas, Christer Friberg. On-line Arabic handwriting recognition with templates. Pattern Recognition,2009, pp:3278-3286.
    [46]哈力旦,伊力哈木·亚尔买买提,库尔班·买提木沙.复杂背景下维吾尔文字母的分割算法[J].计算机工程与应用.2007年,第43卷,第20期:163-166
    [47]玉素甫等.笔式维吾尔文识别中的文字切分研究[C].全国第11届民族语言文字信息专业委员会学术会议.2007.1:86-89.
    [48]袁保社,吾守尔·斯拉木.一种手写维吾尔文字母识别算法[J].计算机工程.2010,36(2):186-188.
    [49]Kurban Ubul etc.A Feature Selection and Extraction Method for Uyghur handwriting Based Writer Identification.International Conference on Computational Intelligence and Natural Computing.June 2009,2:345-348.
    [50]阿力木江·亚森,哈力木拉提·买买提.维吾尔文联机手写识别的预处理和特征提取.新疆大学学报(自然科学版).2010,27(2):232-241.
    [51]A.Ymin,Y.Aoki.On the Segmentation ofMulti-Front Printed Uyghur Scripts.13th International Conference on Pattern Recognition(ICPR'96),Aug 1996,Vol.3,No.5:215-219.
    [52]Hong, C., Gareth, L., Wu, Y.M.:Segmentation and Recognition of Continuous Handwriting Chinese Text, Int. J. Pattern Recognition and Artificial Intelligence,12(2):223-232(1998)
    [53]I. Guyon, L. Schomaker, R. Plamondon, M. Liberman, and S. Janet. U-NIPEN project of online data exchange and recognizer benchmarks. InProc.12th ICPR,1994:29-33,.
    [54]K. Matsumoto, T. Fukushima, and M. Nakagawa. Collection and analysis of on-line handwritten Japanese character patterns. In Proc. 6th ICDAR,Seattle, USA,2001:496-500.
    [55]L.Jin, Y.Gao, G.Liu, Y.Li, and K.Ding. SCUT-COUCH2009—a comprehensive online unconstrained Chinese handwriting data base and benchmark evaluation. International Journal on Document Analysis and Recognition (IJDAR),14(1):53-64,2011.
    [56]S.Zhou, Q.Chen, and X.Wang. HIT-OR3C:an opening recognition corpus for Chinese characters. In Proc.9th IAPR Int. Workshop on DAS, Boston, USA,2010:223-230.
    [57]胡家忠.计算机文字识别技术[M].气象出版社,1994
    [58]C.-L. Liu, K. Nakashima, H. Sako, H. Fujisawa. Handwritten digit recognition:investigation of normalization and feature extraction techniques. Pattern Recognition,2004,37(2):265-279.
    [59]M. Hamanaka, K. Yamada, J. Tsukumo, Normalization-cooperated feature extraction method for handprinted Kanji character recognition, Proceedings of the Third International Workshop on Frontiers of Handwriting Recognition, Buffalo, NY,1993:343-348.
    [60]Liu Chenglin, Liu Yingjian, Dai Ruwei. Processing and statistical/structural feature extraction for handwritten numeral recognition, in:Downton A C, Impedovo S. Editors. Progress of Handwriting Recognition, World Scientific, Singapore, Pattern Recognition,1997:161-168
    [61]Josef Kittler,Mohamad Hatef,Robert P.W.Duin and Jiri Matas. On Combining Classifiers[J]. Pattern Analysis and Machine Intelligence,1998, (20):226-239.
    [62]Sharma A.Paliwalk K,Onwubolug C.Class2dependent PCA, MDC and LDA:A combined classifier for pattern classification[J]. Pattern Recognition,2006,39(7):1215-1229.
    [63]肖旭红,戴汝为.一种识别别手写汉字的多分类器集成方法,自动化学报,1997,23(5):10-20
    [64]征荆,丁晓青,吴佑寿.基于最小代价的多分类器动态集成,计算机学报,编辑部邮箱1999年2期
    [65]姚正斌,丁晓青,刘长松;基于笔划合并和动态规划的联机汉字切分算法,清华大学学报(自然科学版),2004,10:76-84
    [66]吴敏清,黄建成.一种用于手写体汉字识别的候选字加权多分类器集成方法[J].计算机工程,2001,27(3):13-15
    [67]燕继坤,郑辉,王艳,曾立君.基于可信度的投票法[J]。计算机学报,2005,28(8):1308-1313
    [68]CaseyR, LeeolinetE.A survey of methods and strategies in character segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence,1996,18(7):690-706P
    [69]Casey, R.G., Lecolinet, E.:A survey of methods and strategies in character segmentation, IEEE Trans. Pattern Anal. Mach. Intell.,18(7):690-706 (1996).
    [70]Benouareth, A., Ennaji, A., Sellami, M.:Arabic handwritten word recognition using HMMs with explicit state duration, Journal on Advances in Signal Processing,1-13 (2008)
    [71]Cheung, A., Bennamoun, M., Bergmann, N.W.:A new word segmentation algorithm for Arabic script, DICTA:Digital Imaging Comput. Tech. Appl,431-435 (1997).
    N. Arica and F. T.Yarman-Vural. An overview of character recognition focused on on-line handwriting. IEEE Trans. System Man Cybernet. Part C,2001,31 (2):216-233.
    [72]Lu, Y., Shridhar, M.:Character segmentation in handwritten words-an overview, Pattern Recognition,29(1):77-96 (1996).
    [73]Q. Fu, X. Ding, C. Liu, Y. Jiang, and Z. Ren. A Hiddern Markov Model based segmentation and recognition algorithm for Chinese handwritten address character strings. In Proc.8th ICDAR,2005: 590-594.
    [74]T.-H. Su, T.-W. Zhang, D.-J. Guan, and H.J. Huang. On-line recognition of realistic Chinese handwriting using segmentation-free strategy. Pattern Recognition,2009,42(1):167-182.
    [75]T.H.Su, T.W.Zhang, andD.J.Guan. Corpus-based HIT-MW databasefor o?ine recognition of general-purpose Chinese handwritten text. International Journal on Document Analysis and Recognition (IJDAR),2007,10(1):27-38,2007.
    [76]苏统华,张田文,黄虎杰.基于HMM识别器的连续手写汉字切分方法.In 2007年全国模式识别学术会议论文集,北京,2007:306-313.
    [77]H. Murase. Online recognition of free-format Japanese handwritings. In Proc.9th ICPR,1988: 1143-1147.
    [78]Q. Fu, X. Ding, T. Liu, Y. Jiang, and Z. Ren. A novel segmentation and recognition algorithm for Chinese handwritten address character strings. In Proc.18th ICPR,,2006:974-977.
    [79]X.-D. Zhou, C.-L. Liu, and M. Nakagawa. Online handwritten Japanese character string recognition using conditional random fields. In Proc.9th ICDAR, Barcelona, Spain,2009:521-525.
    [80]Q.-F. Wang, F. Yin, and C.-L. Liu. Integrating language model in handwritten Chinese text recognition. In Proc.10th ICDAR, Barcelona, Spain,2009:1036-1040.
    [81]B. Zhu, X.-D. Zhou, C.-L. Liu, and M. Nakagawa. A robust model for on-line handwritten Japanese text recognition. International Journal on Document Analysis and Recognition (IJDAR),2010, 13(2):121-131,.
    [82]Q.F. Wang, F. Yin, C.L. Liu, "Handwritten Chinese text recognition by integrating multiple contexts", IEEE Trans. Pattern Anal. Mach. Intell.,34(8),2012:1469-181.
    [83]N.-X. Li and L.-W. Jin. A Bayesian-based method of unconstrained handwritten o?ine Chinese text line recognition. International Journal on Document Analysis and Recognition (IJDAR),,2012:1-15.
    [84]Cheng-Lin Liu, Hiroshi Sako, Hiromichi Fujisawa. Effects of classifier structures andtraining regimes on integrated segmentation and recognition of handwritten numeral strings.IEEE Trans. Pattern Analysis and Machine Intelligence,2004,26(11):1395-1407
    [85]Xiang-Dong Zhou, Jin-Lun Yu, Cheng-Lin Liu, et al. Online handwritten Japanese characterstring recognition incorporating geometric context. Proc. Ninth Int'l Conf. DocumentAnalysis and Recognition, Curitiba, Brazil,2007.
    [86]R.G. Casey and E.Lecolinet. A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Mach. Intell.,1996,18(7):690-706.
    [87]H. Fujisawa and Y. Nakano amd K. Kurino. Segmentation methods for character recognition:from segmentation to document structure analysis. Proc. IEEE,1992,80(7):1079-1092.
    [88]C.J.C.Burges,O.Matan,Y.LeCun,J.S.Denker,L.D.Jackel,C.E.Stenard C.R.Nohl, andJ.I.Ben. Shortestpathsegmentation:a method for train in a neural network to recognize character strings. InProc. of the International Joint Conference on Neural Networks,1992:165-172.
    [89]Y. Saifullah and M.T. Manry. Classification-based segmentation of ZIP codes. IEEE Trans. System, Man, and Cybernetics,1993,23(5):1437-1443.
    [90]姚正斌,丁晓青,刘长公.基于笔划合并和动态规划的联机汉字切分算法.清华大学学报(自然 科学版),2004,44(10):31-36.
    [91]T.H. Su, T.W. Zhang, H.J. Huang, Y. Zhou, HMM-based recognizer with segmentation-free strategy for unconstrained Chinese handwritten text, Proceedings of the Ninth International Conference on Document Analysis and Recognition, Curitiba, Brazil,2007:133-137.
    [92]T.H. Su, T.W. Zhang, D.J. Guan, H.J. Huang, Off-line recognition of realistic Chinese handwriting using segmentation-free strategy, Pattern Recognition.2009:167-182.
    [93]L.R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE,1989,77(2):257-285.
    [94]刘刚,丁晓青,彭良瑞等.多知识综合判决的字符切分算法.计算机工程与应用.2002,(17):59-62
    [95]韩智,刘昌平,殷绪成.手写中文信封的地址行字符切分算法.中文信息学报.2006,20(1):85-90
    [96].Masaki Nakagawa, Bilan Zhu, Motoki Onuma. A Model of On-line Handwritten JapaneseText Recognition Free from Line Direction and Writing Format Constraints. IEICE Trans. Inf.& Syst. vol. E88- D, no.8, August 2005,1815-1822
    [97]Javad Sadria, ChingY. Suen, TienD. Bui. A genetic framework using contextual knowledge for segmentation and recognition of handwritten numeral strings. Pattern Recognition,2006,40(4): 898-919
    [98]Cheng-Lin Liu, Masashi Koga, Hiromichi Fujisawa. Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading. IEEE Trans.Pattern Analysis and Machine Intelligence,2002,24(11):1425-1437.
    [99]M. Koga, R. Mine, H. Sako, et al. Lexical Search Approach for Character-String Recognition. Document Analysis Systems:Theory and Practice, S.-W. Lee and Y. Nakano eds., Springer,1999, 115-129
    [100]N. Arica and F. T.Yarman-Vural. An overview of character recognition focused on on-line handwriting. IEEE Trans. System Man Cybernet. Part C,31 (2):216-233,2001.
    [101]T. Steinherz, E. Rivlin, and N. Intrator. Online cursive script word recognition-a survey. International Journal on Document Analysis and Recognition (IJDAR),2(2):90-110,1999.
    [102]A.Vinciarelli. A survey on on-line cursive word recognition. Pattern Recognition,2002,35(7): 1433-1446,
    [103]C.-L. Liu, S.Jaeger, and M.Nakagawa. Online handwritten Chinese character recognition:The state of the art. IEEE Trans. Pattern Anal. Mach.Intell.,2004,26(2):198-213.
    [104]MoriS, SuenCY, Yamamoto K. HIStorical Review of OCR Research and Development. Proceedings of the IEEE.1992,80(7):1029-1058
    [105]Somaya Al-Maadeed,Colin Higgens,and Dave Elliman. Recognition of off-line handwritten Arabic words using hidden Markov model approach.Proceedings of the 16th International Conference on Pattern Recognition,2002:481-484
    [106]Yousef Al-Ohali,Mohamed Cheriet,and Ching Y.Suen.Introducing termination probabilities to HMM.Proceedings of the 16th International Conference on Pattern Recognition,2002:319-322.
    [107]J.JIN.Printed arabic document recognition system.Vision GeometryⅩⅢ,Proceedings of the SPIE,2004:48-55
    [108]Dehghan,M.and Faez,K.and Ahmadi,M.and Shridhar,M..Holistic handwritten word recognition using discrete HMM and self-organizing feature map,Systems,Man,and Cybernetics,2000 IEEE International Conference on,2000:2735-2739
    [109]Chunheng Wang, Y. Hotta, M. Suwa, et al. Handwritten Chinese address recognition. InFrontiers in Handwriting Recognition,2004. IWFHR-9 2004. Ninth International Workshopon, volume 1, Tokyo,Japan,2004,539-544
    [110]张重阳,徐勇,娄震,等.中文日期词的分割与识别.计算机研究与发展,2007,44(12): 2085-2091
    [111]Jianying Hu, M.K. Brown, and W.Turin. HMM based online handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell.,1996,18(10):1039-1045.
    [112]V. Margner and H. El Abed. Databases and competitions:strategies to improve Arabic recognition. In:Arabic and Chinese Handwriting Recognition, S. Jaeger and D. Doermann(Eds.), LNCS, Springer,2008,47:82-103.
    [113]Volker M"argner and Haikal El Abed. ICDAR 2011 - Arabic handwriting recognition competition. In Proc.11th ICDAR, pages 1444-1448, Beijing, China,2011.
    [114]Khaled El Gowely,Ossama El Dessouki,and Ahmed Nazif.Multi-phase recognition of multi-font photoscript Arabic text.Proceedings of the 10th International Conference on Pattern Recognition, 1990:700-702
    [115]Mohamed Fakir,and Chuichi Sodeyama.Machine recognition of Arabic printed scripts by dynamic programming matching method.IEICE Transactions on Information and Systems,1993, E76-D(2): 235-242
    [116]H.Y.Abdelazim,A.M.Mousa,Y.R.Salih,and M.A.Hashish.Arabic text recognition using a partial observation approach. Proceedings of the 12th National Computer Conference,1990:427-437
    [117]M.C.Fehri,and M.Ben Ahmed.A new approach to Arabic character recognition in multi-font documents. Proceedings of the 4th International Conference and Exhibition on Multi-lingual Computing(Arabic and RomanScript),1994:251-257
    [118]Volker M?rgner.SARAT-A system for the recognition of Arabic printed text. Proceedings of the 1 lth International Conference on Pattern Recognition,1992,11:561-564.
    [119]Gasser A.Auda,and Hazem Raafat.An automatic text reader using neural networks. Proceedings of the Canadian Conference on Electrical and Computer Engineering,1993,1:92-95.
    [120]Mohammad B.Menhaj,and Mahdi Adab.Simultaneous segmentation and recognition of Farsi Latin printed texts with MLP.Proceedings of the International Joint Conference on Neural Networks,2002,11:1534-1539.
    [121]Cheng-Lin Liu, Hiroshi Sako, Hiromichi Fujisawa. Effects of classifier structures andtraining regimes on integrated segmentation and recognition of handwritten numeral strings.IEEE Trans. Pattern Analysis and Machine Intelligence,2004,26(11):1395-1407
    [122]M. Cheriet, N. Kharma, C.L. Liu, C.Y. Suen, Character Recognition Systems:A Guide for Students and Practitioners, JOHN WILEY & SONS, INC.,2007.
    [123]C.L. Liu, M. Koga, H. Fujisawa, Lexicon-driven segmentation and recognition of handwritten character strings for Japanese address reading. IEEE Trans. Pattern Anal. Mach. Intell.24(11) (2002) 1425-1437.
    [124]L.-L. Ma, C.-L. Liu, Special radical detection by statistical classification for on-line handwritten Chinese character recognition, accepted by ICFHR2010.
    [125]T. Fukushima, M. Nakagawa, On-line writing-box-free recognition of handwritten Japanese text considering character size variations, Proceedings of the 15th International Conference on Pattern Recognition, Barcelona, Spain,2000, vol.2, pp.359-363.
    [126]X.Q. Ding, H.L. Liu, Segmentation-driven offline handwritten Chinese and Arabic script recognition. Proceedings of the Summit on Arabic and Chinese Handwriting, College Park, USA, 2006, pp.61-73.
    [127]X. Gao, P.M. Lallican, C. Viard-Gaudin, A two-stage online handwritten Chinese character segmentation algorithm based on dynamic programming, roceedings of the Eighth International Conference on Document Analysis and Recognition, Seoul, Korea,2005:735-739.
    [128]S. Senda, K. Yamada, A maximum-likelihood approach to segmentation-based recognition of unconstrained handwriting text, Proceedings of the Sixth International Conference on Document Analysis and Recognition, Seattle, WA,2001, pp.184-188.
    [129]Cheng-Lin Liu, Hongwei Hao, Hiroshi Sako. Confidence transformation for combining classifiers. Pattern Analysis and Applications,2004,7(1):2-17
    [130]Cheng-Lin Liu. Classifier combination based on confidence transformation. Pattern Recognition,vol. 38, no.1, Jan.2005,11-28
    [131]H. Hao, C.-L. Liu, H. Sako. Confidence evaluation for combining diverse classifiers. In Proc.7th ICDAR2003,760-764

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700