Text is an important source of information in video. Detecting and recognizing it greatly aids video content analysis and understanding, because text gives a concise, direct description of the stories a video presents. In digital news video, superimposed captions usually give the names of the people involved and a summary of the news event, so the recognized text can serve as part of the index in a video retrieval system and can also be used to estimate the importance of a video. Text detection and analysis is therefore an important part of video analysis, and its foundation is locating text accurately in the frame and segmenting it from a complex background.
     A text information extraction system can be divided into six parts: text detection, text localization, text tracking, text extraction, text enhancement, and text recognition. This thesis focuses on text localization. It combines the projection analysis of the edge-based method with the learning of the support vector machine (SVM) based method to localize text in video. In the experiments, the combined method gave better results than either the simple edge-based method or the learning-based method alone. Localization proceeds in two steps. In the first step, candidate text areas are extracted by the edge-based method. In the second step, a support vector machine classifies the candidates as actual text areas or false text areas, and the false ones are removed, which improves the precision of the detected text areas. Unlike the learning-based method, this method does not need to compute texture features over the whole image; it computes them only over the candidate text areas, which reduces the computation time. The features fed to the support vector machine are the wavelet, corner, line, and center-of-gravity features of the candidate text areas.
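The first step, projection analysis of an edge map, can be illustrated with a minimal pure-Python sketch. It is not the thesis's implementation. The horizontal-difference edge operator, the function names, and the thresholds `edge_thresh` and `min_edges` are all assumptions chosen for illustration. The SVM verification of the second step is not shown.

```python
def horizontal_gradient(gray):
    """Absolute difference between horizontally adjacent pixels (a crude edge map)."""
    return [[abs(row[x + 1] - row[x]) for x in range(len(row) - 1)] for row in gray]


def runs_above(profile, threshold):
    """Return (start, end) pairs, end exclusive, where profile >= threshold."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value >= threshold and start is None:
            start = i
        elif value < threshold and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def candidate_text_boxes(gray, edge_thresh=50, min_edges=2):
    """Propose (top, bottom, left, right) boxes, ends exclusive, from edge projections.

    gray is a list of rows of grayscale values. Both thresholds are
    illustrative assumptions, not values taken from the thesis.
    """
    # Binarize the edge strength.
    edges = [[1 if g >= edge_thresh else 0 for g in row]
             for row in horizontal_gradient(gray)]
    # Horizontal projection: number of edge pixels in each row.
    row_profile = [sum(row) for row in edges]
    boxes = []
    for top, bottom in runs_above(row_profile, min_edges):
        # Vertical projection restricted to this band of rows.
        col_profile = [sum(edges[y][x] for y in range(top, bottom))
                       for x in range(len(edges[0]))]
        cols = runs_above(col_profile, 1)
        if cols:
            # Edge index x lies between pixels x and x + 1, hence the + 1.
            boxes.append((top, bottom, cols[0][0], cols[-1][1] + 1))
    return boxes
```

Each box returned here is only a candidate. In the two-step scheme described above, features would then be computed inside each box and passed to a trained classifier that rejects the false text areas.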
     This method is then applied to localizing text in advertisements, using a multi-resolution approach, as part of an advertisement detection system. Text in news video is relatively formal and appears in fixed areas of the frame, whereas text in advertisements varies widely in size and style. The method gives a more accurate position of advertisements and showed good results in the experiments.
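The abstract does not say how the multi-resolution step is built. One common reading is an image pyramid: run the same localizer on successively halved copies of the frame and map the boxes back to full-resolution coordinates, so that large advertisement text is found at coarse levels and small text at fine levels. The sketch below assumes that reading. The names `downsample2` and `multires_localize`, the 2x2 averaging, and the `detect` callback are hypothetical, and merging of duplicate boxes across levels is left out.

```python
def downsample2(gray):
    """Halve the resolution by averaging each 2x2 block (odd trailing row/column dropped)."""
    h = len(gray) // 2
    w = len(gray[0]) // 2 if gray else 0
    return [[(gray[2 * y][2 * x] + gray[2 * y][2 * x + 1]
              + gray[2 * y + 1][2 * x] + gray[2 * y + 1][2 * x + 1]) / 4
             for x in range(w)] for y in range(h)]


def multires_localize(gray, detect, levels=3):
    """Run detect on each pyramid level and return boxes in full-resolution coordinates.

    detect is any function mapping an image to a list of
    (top, bottom, left, right) boxes; it is a placeholder for a
    single-scale text localizer, not a function from the thesis.
    """
    boxes, scale, level_img = [], 1, gray
    for _ in range(levels):
        if not level_img or not level_img[0]:
            break  # the image has shrunk to nothing
        for top, bottom, left, right in detect(level_img):
            # Map level coordinates back to the original frame.
            boxes.append((top * scale, bottom * scale, left * scale, right * scale))
        level_img = downsample2(level_img)
        scale *= 2
    return boxes
```

A region reported at several levels appears several times in the result, so a real system would follow this with a merging or voting step.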
