Preparing for OCR of Books Handled by Visually Impaired
  • Keywords: OCR; Visually impaired; Voice scanner for blind people; Book flipping; Angle correction; Image correction; Image distortion; Real-time processing; Book video stream; Image streaming
  • Journal: Lecture Notes in Computer Science
  • Publication year: 2016
  • Volume: 10070
  • Issue: 1
  • Pages: 419-430
  • Full-text size: 1,720 KB
  • Authors and affiliations: César Crovato (17)
    Delfim Torok (17)
    Regina Heidrich (18)
    Bernardo Cerqueira (19)
    Eduardo Velho (19)

    17. Institute of Technology and Exact Sciences at Feevale University, Novo Hamburgo, Brazil
    18. Feevale University, Novo Hamburgo, Brazil
    19. Scientific Improvement Researcher, Feevale University, Novo Hamburgo, Brazil
  • Book title: Ubiquitous Computing and Ambient Intelligence
  • ISBN: 978-3-319-48799-1
  • Subject category: Computer Science
  • Subject topics: Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • Publisher: Springer Berlin / Heidelberg
  • ISSN: 1611-3349
Abstract
The objective of this work is to synthesize the difficulties an algorithm must handle when digitizing books for subsequent OCR, such as angle correction, image distortion, and word segmentation, while being operated by blind or visually impaired people in real time from a video stream and without further assistance. The developed method seems reliable and provides good OCR results on a page-by-page basis. The results show improvements above 99.3% in OCR performance in some cases, although execution time increased. “The Vocalizer Project” emerged from a demand by the Brazilian Ministry of Culture and Education for application in schools and public libraries. It aims to create more inclusive smart cities and to give visually impaired and blind people access to the vast body of existing bibliographic material.
