A new algorithm for skew correction and baseline detection based on the randomized Hough Transform
Abstract
The proposed technique detects the lower baselines of the text lines in Arabic documents. Because the lower-baseline pixels lie on the lower edge of the word images, we first locate the vertical black–white transitions at the black pixels, which yields an image that emphasizes the baselines of the text. Once the skew angle has been determined using a randomized Hough transform, the baselines are extracted from a histogram of y-intercepts. The algorithm can also contribute significantly to text line extraction from skewed document images in many languages.
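
The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a binary image stored as a list of rows with 1 for black and 0 for white, and it uses a simple pair-sampling form of the randomized Hough transform in which each random pair of edge pixels votes for one angle. All function names and parameters (`trials`, `angle_step`, `max_angle`, `min_dx`, `bin_height`, `min_fraction`) are hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter


def lower_edge_pixels(image):
    """Black pixels (1) whose neighbour directly below is white (0) or off-image."""
    height = len(image)
    points = []
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            if value == 1 and (y + 1 == height or image[y + 1][x] == 0):
                points.append((x, y))
    return points


def estimate_skew(points, trials=2000, angle_step=0.5, max_angle=45.0,
                  min_dx=10, seed=0):
    """Let random point pairs vote on a line angle; return the peak, in degrees.

    Angles are measured in image coordinates (y grows downward).
    """
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(trials):
        (x1, y1), (x2, y2) = rng.sample(points, 2)
        dx, dy = x2 - x1, y2 - y1
        if dx < 0:                      # orient every pair left to right
            dx, dy = -dx, -dy
        if dx < min_dx:                 # skip pairs too close to give a stable angle
            continue
        angle = math.degrees(math.atan2(dy, dx))
        if abs(angle) <= max_angle:
            votes[round(angle / angle_step)] += 1
    if not votes:
        raise ValueError("no usable point pairs")
    best_bin, _ = votes.most_common(1)[0]
    return best_bin * angle_step


def baseline_intercepts(points, angle_deg, bin_height=1, min_fraction=0.5):
    """Histogram the y-intercepts of lines at the skew angle; return the strong bins."""
    if not points:
        return []
    slope = math.tan(math.radians(angle_deg))
    hist = Counter(round((y - slope * x) / bin_height) for x, y in points)
    peak = max(hist.values())
    return sorted(b * bin_height for b, count in hist.items()
                  if count >= min_fraction * peak)
```

A caller would pass the output of `lower_edge_pixels` to `estimate_skew`, then pass the same points and the estimated angle to `baseline_intercepts`. Each returned intercept, together with the angle, describes one candidate baseline. In this sketch a coarse `angle_step` can split one baseline across adjacent intercept bins, so a real pipeline would likely refine the angle or merge neighbouring bins.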
