Application of a Parallel FSM Parsing Algorithm for Web Engines
  • Keywords: Parallel computing; FSM; Web page parsing; HTML5
  • Journal title: Lecture Notes in Computer Science
  • Publication year: 2017
  • Volume: 10135
  • Issue: 1
  • Pages: 133-143
  • Series title: Smart Computing and Communication
  • ISBN: 978-3-319-52015-5
Abstract
Finite state machines (FSMs) are widely used to parse HTML pages in the WebKit browser. The traditional WebKit kernel parses web pages on a single thread, so it cannot make full use of multi-core processors, and its page analysis is not granular down to HTML paragraphs; as a result, the browser processes each page serially. Our research implements parallel page parsing in WebKit, with the aim of improving the browser's parsing speed. Building on optimizations to the method proposed in [13], we put forward a novel parsing model named PFBE (Parallel FSM Browser Engine) based on data parallelism. It implements a preliminary parser that loads network resources and parses strings in parallel, thereby using multi-core processors to improve the browser's performance. PFBE divides the input data into multiple segments that contain shared (common) characters, and compares those shared characters to determine whether segments should be merged. PFBE reuses the original serial FSM, which is highly optimized and compatible with HTML5 standards and technology. We use the Chromium web engine to present PFBE in detail; the results show that PFBE improves on the serial FSM, reducing page loading time by 12.36%.
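The segment-and-merge idea in the abstract can be illustrated with a small Python sketch. It is not PFBE itself: the abstract does not give the merge rule, so everything below is an assumption made for illustration. The two-state tokenizer (`tokenize`) is a toy stand-in for the browser's HTML5 FSM, the names `stitch`, `parallel_tokenize`, `seg_len` and `overlap` are hypothetical, and the rule used here (accept a segment's result if both sides agree on a token start inside the shared characters, otherwise merge and re-parse serially) is only one plausible reading of "comparing the shared characters".

```python
from concurrent.futures import ThreadPoolExecutor

def tokenize(text, base=0):
    """Toy serial FSM with two states (TEXT, TAG), always started in TEXT.
    Returns (start, end, kind) tokens using absolute offsets."""
    tokens, state, start = [], "TEXT", 0
    for i, ch in enumerate(text):
        if state == "TEXT" and ch == "<":
            if i > start:
                tokens.append((base + start, base + i, "text"))
            state, start = "TAG", i
        elif state == "TAG" and ch == ">":
            tokens.append((base + start, base + i + 1, "tag"))
            state, start = "TEXT", i + 1
    if start < len(text):  # flush an unfinished token at the end of the input
        tokens.append((base + start, base + len(text), state.lower()))
    return tokens

def stitch(text, left, right, seg_start, seg_end):
    """Join the trusted result `left` with the speculative result `right`."""
    # Token starts that the trusted side sees inside the shared characters.
    left_starts = {t[0] for t in left if t[0] >= seg_start}
    sync = next((t[0] for t in right if t[0] in left_starts), None)
    if sync is None:
        # No agreement (assumed rule): merge the segments and re-parse
        # serially from the last trusted token start.
        redo = left[-1][0]
        return left[:-1] + tokenize(text[redo:seg_end], redo)
    # Both runs are in TEXT at `sync`, so they agree from there onward.
    return [t for t in left if t[0] < sync] + [t for t in right if t[0] >= sync]

def parallel_tokenize(text, seg_len=16, overlap=8, workers=4):
    # Each segment carries `overlap` extra characters shared with its successor.
    starts = range(0, len(text), seg_len)
    segments = [(s, text[s:s + seg_len + overlap]) for s in starts]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parsed = list(pool.map(lambda seg: tokenize(seg[1], seg[0]), segments))
    result = parsed[0] if parsed else []
    for (s, seg), right in zip(segments[1:], parsed[1:]):
        result = stitch(text, result, right, s, s + len(seg))
    return result

page = '<html><body class="main">Hello <b>world</b></body></html>'
assert parallel_tokenize(page) == tokenize(page)
```

The sketch only shows the structure: threads in CPython do not run this tokenizer on several cores at once, and a real HTML5 tokenizer has many more states (attributes, comments, scripts), so finding a point where two runs agree is harder than it is here.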
