基于Android的个性化RSS订阅系统的设计与实现
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
移动互联网发展到今天并伴随着安致等智能手机操作系统出现,我们已经进入了一个移动网络高速化、信息海量化的时代。使用丰富站点摘要(RSS,Rich Site Summary)订阅系统不需要直接去各个门户网站进行信息的搜索,因此解决了用户打开众多网页的麻烦。而且,RSS订阅系统会定时更新,使得信息不能及时更新的问题得到了很好的解决。但是,由于各个网站报道的新闻内容很多是相似的,用户因此订阅了很多重复的信息。
     针对现有的RSS阅读器当用户同时订阅很多网站信息时会存在大量重复信息的缺点,本系统将改进现有RSS阅读器功能,实现个性化订阅。这些个性化的功能分别是相同或相似内容新闻过滤功能,即当订阅的网站出现相同或相似的新闻时仅显示用户最想看到的新闻;选择订阅功能,即用户设置的订阅关键词或禁止关键词后,优先显示用户订阅的新闻不显示禁止的新闻以及收藏夹功能,即可收藏自己感兴趣的新闻。
     本系统开始先从RSS网站取得RSS文件并通过解析模块进行解析,然后通过中文分词算法对解析出的新闻标题进行中文分词,并将分词结果作为该新闻的关键词。最后按照用户对RSS源优先级的设置过滤重复新闻,显示优先级最高的新闻即用户最想看的新闻;如果用户设置的订阅或禁止显示的关键词,则要优先显示用户订阅的新闻,不显示禁止的新闻。
     通过引入准确率和召回率以及F值的评测指标,将系统运行结果加以详细的统计和对比分析,系统满足当初的设计要求,能够为用户提供个性化订阅服务,因此认为得到了较为理想的结果。
As for mobile internet development and the smart mobile phones operating system about the Android appeared, we have entered a high-speed mobile network, the quantification of the information era. Using RSS(Rich Site Summary) reader does not need to go straight to the website for information searching and thus solve a user’s trouble that user needs to open a lot of webs. RSS subscription system will regularly update, which provides a solution to the update problem. However, a lot of sites contain similar news, so a lot of information is subscribed by the user.
     For an existing RSS reader, there is much duplication of information shortcomings, the system will improve the existing functions, implement personalized subscribe. These customized features are deleting news that have the same or similar content (when the subscription web site that is the same or similar news, it shows only the news which user wants to see), selectively subscription (it only shows subscription news that are set by the user) and favorite (user favorites news which is interest to by user).
     Firstly, this system obtains RSS documents from RSS web site and parses module. Then, tag keyword from the title of news analyzed by using the Chinese word segmentation. Finally, in accordance with priority which is set by user, implement for filtering news and show the highest priority news. If user set keyword, firstly show new which is subscribed by user, and don’t show new which is forbidden by user.
     Import the value of P and R and the evaluation parameter and run the system to be detailed statistics or comparative analysis. System achieves the original design. It can be able to offer individualized subscription service. So a good result is achieved.
引文
[1]胡晶晶,郑志蕴.基于RSS的个性化信息服务研究[J].计算机应用与软件,2009,26(5):3
    [2]翟晓玲.面向学科的基础教育资源垂直搜索引擎的研究与实现[D].东北师范大学硕士学位论文,2009:1~2
    [3]胡潜,汪会玲.基于RSS的个性化推送服务[J].情报杂志,2008,27(10):3
    [4]陈旻,江兰帆.基于Google Android平台的移动开发研究[J].福建电脑, 2008(11):156~157
    [5] Symbian[OL]. http://baike.baidu.com/view/10708.htm
    [6] Windows Mobile[OL]. http://baike.baidu.com/view/40733.htm
    [7] iOS[OL]. http://baike.baidu.com/view/158983.htm
    [8]李惠,丁革建.智能手机操作系统概述[J].电脑与电信,2009(3):67~68
    [9] Frank Ableson, Robi Sen. Unlocking Android[M]. Manning Publications, 2010:24~38
    [10] Reto Meier. Professional Android 2 Application Development[M]. Wrox, 2010:2~4
    [11]韩青.以商业级Linux抓住Android带来的商机[J].现代电信科技,2008(l):24~26
    [12] OMS[OL]. http://www.shouji114.com/doc-view-167.html
    [13] Holzner, Steven. Secrets of RSS[M]. Addison-Wesley,2005:3~16
    [14]詹青龙. RSS技术及其在远程教育中的应用[J].教育信息化, 2005(14):62~63
    [15]郭军城,于金海. RSS的版本演变[J].科技情报开发与经济, 2007(33):191~192
    [16]史旻昱.基于RSS的个性化网络广告推荐系统研究[D].华中科技大学博士论文,2008:2~7
    [17] Eric Miller. An Introduction to the Resource Description Framework[J]. D-Lib Magazine,1998:32~34
    [18]葛嘉佳.网络个性化信息服务综述[J].计算机时代,2004(2):11~12
    [19]卢小华.网络环境下高校图书馆个性化信息服务研究[J].长沙理工大学图书馆,2004(12):187~189
    [20]赵庆峰.个性化信息服务系统的评估[J].情报探索,2006():3~5
    [21]庞海涛. RSS技术及其在新闻发布网站的应用[J].交通与计算机, 2006(2):118~119
    [22]姚星吴,刘卫国. Android的架构与应用开发研究[J].计算机系统应用, 2008(11):110~112
    [23] Rick Rogers ,John Lombardo , Zigurd Mednieks ,G. Blake Meike. Android Application Development: Programming with the Google SDK[M]. O'Reilly Media,2009:1~13
    [24] Shane Conder,Lauren Darcey. Android Wireless Application Development[M]. Addison-Wesley Professional,2009:6~17
    [25]姚昱旻.基于Android的移动学习终端平台的开发与研究[D].中南大学硕士学位论文,2008:14~15
    [26] Fekade Getahun Taddesse, Joe Tekli, Richard Chbeir, Marco Viviani, Kokou Yetongnon. Semantic-based Merging of RSS Items[J]. World Wide Web, 2009(12): 169~207
    [27] Paoshan Yue,Araby Greene,Lisa S. Blackwell. Do You See RSS in Your Future[J]. The Serials Librarian,2002(3):56~58
    [28] John Garofalakis, Vassilios Stefanis. Using RSS feeds for effective mobile web browsing[J]. Universal Access in the Information Society,2007(3): 249~257
    [29]李朝虹,陆建湖.现代汉语自动分析研究现状综述[J].广西教育学院学报,2003(l):114
    [30] Zhang Huipeng, Liu Ting, Ma Jinshan. Chinese Word Segmentation with Multiple Postproeessors in HIT-IRLab[J]. Sighan,2005:172~175
    [31] M.Scott. Thede, Mary P.Harper. A second-order Hidden Markov Model for part-of-speech tagging[R]. The 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics. Maryland,1999:175~182
    [32] S.Richard, Thomas E. The first international Chinese word segmentation bakeoff[R]. The Second SIGHAN Workshop on Chinese Language Processing. Japan.2003
    [33] S. M. Thede, M. P. Harper. A Second-order Hidden Markov Model for Part-of-speech Tagging[R]. The 37th annual meeting of the association forcomputation linguistics. 1999:175~182
    [34]刘东旭.在自然汉语中进行分词和词性标注[D].电子科技大学硕士学位论文,2003:12~13
    [35]王茜. Android嵌入式系统架构及内核浅析[J].电脑开发与应用,2011(4):59~61
    [36]单李旺. Android操作平台的研究与应用[D].南开大学硕士学位论文, 2009:9~11
    [37] Simson Garfinkel. Android[J]. Technology Review, 2008,20(3):2
    [38] Hammersely Ben. Developing Feeds with RSS add Atom[M]. O’Reilly, 2005:10~12
    [39] Owens M. Query Anything with SQLite[J]. The World of Software Development,2007(12):24~28
    [40] Owens M. The SQLite Database Engine[J]. C/C++UsersJoumal, 2004(3):6~15
    [41] A Selkirk. XML and Security. BT Technology Journal[J],2001(3):23~34
    [42]刘雨潇.基于SAX的XML数据解析技术分析研究[J].现代电子技术, 2010(12):55~56
    [43] D. A. Lizorkin,K. Yu. Lisovsky. Implementation of the XML linking language XLink by functional methods. Programming and Computer Software[J]. 2005(1): 34~46
    [44] ICTCLAS[OL]. http://ictclas.org/ictclas_introduction.html
    [45]俞鸿魁,张华平,刘群等.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006(2):87~93
    [46]张华平,刘群.基于N-最短路径方法的中文词语粗分模型[J].中文信息学报,2002(5):1-7
    [47]王睿,张洁,张由仪等.基于混合模型的中文命名实体抽取系统[J].清华大学学报,2005

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700