贝页斯垃圾邮件分类系统成本参数调整对系统精度的影响
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Effect of Cost Parameters Adjustment on the Accuracy of Bayesian Anti-Spam Filtering System
  • 作者:崔超 ; 吕丹 ; 姜淑凤
  • 英文作者:CUI Chao;Lü Dan;JIANG Shu-feng;College of Computer and Control Engineering,Qiqihar University;School of Mechatronics Engineering,Qiqihar University;
  • 关键词:系统建模 ; 贝叶斯系统 ; 过滤器 ; 成本分析
  • 英文关键词:system modeling;;Bayesian system;;filter;;cost analysis
  • 中文刊名:BJLG
  • 英文刊名:Transactions of Beijing Institute of Technology
  • 机构:齐齐哈尔大学计算机与控制工程学院;齐齐哈尔大学机电工程学院;
  • 出版日期:2019-02-15
  • 出版单位:北京理工大学学报
  • 年:2019
  • 期:v.39;No.288
  • 基金:国家教育部产学合作协同育人项目(201702104029)
  • 语种:中文;
  • 页:BJLG201902006
  • 页数:5
  • CN:02
  • ISSN:11-2596/T
  • 分类号:36-40
摘要
为降低垃圾邮件系统分类计算的误码率,分析了贝页斯垃圾邮件过滤系统对目标邮件的自动检测过程,从系统过滤质量和用户容错两个方面研究系统成本定义.在不同样本集合及其属性空间内,对于词语还原和间断表的开启与关闭,重点分析成本参数λ,通过调整成本参数分析贝页斯过滤系统在多种假定下邮件处理结果,完善系统建模定义标准,优化应用系统建模,提高系统过滤质量.实验结果证明该解决方案是可行的.
        In order to reduce the error rate in classification calculation of anti-spam filtering system, the automatic testing process of target email in the Bayesian anti-spam filtering system was analyzed, the definition of system cost was researched from two aspects of system filtering quality and user fault tolerance. Cost parameters were analyzed in the collection of different sample sets and attribute spaces, with disabling and enabling lemmatizer and stop-list. By adjusting the cost parameters, the results of the Bayesian filtering system in various assumptions were analyzed, the standard of system modeling was optimized, and system filtering quality was upgraded. The results prove that the scheme is feasible.
引文
[1]Kush E N.Learning to remove internet advertisements[C]//Proceedings of the 3rd International Conference on Autonomous Agents.Seattle,Washington:[s.n.],1999:175-181.
    [2]Hall R J.How to avoid unwanted E-mail[J].Communication of ACM,1998,41(3):88-95.
    [3]Cohen W W.Learning rule that classify E-Mail[C]//Proceedings of the AAAI Spring Symposium on Machine learning in information Access.Stanford,California:[s.n.],1996:85-90.
    [4]崔超,吴双,张宪忠,等.基于贝叶斯概率理论的防火墙技术研究[J].北京理工大学学报,2012,32(8):801-804.Cui Chao,Wu Shuang,Zhang Xianzhong,et al.Firewall technology based on Bayesian probability theory[J].Transactions of Beijing Institute of Technology,2012,32(8):801-804.(in Chinese)
    [5]崔超.贝叶斯网络在垃圾邮件算法中的应用研究[J].哈尔滨工业大学学报,2011,43(11):145-148.Cui Chao.Bayesian application study on arithmetic for filtering junk e-mail[J].Journal of Harbin Institute of Technology,2011,43(11):145-148.(in China)
    [6]Paul G.Better Bayesian filtering[OL].[2003-01-20].http://paulgraham.com/better.html.
    [7]Dan L V,SUN Jianfeng,Li Qi,et al.Model-based recognition of 3D articulated target using ladar range data[J].Applied Optics,2015,54(17): 5382-5391.
    [8]Jiang Shufeng.Study on multi-strategy analysis and application of A-r algorithm[J].Information Technology Journal,2015,12(21): 6096-6097.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700