基于最小包含球的大数据集域自适应快速算法
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:A Fast Learning Algorithm Based on Minimum Enclosing Ball for Large Domain Adaptation
  • 作者:许敏 ; 王士同 ; 顾鑫 ; 俞林
  • 英文作者:XU Min1,2, WANG Shi-Tong1, GU Xin1,3, YU Lin2 1(School of Digital Media, Jiangnan University, Wuxi 214122) 2(School of Internet of Things Engineering, Wuxi Institute of Technology, Wuxi 214121) 3(Wuxi Northern Lake Optical Co., Ltd., Wuxi 214035)
  • 关键词:领域自适应 ; 支持向量域描述(SVDD) ; 最小包含球(MEB) ; 核心集 ; 大数据集
  • 英文关键词:Domain Adaptation, Support Vector Domain Description(SVDD), Minimum Enclosing Ball(MEB) , Core Set, Large Data Set
  • 中文刊名:MSSB
  • 英文刊名:Pattern Recognition and Artificial Intelligence
  • 机构:江南大学数字媒体学院;无锡职业技术学院物联网技术学院;无锡北方湖光光电有限公司;
  • 出版日期:2013-02-15
  • 出版单位:模式识别与人工智能
  • 年:2013
  • 期:v.26;No.116
  • 基金:国家自然科学基金项目(No.60903100,60975027,61170122);; 江苏省研究生创新工程项目(No.CXZZ12-0759)资助
  • 语种:中文;
  • 页:MSSB201302006
  • 页数:10
  • CN:02
  • ISSN:34-1089/TP
  • 分类号:33-42
摘要
相同应用领域,不同时间、地点或设备检测到的数据域不一定完整.文中针对如何进行数据域间知识传递问题,提出相同领域的概率分布差异可用两域最小包含球中心点表示且其上限与半径无关的定理.基于上述定理,在原有支持向量域描述算法基础上,提出一种数据域中心校正的领域自适应算法,并利用人造数据集和KDD CUP 99入侵检测数据集验证该算法.实验表明,这种领域自适应算法具有较好的性能.
        The data fields detected from different times, places or devices are not always complete even if they come from the same data resource. To solve the problem of effectively transferring the knowledge between the two fields, the theorem is proposed that the difference between two probability distributions from two domains can be expressed by the center of each domain′s minimum enclosing ball and its up limit has nothing to do with the radius. Based on the theorem, a fast center calibration domain adaptive algorithm, center calibration-core sets support vector data description (CC-CSVDD), is proposed for large domain adaptation by modifying the original support vector domain description (SVDD) algorithm. The validity of the proposed algorithm is experimentally verified on the artificial datasets and the real KDD CUP-99 datasets. Experimental results show that the proposed algorithm has good performance.
引文
[1]Yang J,Yan R,Hauptmann A G.Cross-Domain Video Concept De-tection Using Adaptive SVMs//Proc of the15th International Con-ference on Multimedia.Augsburg,Germany,2007:188-197
    [2]Blitzer J,McDonald R,Pereira F.Domain Adaptation with Struc-tural Correspondence Learning//Proc of the Conference on Empiri-cal Methods in Natural Language Processing.Philadelphia,USA,2006:120-128
    [3]Pan S J,Tsang I W,Kwok J T,et al.Domain Adaptation via Transfer Component Analysis.IEEE Trans on Neural Networks,2010,22(2):199-210
    [4]Tax D M J,Duin R P W.Support Vector Domain Description.Pat-tern Recognition Letters,1999,20(11/12/13):1191-1199
    [5]Liu Y H,Liu Yanchen,Chen Y J.Fast Support Vector Data De-scriptions for Novelty Detection.IEEE Trans on Neural Networks,2010,21(8):1296-1313
    [6]GhasemiGol M,Monsefi R,Yazdi H S.Intrusion Detection by New Data Description Method//Proc of the International Conference on Intelligent Systems,Modelling and Simulation.Liverpool,UK,2010:1-5
    [7]Tsang I W,Kwok J T,Cheung P.Core Vector Machines:Fast SVM Training on Very Large Data Sets.Journal of Machine Learning Re-search,2005,6(4):363-392
    [8]Badoiu M,Clarkson K L.Optimal Core Sets for Balls.Computation-al Geometry:Theory and Applications,2008,40(1):14-22
    [9]Tsang I W,Kwok J T,Zurada J M.Generalized Core Vector Ma-chines.IEEE Trans on Neural Networks,2006,17(5):1126-1140
    [10]Chu C S,Tsang I W,Kwok J K.Scaling up Support Vector Data Description by Using Core-Sets//Proc of the IEEE International Joint Conference on Neural Networks.Budapest,Hungary,2004,I:425-430
    [11]Deng Zhaohong,Chung F L,Wang Shitong.FRSDE:Fast Re-duced Set Density Estimator Using Minimal Enclosing Ball Approx-imation.Pattern Recognition,2008,41(4):1363-1372
    [12]Mark G,He C.Probability Density Estimation from Optimally Con-densed Data Samples.IEEE Trans on Pattern Analysis and Ma-chine Intelligence,2003,25(10):1253-1264
    [13]Marzio M Z,Taylor C C.Kernel Density Classification and Boos-ting:An L2Analysis.Statistics and Computing,2005,15(2):113-123
    [14]Hall P,Wand M P.On Nonparametric Discrimination Using Den-sity Differences.Biometrika,1988,75(3):541-547
    [15]Smola A,Schlkopf B.Sparse Greedy Matrix Approximation for Machine Learning//Proc of the17th International Conference on Machine Learning.San Francisco,USA,2000:911-918

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700