基于基因视角的恶意代码同源性判定(英文)
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Malware homology identification based on a gene perspective
  • 作者:Bing-lin ; ZHAO ; Zheng ; SHAN ; Fu-dong ; LIU ; Bo ; ZHAO ; Yi-hang ; CHEN ; Wen-jie ; SUN
  • 英文作者:Bing-lin ZHAO;Zheng SHAN;Fu-dong LIU;Bo ZHAO;Yi-hang CHEN;Wen-jie SUN;State Key Laboratory of Mathematical Engineering and Advanced Computing;Georg-August-University of Goettingen;
  • 关键词:恶意代码分类 ; 基因视角 ; 函数依赖图 ; 同源性分析
  • 英文关键词:Malware classification;;Gene perspective;;Dependency graph;;Homology analysis
  • 中文刊名:JZUS
  • 英文刊名:信息与电子工程前沿(英文)
  • 机构:State Key Laboratory of Mathematical Engineering and Advanced Computing;Georg-August-University of Goettingen;
  • 出版日期:2019-06-03
  • 出版单位:Frontiers of Information Technology & Electronic Engineering
  • 年:2019
  • 期:v.20
  • 基金:Project supported by the National Natural Science Foundation of China(Nos.61472447 and 61802435)
  • 语种:英文;
  • 页:JZUS201906005
  • 页数:16
  • CN:06
  • ISSN:33-1389/TP
  • 分类号:59-74
摘要
恶意代码同源性判定对攻击事件溯源、应急响应方案处置以及事件发展趋势预测有重要作用。目前,恶意代码同源性判定以人工分析为主,效率较低,对安全事件的爆发无法快速响应。因此,提出一种新的从基因视角分析的恶意代码同源性判定方法。恶意代码基因由表示家族同源性的子图组成。通过筛选关键应用程序接口和利用社团划分算法,从函数依赖图中提取关键子图作为恶意代码基因。然后,设计一种频繁子图挖掘算法发现恶意代码家族的共有基因,并对基因编码。最后,利用家族共有基因指导恶意代码同源性判定。对公开数据集的分类和实验结果表明,分类准确率达97%,且效率较高。
        Malware homology identification is important in attacking event tracing, emergency response scheme generation, and event trend prediction. Current malware homology identification methods still rely on manual analysis, which is inefficient and cannot respond quickly to the outbreak of attack events. In response to these problems, we propose a new malware homology identification method from a gene perspective. A malware gene is represented by the subgraph, which can describe the homology of malware families. We extract the key subgraph from the function dependency graph as the malware gene by selecting the key application programming interface(API) and using the community partition algorithm. Then, we encode the gene and design a frequent subgraph mining algorithm to find the common genes between malware families. Finally, we use the family genes to guide the identification of malware based on homology. We evaluate our method with a public dataset, and the experiment results show that the accuracy of malware classification reaches 97% with high efficiency.
引文
Alam S,Horspool RN,Traore I,2013.MAIL:Malware Analysis Intermediate Language:a step towards automating and optimizing malware detection.Proc 6th Int Conf on Security of Information and Networks,p.233-240.https://doi.org/10.1145/2523514.2527006
    Alam S,Horspool RN,Traore I,2014.MARD:a framework for metamorphic malware analysis and real-time detection.28th Int Conf on Advanced Information Networking and Applications,p.212-233.https://doi.org/10.1109/AINA.2014.59
    Cesare S,Xiang Y,Zhou WL,2013.Malwise-an effective and efficient classification system for packed and polymorphic malware.IEEE Trans Comput,62(6):1193-1206.https://doi.org/10.1109/TC.2012.65
    Defferrard M,Bresson X,Vandergheynst P,2016.Convolutional neural networks on graphs with fast localized spectral filtering.Conf and Workshop on Neural Information Processing Systems,p.3837-3845.
    Drew J,Moore T,Hahsler M,2016.Polymorphic malware detection using sequence classification methods.Security and Privacy Workshops,p.81-87.https://doi.org/10.1109/SPW.2016.30
    Han J,Zhao RC,Shan Z,et al.,2018.Analyzing and recognizing Android malware via semantic-based malware gene.Int Conf on Cyber-Enabled Distributed Computing and Knowledge Discovery,p.17-20.https://doi.org/10.1109/CyberC.2017.36
    Jang JW,Woo J,Yun J,et al.,2014.Mal-netminer:malware classification based on social network analysis of call graph.Proc 23rd Int Conf on World Wide Web,p.731-734.https://doi.org/10.1145/2567948.2579364
    Kaggle,2015.Microsoft Malware Classification Challenge(Big 2015).https://www.kaggle.com/c/malware-classification[Ac cessed on Nov.4,2015].
    Kinable J,Kostakis O,2011.Malware classification based on call graph clustering.J Comput Virol,7(4):233-245.https://doi.org/10.1007/s11416-011-0151-y
    Kipf TN,Welling M,2016.Semi-supervised classification with graph convolutional networks.https://arxiv.org/abs/1609.02907?context=cs
    Kirat D,Vigna G,2015.MalGene:automatic extraction of malware analysis evasion signature.Proc 22nd ACMSIGSAC Conf on Computer and Communications Security,p.769-780.https://doi.org/10.1145/2810103.2813642
    Liu L,Wang BS,Yu B,et al.,2017.Automatic malware classification and new malware detection using machine learning.Front Inform Technol Electron Eng,18(9):1336-1347.https://doi.org/10.1631/FITEE.1601325
    Naval S,Laxmi V,Rajarajan M,et al.,2017.Employing program semantics for malware detection.IEEE Trans Inform Forens Secur,10(12):2591-2604.https://doi.org/10.1109/TIFS.2015.2469253
    Qiao YC,Yun XC,Zhang YZ,et al.,2016.An automatic malware homology identification method based on calling habits.Acta Electron Sin,44(10):2410-2414.https://doi.org/10.3969/j.issn.0372-2112.2016.10.019
    Qihoo 360,2017.Ransomware Threat Situation Analysis Report.http://zt.360.cn/1101061855.php?dtid=1101062360&did=490927082
    Wang XZ,Liu JW,Chen XE,2015.Microsoft Malware Classification Challenge(Big 2015)first place team:say no to overfitting.https://github.com/xiaozhouwang/kaggle_Microsoft_Mal ware/blob/master/Saynotooverfitting.pdf[Accessed on Nov.2,2015].
    Wu J,Dong MX,Ota K,et al.,2018a.Big data analysis-based secure cluster management for optimized control plane in software-defined networks.IEEE Trans Network Ser Manag,15(1):27-38.https://doi.org/10.1109/TNSM.2018.2799000
    Wu J,Luo SB,Wang S,et al.,2018b.NLES:a novel lifetime extension scheme for safety-critical cyber-physical systems using SDN and NFV.IEEE Int Things J,6(2):2463-2475.https://doi.org/10.1109/JIOT.2018.2870294
    Yu B,Fang Y,Yang Q,et al.,2018.A survey of malware behavior description and analysis.Front Inform Technol Electron Eng,19(5):583-603.https://doi.org/10.1631/FITEE.1601745

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700