A hybrid spam detection method based on unstructured datasets

Authors: Yeqin Shao ; Marcello Trovati ; Quan Shi ; Olga Angelopoulou…
Keywords: Image spam ; Text spam ; Semantic networks ; Classification ; Subclass discriminant analysis ; Feature selection ; Sparse representation
Journal: Soft Computing
Publication year: 2017
Publication date: January 2017
Volume: 21
Issue: 1
Pages: 233-243
Journal category: Engineering
Journal subjects: Computational Intelligence; Artificial Intelligence (incl. Robotics); Mathematical Logic and Foundations; Control, Robotics, Mechatronics
Publisher: Springer Berlin Heidelberg
ISSN: 1433-7479

Abstract

The identification of non-genuine or malicious messages poses a variety of challenges due to the continuous changes in the techniques utilised by cyber-criminals. In this article, we propose a hybrid detection method based on a combination of image and text spam recognition techniques. In particular, the former is based on sparse representation-based classification, which focuses on the global and local image features, and a dictionary learning technique to achieve a spam and a ham sub-dictionary. On the other hand, the textual analysis is based on semantic properties of documents to assess the level of maliciousness. More specifically, we are able to distinguish between meta-spam and real spam. Experimental results show the accuracy and potential of our approach.
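
To make the image-side idea concrete, here is a minimal sketch of classifying a feature vector by comparing its reconstruction residual against a spam sub-dictionary and a ham sub-dictionary. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not name the sparse solver, so a small orthogonal matching pursuit is swapped in, the dictionaries are hand-made toy matrices rather than learned ones, and the names `omp`, `classify`, `D_spam`, and `D_ham` are hypothetical.

```python
import numpy as np


def omp(D, x, k):
    """Greedy orthogonal matching pursuit: code x with at most k columns of D."""
    residual = x.copy()
    support = []
    sol = np.zeros(0)
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break  # no new atom helps; stop early
        support.append(idx)
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    if support:
        coef[support] = sol
    return coef


def classify(x, D_spam, D_ham, k=3):
    """Label x by whichever sub-dictionary reconstructs it with the smaller residual."""
    D = np.hstack([D_spam, D_ham]).astype(float)
    D = D / np.linalg.norm(D, axis=0)  # unit-norm atoms
    coef = omp(D, x.astype(float), k)
    n_spam = D_spam.shape[1]
    r_spam = np.linalg.norm(x - D[:, :n_spam] @ coef[:n_spam])
    r_ham = np.linalg.norm(x - D[:, n_spam:] @ coef[n_spam:])
    return "spam" if r_spam < r_ham else "ham"


# Toy dictionaries (assumption): spam atoms span the first two axes,
# ham atoms span the last two.
D_spam = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [0.0, 0.0]])
D_ham = np.array([[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])

print(classify(np.array([2.0, 1.0, 0.0, 0.0]), D_spam, D_ham))  # spam
print(classify(np.array([0.0, 0.0, 1.0, 3.0]), D_spam, D_ham))  # ham
```

In a real system the columns of each sub-dictionary would be learned from global and local image features, as the abstract describes; the residual comparison in `classify` is the only part this sketch is meant to show.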
