A novel feature extraction approach in SMS spam filtering for mobile communication: one-dimensional ternary patterns
详细信息    查看全文
  • 作者:Yilmaz Kaya and Ö ; mer Faruk Ertuğrul
  • 刊名:Security and Communication Networks
  • 出版年:2016
  • 出版时间:25 November 2016
  • 年:2016
  • 卷:9
  • 期:17
  • 页码:4680-4690
  • 全文大小:599K
  • ISSN:1939-0122
文摘
The importance and utilization of mobile communication are increasing day by day, and the short message service (SMS) is one of them. Although SMS is a widely used communication way, it brings together a major problem, which is SMS spam messages. SMS spams do not only use vain in the mobile communication traffic but also disturb users. Based on this fact, blacklisting methods, statistical methods which are built on the frequency of occurrence of words or characters, and machine learning methods have been employed. Because punishments and legal laws are not enough to solve this problem and the Group Special Mobile number of SMS spam can easily be changed, a content-based approach must be proposed. Content-based methods showed high success in spam e-mail filtering, but it is hard in the SMS spam filtering because SMS messages are extremely short and generally contains many abbreviations. In this study, an image processing method, local ternary pattern was improved to extract features from SMS messages in the feature extraction stage. In the proposed one-dimensional ternary patterns, firstly, text message was converted to their UTF-8 values. Later, each character (its UTF-8 value) in the message was compared with its neighbors. Two different feature sets were extracted from the results of these comparisons. Finally, some machine learning methods were employed to classify these features. In order to validate the proposed approach, three different SMS corpora were used. The achieved accuracies and other employee performance measures showed that the proposed approach, one-dimensional ternary patterns, can be effectively employed in SMS spam filtering. Copyright

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700