论文标题

推文中的线索:Twitter引入的发现和SMS垃圾邮件的分析

Clues in Tweets: Twitter-Guided Discovery and Analysis of SMS Spam

论文作者

Tang, Siyuan, Mi, Xianghang, Li, Ying, Wang, XiaoFeng, Chen, Kai

论文摘要

SMS(简短消息服务)长期以来一直滥用垃圾邮件中的SMS(简短消息服务)在业务和服务交付中的关键作用,如今仍在上升,这可能是由于A2P散装消息的出现。缺乏有关非法活动的最新信息,阻碍了控制SMS垃圾邮件的努力。在我们的研究中,我们提出了一种新颖的解决方案,以从Twitter大规模收集最新的SMS垃圾邮件数据,用户自愿报告他们收到的垃圾邮件消息。为此,我们设计并实施了Spamhunter,这是一种自动化管道,以发现SMS垃圾邮件报告推文并从附件屏幕截图中提取消息内容。利用Spamhunter,我们从Twitter收集了21,918个SMS垃圾邮件消息的数据集,其中75种语言跨越了四年。据我们所知,这是有史以来最大的SMS垃圾邮件数据集公开。更重要的是,Spamhunter使我们能够不断监视新兴的SMS垃圾邮件消息,这有助于减轻SMS垃圾邮件的持续努力。我们还进行了一项深入的测量研究,阐明了垃圾邮件发送者策略,基础设施和垃圾邮件活动的新趋势。我们还利用了垃圾邮件SMS数据来评估SMS生态系统所实现的垃圾邮件对策的鲁棒性,包括反垃圾邮件服务,批量SMS服务和文本消息传递应用程序。我们的评估表明,这种保护无法有效处理这些垃圾邮件样本:引入重大的假阳性或缺少大量新报告的垃圾邮件消息。

With its critical role in business and service delivery through mobile devices, SMS (Short Message Service) has long been abused for spamming, which is still on the rise today possibly due to the emergence of A2P bulk messaging. The effort to control SMS spam has been hampered by the lack of up-to-date information about illicit activities. In our research, we proposed a novel solution to collect recent SMS spam data, at a large scale, from Twitter, where users voluntarily report the spam messages they receive. For this purpose, we designed and implemented SpamHunter, an automated pipeline to discover SMS spam reporting tweets and extract message content from the attached screenshots. Leveraging SpamHunter, we collected from Twitter a dataset of 21,918 SMS spam messages in 75 languages, spanning over four years. To our best knowledge, this is the largest SMS spam dataset ever made public. More importantly, SpamHunter enables us to continuously monitor emerging SMS spam messages, which facilitates the ongoing effort to mitigate SMS spamming. We also performed an in-depth measurement study that sheds light on the new trends in the spammer's strategies, infrastructure and spam campaigns. We also utilized our spam SMS data to evaluate the robustness of the spam countermeasures put in place by the SMS ecosystem, including anti-spam services, bulk SMS services, and text messaging apps. Our evaluation shows that such protection cannot effectively handle those spam samples: either introducing significant false positives or missing a large number of newly reported spam messages.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源