Mitigating Adversarial Gray-Box Attacks Against Phishing Detectors

Bibliographic Details
Published in:	IEEE transactions on dependable and secure computing 2023-09, Vol.20 (5), p.1-19
Main Authors:	Apruzzese, Giovanni; Subrahmanian, V. S.
Format:	Article
Language:	English
Subjects:	Adversarial attacks; Algorithms; cybersecurity; Detectors; Feature extraction; Machine learning; Machine learning algorithms; Performance prediction; Phishing; phishing detection; Robustness

Description
Summary:	Although machine learning based algorithms have been extensively used for detecting phishing websites, there has been relatively little work on how adversaries may attack such "phishing detectors" (PDs for short). In this paper, we propose a set of Gray-Box attacks on PDs that an adversary may use which vary depending on the knowledge that he has about the PD. We show that these attacks severely degrade the effectiveness of several existing PDs. We then propose the concept of operation chains that iteratively map an original set of features to a new set of features and develop the "Protective Operation Chain" (POC for short) algorithm. POC leverages the combination of random feature selection and feature mappings in order to increase the attacker's uncertainty about the target PD. Using 3 existing publicly available datasets plus a fourth that we have created and will release upon the publication of this paper (see footnote 1), we show that POC is more robust to these attacks than past competing work, while preserving predictive performance when no adversarial attacks are present. Moreover, POC is robust to attacks on 13 different classifiers, not just one. These results are shown to be statistically significant at the p < 0.001 level. Footnote 1: After consultation with the editor in chief, we provide a sample of our dataset for the referees.
ISSN:	1545-5971; 1941-0018
DOI:	10.1109/TDSC.2022.3210029