A comprehensive survey of AI-enabled phishing attacks detection techniques

Telecommunication Systems (2021) 76:139–154
https://doi.org/10.1007/s11235-020-00733-2
A comprehensive survey of AI-enabled phishing attacks detection
techniques
Abdul Basit1 · Maham Zafar1 · Xuan Liu2 · Abdul Rehman Javed3 · Zunera Jalil3 · Kashif Kifayat3
Accepted: 9 October 2020 / Published online: 23 October 2020
© Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract
In recent times, a phishing attack has become one of the most prominent attacks faced by internet users, governments, and
service-providing organizations. In a phishing attack, the attacker(s) collects the client’s sensitive data (i.e., user account
login details, credit/debit card numbers, etc.) by using spoofed emails or fake websites. Phishing websites are common
entry points of online social engineering attacks, including numerous frauds on the websites. In such types of attacks, the
attacker(s) create website pages by copying the behavior of legitimate websites and sends URL(s) to the targeted victims
through spam messages, texts, or social networking. To provide a thorough understanding of phishing attack(s), this paper
provides a literature review of Artificial Intelligence (AI) techniques: Machine Learning, Deep Learning, Hybrid Learning,
and Scenario-based techniques for phishing attack detection. This paper also presents the comparison of different studies
detecting the phishing attack for each AI technique and examines the qualities and shortcomings of these methodologies.
Furthermore, this paper provides a comprehensive set of current challenges of phishing attacks and future research direction
in this domain.
Keywords Phishing attack · Security threats · Advanced phishing techniques · Cyberattack · Internet security · Machine
learning · Deep learning · Hybrid learning
Abbreviations
SVM Support vector machine
RF Random forest
B Xuan Liu
[email protected]
Abdul Basit
[email protected]
Maham Zafar
[email protected]
Abdul Rehman Javed
[email protected]
Zunera Jalil
[email protected]
Kashif Kifayat
[email protected]
1 Department of Computer Science, Air University, E-9,
Islamabad, Pakistan
2 School of Information Engineering, Yangzhou University,
Yangzhou, China
3 Department of Cyber Security, Air University, E-9,
Islamabad, Pakistan
IBK Instant base learner
ANN Artificial neural network
RF Rotation forest
DT Decision forest
eDRI Enhanced dynamic rule induction
LR Linear regression
CART Classification and regression tree
XGB Extreme gradient boost
GBDT Gradient boosting decision tree
AB AdaBoost
NN Neural-networks
GBM Gradient boosting machine
GLM Generalized linear model
NB Navies Bayes
KNN K-nearest neighbor
KS K-star
LC-ELM Combination extreme learning machine
ELM Extreme learning machine
RC Random committee
PCA Principle component analysis
123
140 A. Basit et al.
1 Introduction
The process of protecting cyberspace from attacks has come
to be known as Cyber Security [16,32,37]. Cyber Security
is all about protecting, preventing, and recovering all the
resources that use the internet from cyber-attacks [20,38,47].
The complexity in the cybersecurity domain increases daily,
which makes identifying, analyzing, and controlling the relevant risk events significant challenges. Cyberattacks are
digital malicious attempts to steal, damage, or intrude into
the personal or organizational confidential data [2]. Phishing attack uses fake websites to take sensitive client data,
for example, account login credentials, credit card numbers,
etc. In the year of 2018, the Anti-Phishing Working Group
(APWG) detailed above 51,401 special phishing websites.
Another report by RSA assessed that worldwide associations
endured losses adding up to $9 billion just due to phishing
attack happenings in the year 2016 [26]. These stats have
demonstrated that the current anti-phishing techniques and
endeavors are not effective. Figure 1 shows how a typical
phishing attack activity happens.
Personal computer clients are victims of phishing attack
because of the five primary reasons [60]: (1) Users do not
have brief information about Uniform Resource Locator
(URLs), (2) the exact idea about which pages can be trusted,
(3) entire location of the page because of the redirection or
hidden URLs, (4) the URL possess many possible options,
or some pages accidentally entered, (5) Users cannot differentiate a phishing website page from the legitimate ones.
Phishing websites are common entry points of online
social engineering attacks, including numerous ongoing web
scams [30]. In such type of attacks, the attackers create website pages by copying genuine websites and send suspicious
URLs to the targeted victims through spam messages, texts,
or online social networking. An attacker scatters a fake variant of an original website, through email, phone, or content
messages [5], with the expectation that the targeted victims
would accept the cases in the email made. They will likely
target the victim to include their personal or highly sensitive data (e.g., bank details, government savings number,
etc.). A phishing attack brings about an attacker acquiring
bank card information and login data. In any case, there
are a few methods to battle phishing [27]. The expanded
utilization of Artificial Intelligence (AI) has affected essentially every industry, including cyber-security. On account
of email security, AI has brought speed, accuracy, and the
capacity to do a detailed investigation. AI can detect spam,
phishing, skewers phishing, and different sorts of attacks utilizing previous knowledge in the form of datasets. These
type of attacks likely creates a negative impact on clients’
trust toward social services such as web services. According
to the APWG report, 1,220,523 phishing attacks have been
reported in 2016, which is 65% more expansion than 2015
[1]. Figure 2 shows the Phishing Report for the third quarter
of 2019.
As per Parekh et al. [51], a generic phishing attack has
four stages. First, the phisher makes and sets up a fake
website that looks like an authentic website. Secondly, the
person sends a URL connection of the website to a targeted
victim pretending like a genuine organization, user, or association. Thirdly, the person in question will be tempted to visit
the injected fake website. Fourth, the unfortunate targeted
victim will click on the fake source link and give his/her
valuable data as input. By utilizing the individual data of
the person in question, impersonation activities will be performed by the phisher. APWG contributes individual reports
on phishing URLs and analyzes the regularly evolving nature
and procedures of cybercrimes. The Anti-Phishing Working
Group (APWG) tracks the number of interesting phishing
websites, an essential proportion of phishing over the globe.
Phishing locales dictate the interesting base URLs. The absolute number of phishing websites recognized by APWG in
the 3rd quarter-2019 was 266,387 [3]. This was 46% from
the 182,465 seen in Q2 and in Q4-2018 practically twofold
138,328 was seen.
Figure 3 shows the most targeted industries in 2019.
Attacks on distributed storage and record facilitating websites, financial institutions stayed more frequent, and attacks
on the gaming, protection, vitality, government, and human
services areas were less prominent during the 3rd quarter [3].
MarkMonitor is an online brand insurance association,
verifying licensed innovation. In the 3rd quarter of 2019, the
greatest focus of phishing remained Software as a service
(SaaS) and webmail websites. Phishers keep on collecting
credentials to these sorts of websites, using them to execute
business email compromises (BEC) and to enter corporate
SaaS accounts.
This survey covers the four aspects of a phishing attack:
communication media, target devices, attack technique, and
counter-measures as shown in Fig. 4. Human collaboration is
a communication media with an application targeted by the
attack. Seven types of communication media which include
Email, Messenger, Blog & Forum, Voice over internet protocol, Website, Online Social Network (OSN), and Mobile
platform are identified from the literature. For the selection
of attack strategies, our devices play a significant role as
victims interact online through physical devices. Phishing
attack may target personal computers, smart devices, voices
devices, and/or WiFi-smart devices which includes VOIP
devices as well as mobile phone device.
Attack techniques are grouped into two categories: attack
launching and data collection. For attack launching, several
techniques are identified such as email spoofing, attachments,
abusing social settings, URLs spoofing, website spoofing,
intelligent voice reaction, collaboration in a social network,
reserve social engineering, man in the middle attack, spear
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 141
Fig. 1 Phishing attack diagram
[26]
Fig. 2 Phishing report for third
quarter of the year 2019 [1]
phishing, spoofed mobile internet browser and installed web
content. Meanwhile, for data collection during and after
the victim’s interaction with attacks, various data collection
techniques are used [49]. There are two types of data collection techniques, one is automated data collection techniques
(such as fake websites forms, key loggers, and recorded messages) and the other is manual data collection techniques
(such as human misdirection and social networking). Then,
there are counter-measures for victim’s data collected or
used before and after the attack. These counter-measures
are used to detect and prevent attacks. We categorized
counter-measurement into four groups (1) Deep learningbased Techniques, (2) Machine learning Techniques, (3)
Scenario-based Techniques, and (4) Hybrid Techniques.
To the best of our knowledge, existing literature [11,18,
28,40,62] include a limited number of surveys focusing more
on providing an overview of attack detection techniques.
These surveys do not include details about all deep learning, machine learning, hybrid, and scenario based techniques.
Besides, these surveys lack in providing an extensive discussion about current and future challenges for phishing attack
detection.
Keeping in sight the above limitations, this article makes
the following contributions:
– Provide a comprehensive and easy-to-follow survey
focusing on deep learning, machine learning, hybrid
learning, and scenario-based techniques for phishing
attack detection.
– Provide an extensive discussion on various phishing
attack techniques and comparison of results reported by
various studies.
123
142 A. Basit et al.
Fig. 3 Most targeted industry sectors—3rd quarter 2019 [3]
– Provide an overview of current practices, challenges, and
future research directions for phishing attack detection.
The study is divided into the following sections: Sect. 1
present the introduction of phishing attacks. Section 2
presents the literature survey focusing on deep learning,
machine learning, hybrid learning, and scenario-based phishing attack detection techniques and presents the comparison
of these techniques. Section 3 presents a discussion on various approaches used in literature. Section 4 present the
current and future challenges. Section 5 concludes the paper
with recommendations for future research.
2 Literature survey
This paper explores detailed literature available in prominent
journals, conferences, and chapters. This paper explores relevant articles from Springer, IEEE, Elsevier, Wiley, Taylor
& Francis, and other well-known publishers. This literature
Fig. 4 Taxonomy of this survey focusing on phishing attack detection studies
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 143
review is formulated after an exhaustive search on the existing literature published in the last 10 years.
A phishing attack is one of the most serious threats for any
organization and in this section, we present the work done on
phishing attacks in more depth along with its different types.
Initially, the phishing attacks were performed on telephone
networks also known as Phone Phreaking which is the reason
the term “fishing” was replaced with the term “Phishing”, ph
replaced f in fishing. From the reports of the anti-phishing
working group (APWG) [1], it can be confirmed that phishing was discovered in 1996 when America-on-line (AOL)
accounts were attacked by social engineering. Phishing turns
into a danger to numerous people, especially individuals who
are unaware of the dangers while being in the internet world.
In light of a report created by the Federal Bureau of Investigation (FBI) [4], from October-2013 to February-2016, a
phishing attack caused severe damage of 2.3 billion dollars.
In general, users tend to overlook the URL of a website. At
times, phishing tricks connected through phishing websites
can be effectively prevented by seeing whether a URL is of
phishing or an authentic website. For the situation where a
website is suspected as a targeted phish, a client can escape
from the criminal’s trap.
The conventional approaches for phishing attack detection give low accuracy and can recognize only about 20%
of phishing attacks. Machine learning approaches give good
outcomes for phishing detection but are time-consuming
even on the small-sized datasets and not scale-able. Phishing
recognition by heuristics techniques gives high false-positive
rates. Client mindfulness is a significant issue, for resistance
against phishing attacks. Fake URLs are utilized by phisher,
to catch confidential private data of the targeted victim like
bank account data, personal data, username, secret password,
etc.
Previous work on phishing attack detection has focused
on one or more techniques to improve accuracy however,
accuracy can be further improved by feature reduction and
by using an ensemble model. Existing work done for phishing
attack detection can be placed in four categories:
– Deep learning for phishing attack detection
– Machine learning for phishing attack detection
– Scenario-based phishing attack detection
– Hybrid learning based Phishing attack detection
2.1 Deep learning (DL) for phishing attack detection
This section describes the DL approaches-based intrusion
detection systems. Recent advancements in DL approaches
suggested that the classification of phishing websites using
deep NN should outperform the traditional Machine Learning (ML) algorithms. However, the results of utilizing deep
NN heavily depend on the setting of different learning
parameters [61]. There exist multiple DL approaches used
for cybersecurity intrusion detection [25], namely, (1) deep
neural-network, (2) feed-forward deep neural-network, (3)
recurrent neural-network, (4) convolutional neural-network,
(5) restricted Boltzmann machine, (6) deep belief network,
(7) deep auto-encoder. Figure 5 shows the working of deep
learning models. A batch of input data is fed to the neurons
and assigned some weights to predict the phishing attack or
legitimate traffic.
Authors in Benavides et al. [15] work to incorporate a
combination of each chosen work and the classification. They
characterize the DL calculations chosen in every arrangement, which yielded that the most regularly utilized are the
Deep Neural Network (DNN) and Convolutional Neural Network (CNN) among all. Diverse DL approaches have been
presented and analyzed, but there exists a research gap in the
use of DL calculations in recognition of cyber-attacks.
Authors in Shie [55] worked on the examination of different techniques and talked about different strategies for
precisely recognizing phishing attacks. Of the evaluated
strategies, DL procedures that used feature extraction shows
good performance because of high accuracy, while being
robust. Classifications models also depict good performance.
Authors in Maurya and Jain [46] proposed an anti-phishing
structure that depends on utilizing a phishing identification
model dependent on DL, at the ISP’s level to guarantee security at a vertical scale as opposed to even execution. This
methodology includes a transitional security layer at ISPs
and is set between various workers and end-clients. The proficiency of executing this structure lies in the way that a
solitary purpose of blocking can guarantee a large number
of clients being protected from a specific phishing attack.
The calculation overhead for phishing discovery models is
restricted distinctly to ISPs and end users are granted secure
assistance independent of their framework designs without
highly efficient processing machines.
Authors in Subasi et al. [57] proposed a comparison
of Adaboost and multi boosting for detecting the phishing
website. They used the UCI machine learning repository
dataset having 11,055 instances, and 30 features. AdaBoost
and multi boost are the proposed ensemble learners in this
research to upgrade the presentation of phishing attack calculations. Ensemble models improve the exhibition of the
classifiers in terms of precision, F-measure, and ROC region.
Experimental results reveal that by utilizing ensemble models, it is possible to recognize phishing pages with a precision
of 97.61%. Authors in Abdelhamid et al. [9] proposed a comparison based on model content and features. They used a
dataset from PhishTank, containing around 11,000 examples. They used an approach named enhanced dynamic rule
induction (eDRI) and claimed that dynamic rule induction
(eDRI) is the first algorithm of machine learning and DL
which has been applied to an anti-phishing tool. This algo123
144 A. Basit et al.
Fig. 5 Deep learning for
phishing attack detection
rithm passes datasets with two main threshold frequencies
and rules strength. The training dataset only stores “strong”
features and these features become part of the rule while
others are removed.
Authors in Mao et al. [44] proposed a learning-based system to choose page design comparability used to distinguish
phishing attack pages. for effective page layout features,
they characterized the guidelines and build up a phishing
page classifier with two conventional learning-algorithms,
SVM and DT. They tested the methodology on real website page tests from phishtank.com and alexa.com. Authors
in Jain and Gupta [34] proposed techniques and have performed experiments on more than two datasets. First from
Phishtank containing 1528 phishing websites, second from
Openphish: which contains 613 phishing websites, third from
Alexa: which contains 1600 legitimate websites, fourth from
payment gateway: which contains 66 legitimate websites, and
fifth from top banking website: which contains 252 legitimate websites. By applying machine-learning algorithms,
they improved accuracy for phishing detection. They used
RF, SVM, Neural-Networks (NN), LR, and NB. They used
a feature extraction approach on the client-side.
Authors in Li et al. [42] proposed a novel approach in
which the URL is sent as input and the URL, as well as
HTML related features, are extracted. After feature extraction, a stacking model is used to combine classifiers. They
performed experiments on different datasets: The first one
was obtained from Phishtank, with 2000 web pages (1000
legitimate and 1000 phishing). The second dataset is a larger
one with 49,947 web pages (30,873 legitimate, and 19,074
phishing) and was taken from Alexa. They used a support
vector machine, NN, DT, RF, and combined these through
stacking to achieve better accuracy. This research achieves
good accuracy using different classifiers.
Some studies are limited to few classifiers and some
used many classifiers, but their techniques were not efficient or accurate. Two datasets have been commonly used
by researchers in past and these are publicly accessible from
Phishtank and UCI machine learning repository. ML techniques have been used but without feature reduction, and
some studies used only a few classifiers to compare their
results.
2.2 Machine learning (ML) for phishing attack
detection
ML approaches are popular for phishing websites detection
and it becomes a simple classification problem. To train a
machine learning model for a learning-based detection system, the data at hand must-have features that are related to
phishing and legitimate website classes. Different classifiers
are used to detect a phishing attack. Previous studies show
that detection accuracy is high as robust ML techniques are
used. Several feature selection techniques are used to reduce
features. Figure 6 shows the working of the machine learning
model. A batch of input data is given as input for training to
the machine learning model to predict the phishing attack or
legitimate traffic.
By reducing features, dataset visualization becomes more
efficient and understandable. The most significant classifiers that were used in various studies and are found to give
good phishing attack detection accuracy are C4.5, k-NN, and
SVM. These classifiers are based on DTs such as C4.5, so
it gives the maximum accuracy and efficiency to detect a
phishing attack. To further explore the detection of phishing
attacks, researchers have mentioned the limitations of their
work. Many highlighted a common limitation that ensemble learning techniques are not used, and in some studies,
feature reduction was not done. Authors in James et al. [36]
used different classifiers such as C4.5, IBK, NB, and SVM.
Similarly, authors in Liew et al. [43] used RF to distinguish
phishing attacks from original web pages. Authors in Adebowale et al. [10] used the Adaptive Neuro-Fuzzy Inference
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 145
Fig. 6 Machine learning for phishing attack detection
System based robust scheme using the integrated features for
phishing attack detection and protection.
Authors in Zamir et al. [65] presented an examination
of supervised learning and stacking models to recognize
phishing websites. The rationale behind these experiments
was to improve the classification precision through proposed
features with PCA and the stacking of the most efficient
classifiers. Stacking (RF, NN, stowing) outperformed other
classifiers with proposed features N1 and N2. The experiments were performed on the phishing websites datasets.
The data-set contained 32 pre-processed features with 11,055
websites. Authors in Alsariera et al. [13] used four metastudent models: AdaBoost-Extra Tree (ABET), BaggingExtra tree (BET), Rotation Forest-Extra Tree (RoFBET),
and LogitBoost-Extra Tree (LBET), using the extra-tree
base classifier. The proposed meta-algorithms were fitted for
phishing website datasets, and their performance was tested.
Furthermore, the proposed models beat existing ML-based
models in phishing attack recognition. Thus, they suggest
the appropriation of meta-algorithms when building phishing attack identification models.
Authors in El Aassal et al. [22] proposed a benchmarking structure called PhishBench, which enables us to assess
and analyze the existing features for phishing detection and
completely understand indistinguishable test conditions, i.e.,
unified framework specification, datasets, classifiers, and
performance measurements. The examinations indicated that
the classification execution dropped when the proportion
among phishing and authentic decreases towards 1 to 10.
The decrease in execution extended from 5.9 to 42% in F1-
score. Furthermore, PhishBench was likewise used to test
past techniques on new and diverse datasets.
Authors in Subasi and Kremic [56] proposed an intelligent phishing website identification system. They utilized
unique ML models to classify websites as genuine or phishing. A few classification methods were used to implement
an accurate and smart phishing website detecting structure. ROC area, F-measure, and AUC were used to assess
the performance of ML techniques. Results demonstrated
that Adaboost with SVM performed best among all other
classification techniques achieving the highest accuracy of
97.61%. Authors in Ali and Malebary [12] proposed a phishing website detection technique utilizing Particle Swarm
Optimization (PSO) based component weighting to improve
the detection of phishing websites. Their proposed approach
recommends using PSO to weigh different websites, effectively accomplishing higher accuracy when distinguishing
phishing websites. In particular, the proposed PSO based
website features weighting is utilized to separate different
features in websites, given how significantly these contribute
towards distinguishing the phishing from real websites.
Results showed that the ML models improved with the
proposed PSO-based component weighting to effectively
distinguish, and monitor both phishing and real websites separately.
Authors in James et al. [36] used datasets from Alexa and
Phishtank. Their proposed approach read the URL one by
one and analyze the host-name URL and path to classify into
an attack or legitimate activity using four classifiers: NB, DT,
KNN, and Support Vector Machine (SVM). Authors in Subasi et al. [57] used Artificial Neural Network (ANN), KNN,
SVM, RF, Rotation Forest, and C4.5. They discussed in detail
how these classifiers are very accurate in detecting a phishing
attack. They claim that the accuracy of the RF is not more
than 97.26%. All other classifiers got the same accuracy as
given in the study. Authors in Hutchinson et al. [31] proposed
a study on phishing website detection focusing on features
selection. They used the dataset of the UCI machine learning
repository that contains 11,055 URLs and 30 features and
divided these features into six groups. They selected three
groups and concluded that these groups are suitable options
for accurate phishing attack detection.
123
146 A. Basit et al.
Authors in Abdelhamid et al. [9] creates a method called
Enhanced Dynamic Rule Induction (eDRI) to detect phishing
attacks. They used feature extraction, Remove replace feature
selection technique (RRFST), and ANOVA to reduce features. The results show that they have the highest accuracies
of 93.5% in comparison with other studies. The research [29]
proposed a feature selection technique named as Remove
Replace Feature Selection Technique (RRFST). They claim
that they got the phishing email dataset from the khoonji’s
anti-phishing website containing 47 features. The DT was
used to predict the performance measures.
Authors in Tyagi et al. [58] used a dataset from the
UCI machine learning repository that contains unique 2456
URL instances, and 11,055 total number of URLs that have
6157 phishing websites and 4898 legitimate websites. They
extracted 30 features of URLs and used these features to predict the phishing attack. There were two possible outcomes
whether the user has to be notified that the website is a phishing or aware user that the website is safe. They used ML
algorithms such as DT, RF, Gradient Boosting (GBM), Generalized Linear Model (GLM), and PCA. The authors in Chen
and Chen [17] used the SMOTE method which improves
the detection coverage of the model. They trained machine
learning models including bagging, RF, and XGboost. Their
proposed method achieved the highest accuracy through the
XGboost method. They used the dataset of Phishtank which
has 24,471 phishing websites and 3850 legitimate websites.
Authors in Joshi et al. [39] used a RF algorithm as a
binary classifier and reliefF algorithm for feature selection
algorithm. They used the dataset from the Mendeley website
which is given as input to the feature selection algorithm
to select efficient features. Next, they trained a RF algorithm over the selected features to predict the phishing attack.
Authors in Ubing et al. [59] proposed their work on ensemble
Learning. They used ensemble learning through three techniques that were bagging, boosting, stacking. Their dataset
contains 30 features with a result column of 5126 records.
The dataset is taken from UCI, which is publicly accessible.
They had combined their classifiers to acquire the maximum
accuracy which they got from a DT. Authors in Mao et al. [45]
used different machine learning classifiers that include SVM,
DT, AdaBoost, and RF to predict the phishing attack. Authors
in Sahingoz et al. [54] created their dataset. The dataset contains 73,575 URLs, and out of this 36,400 legitimate URLs
and 37,175 phishing URLs. As they mentioned that Phishtank
doesn’t give a free dataset on the web page therefore they created their dataset. They used seven classification-algorithms
and natural-language-processing (NLP) based features for
phishing attack detection.
Table 1 presents the summary of ML approaches for phishing websites detection. Table shows that some studies provide
highly efficient results for phishing attack detection.
2.3 Scenario-based phishing attack detection
In this section, we provide a comparison of scenario-based
phishing attack detection used by various researchers. The
comparison of scenario-based techniques to detect a phishing attack is shown in Table 2. Studies show that different
scenarios worked with various methods and provides different outcomes.
Authors in Begum and Badugu [14] discussed some
approaches which are useful to detect a phishing attack.
They performed a detailed survey of existing techniques
such as Machine Learning (ML) based approaches, Nonmachine Learning-based approaches, Neural Network-based
approaches, and Behavior-based detection approaches for
phishing attack detection. Authors in Yasin et al. [64] consolidated various studies that researchers have used to clarify
different exercises of social specialists. Moreover, they proposed that a higher comprehension of the social engineering
attack scenarios would be possible utilizing topical and
game-based investigation techniques. The proposed strategy for interpreting social engineering attack scenario is
one such endeavor to empower people to comprehend general attack scenarios. Even though the underlying outcomes
have demonstrated neutral outcomes, the hypothetically predictable system of this strategy despite everything, merits
future augmentation and re-performance.
Authors in Fatima et al. [23] presented PhishI as a precise way to deal with structure genuine games for security
training. They characterize a game structure system that
incorporates the group of information on social networking,
that needs authoritative players. They used stick phishing
as a guide to show how the proposed approach functions,
and afterward assessed the learning impacts of the produced
game dependent on observational information gathered from
the student’s movement. In the PhishI game, members are
needed to trade phishing messages and have the option to
remark on the viability of the attack scenario. Results demonstrated that student’s attention to spear-phishing chances is
improved and that the protection from the first potential attack
is upgraded. Moreover, the game demonstrated a beneficial
outcome on members’ comprehension of extreme online data
and information disclosure.
Authors in Chiew et al. [18] concentrated phishing attacks
in detail through their features of the medium and vector which they live in and their specialized methodologies.
Besides, they accept this information will assist the overall
population by taking preparatory and preventive activities
against these phishing attacks and the policies to execute
approaches to check any further misuse by the phishers. Relying just on client instruction as a preventive measure in a
phishing attack is not sufficient. Their survey shows that the
improvement of clever frameworks to counter these specialized methodologies is required, as such countermeasures will
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 147
Table 1 ML approaches for
phishing websites detection Authors Classification
method
Feature selection
method
Accuracy (%)
James et al. [36] J48, JBK, SVM, NB – 89.75
Abdelhamid et al. [9] eDRI – 93.5
Mao et al. [44] SVM, RF, DT, AB – 97.31
Jain and Gupta [34] – Feature extraction 99.09
Hota et al. [29] CART, C4.5 RRFST 99.11
Ubing et al. [59] EL – 95.4
Chen and Chen [17] ELM, SVM, LR,
C$.5, LC-ELM,
KNN, XGB
ANOVA 99.2
Table 2 Comparison of
scenario based studies Authors Scenarios Method Accuracy
Yao et al. [63] Identity detection
processs
Logo extraction 98.3%
Curtis et al. [21] Dark traid
attacker’s concept
Dark traid –
Williams et al. [62] 62,000 employers
over 6 weeks of
observation
Theoretical
approaches
–
Parsons et al. [52] Worked on 985
participants
ANOVA –
Table 3 Comparison of
scenario based studies Authors Classification
method
Feature selection
method
Accuracy (%)
Subasi et al. [57] ANN, KNN, RF,
SVM, C4.5, RF
– 97.36
Tyagi et al. [58] DT, RF, GBM PCA 98.4
Mao et al. [45] SVM, RF, DT, AB – 97.31
Jagadeesan et al. [33] RF, SVM – 95.11
Joshi et al. [39] RF, RA RA 97.63
Sahingoz et al. [54] SVM, DT, RF,
KNN, KS, NB
NLP 97.98
have the option to recognize and disable both existing attacks
and new phishing dangers.
Authors in Yao et al. [63] used the logo extraction method
by using the identity detection process to detect phishing.
Two non-overlapping datasets were made from a sum of 726
pages. Phishing pages are from the PhishTank website, and
the legitimate website pages are from the Alexa website as
they limited their work by not using the DL technique. The
authors gave the concept of dark triad attackers. Phishing
exertion and execution, and end-users’ arrangement of emails
are the theoretical approach of the dark triad method. They
had limited their work as end-client members may have been
hyper-mindful of potential duplicity and in this way progressively careful in their ratings of each email than they would
be in their normal workplace. Authors in Williams et al. [62]
uses a mixed approach to detect a phishing attack. They
used ensemble learning to investigate 62, 000 instances over
a six-week time frame to detect phishing messages, called
spear phishing. As they had a drawback of just taking information from two organizations, employee observations and
encounters are probably going to be affected by a scope of
components that might be explicit to the association considered.
Authors in Parsons et al. [52] used the method of ANOVA.
In a scenario-based phishing study, they took a total of 985
participants completed to play a role. Two-way repeatedmeasures analysis of variance (ANOVA) was led to survey
the impact of email authenticity and that impact was focused
123
148 A. Basit et al.
Table 4 Comparison of hybrid
methods used in state-of-the-art Authors Classification
method
Accuracy (%)
Patil et al. [53] LR, DT, RF 96.58
Niranjan et al. [48] RC, KNN, IBK, LR, PART 97.3
Chiew et al. [19] RF, C4.5, Part, SVM, NB 96.17
Pandey et al. [50] RF, SVM 94
on the study. This investigation included only one phishing
and one certifiable email with one of the standards and did
not test the impact of numerous standards inside an email.
Following are the comparison of specific classifier known as
RF which is the most used algorithm by the researchers.
Table 3 provides a comparison of RF classifiers with
different datasets and different approaches. Some studies
reduced features without creating a lot of impact on accuracy and the remaining studies focused on accuracy. Authors
in Subasi et al. [57] used different classifiers to detect phishing attacks and they achieved an accuracy of 97.36% by RF
algorithm.
Authors in Tyagi et al. [58] used 30 features to detect
the attack by RF. They used other classifiers as well but
their result on RF was better than other classifiers. Similarly, authors in Mao et al. [45] collected the dataset of 49
phishing websites from PhishinTank.com. They used four
learning classifiers to detect phishing attacks and concluded
that the RF classifiers are much better than others. Authors
in Jagadeesan et al. [33] used two datasets one from UCI
Machine Learning Repository having 30 features and one
target class, containing 2456 instances of phishing and nonphishing URLs. The second dataset comprises of 1353 URLs
with 10 features, grouped into 3 classifications: phishing,
non-phishing and suspicious. They concluded that RF provides better accuracy than that of support vector machine.
Authors in Joshi et al. [39] used the dataset from Mendeley
website which is publicly accessible. The dataset contains
5000 legitimate and 5000 phishing records. Authors in Sahingoz et al. [54] used Ebbu2017 Phishing Dataset containing
73,575 URLs in which 36,400 are legitimate URLs and
37,175 are phishing URLs. They proposed seven different
classification algorithms including Natural Language Processing (NLP) based features. They actually used a dataset
which is not used commonly for detecting phishing attack.
Authors in Williams et al. [62] conducted two studies
considering different aspects of emails. The email that is
received, the person who received that email, and the context
of the email all the theoretical approaches were studied in
this paper. They believe that the current study will provide a
way to theoretical development in this field. They considered
62,000 employers over 6 weeks and observed the individuals and targeted phishing emails known as spear phishing.
Authors in Parsons et al. [52] proposed and worked on 985
participants who completed a role in a scenario-based phishing study. They used two-way repeated-measures analysis of
variance which was named (ANOVA) to assess the effect of
email legitimacy and email influence. The email which was
used in their research indicates that the recipient has previously donated to some charity.
Authors in Yao et al. [63] proposed a methodology which
mainly includes two processes: logon extraction and identity detection. The proposed methodology describes that the
logon extraction extracted the logo from the image from the
two-dimensional code after performing image processing.
Next, the identity detection process assessed the relationship
between the actual identity of the website and it’s described
identity. If the identity is actual then the website is legitimate, if it is not then this is a phishing website. They created
two datasets which are non-overlapping datasets from 726
web pages. The dataset contains phishing web pages and
legitimate web pages. The legitimate pages are taken from
Alexa, whereas the phishing pages are taken from Phishtank. They believe that logo extraction can be improved in
the future. Authors in Curtis et al. [21] introduced the dark
triad attacker’s concepts. They used a dark triad score to complete the 27 items short dark triad with both attackers. The
end-users were asked to participate in the scenario to assign
scores based on psychopathy, narcissism, and Machiavellianism.
2.4 Hybrid learning (HL) based phishing attack
detection
In this section, we present the comparison of HL models
which are used by state-of-the-art studies as shown in Tables
4 and 5 The studies show how the accuracies got improved
by ensemble and HL techniques.
Authors in Kumar et al. [41] separated some irrelevant
features from the content and pictures and applied SVM as
a binary classifier. They group the real and phished messages with strategies like Text parsing, word tokenization,
and stop word evacuation. The authors in Jain et al. [35] utilized TF-IDF to locate the most significant features of the
website to be used in the search question, yet it has been well
adjusted to improve execution. The proposed approach has
been discovered to be more accurate for their methodology
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 149
against existing techniques utilizing the traditional TF-IDF
approach.
Authors in Adebowale et al. [10] proposed a hybrid
approach comprising Search and Heuristic Rule and Logistic Regression (SHLR) for efficient phishing attack detection.
Authors proposed three steps approach: (1) the most of website shown in the result of a search query is legal if the
web page domain matches the domain name of the websites retrieved in results against the query, (2) the heuristic
rules defined by the character features (3) an ML model to
predict the web page to be either a legal web page or a phishing attack. Authors in Patil et al. [53] used LR, DT, and RF
techniques to detect a phishing attack, and they believe the
RF is a much-improved way to detect the attack. The drawback of this system is detecting some minimal false-positive
and false-negative results. Authors in Niranjan et al. [48]
used the UCI dataset on phishing containing 6157 legitimate and 4898 phishing instances out of a total of 11,055
instances. The EKRV model was used that involves a combination of KNN and random committee techniques. Authors
in Chiew et al. [19] used two datasets one from 5000 phishing web-pages based on URLs from PhishTank and second
OpenPhish. Another 5000 legitimate web-pages were based
on URLs from Alexa and the Common Crawl5 archive. They
used Hybrid Ensemble Strategy. Authors in Pandey et al. [50]
used a dataset from the Website phishing dataset, available
online in a repository of the University of California. This
dataset has 10 features and 1353 instances. They trained an
RF-SVM hybrid model that achieved an accuracy of 94%.
Authors in Niranjan et al. [48] proposed an ensemble technique through the voting and stacking method. They selected
the UCI ML phishing dataset and take only 23 features out
of 30 features for further attack detection. Out of a total of
11,055 instances, the dataset has 6157 legitimate and 4898
phishing instances. They used the EKRV model to predict the
phishing attack. Authors in Patil et al. [53] proposed a hybrid
solution that uses three approaches: blacklist and whitelist,
heuristics, and visual similarity. The proposed methodology
monitors all traffic on the end-user system and compares
each URL with the white list of trusted domains. The website analyzes various details for features. The three outcomes
are suspicious websites, phishing websites, and legitimate
websites. The ML classifier is used to collect data and to
generate a score. If the score is greater than the threshold,
then they marked the URL as a phishing attack and immediately blocked it. They used LR, DT, and RF to predict the
accuracy of their test websites.
Authors in Jagadeesan et al. [33] utilized RF and SVM to
detect phishing attacks. They used two types of datasets the
first one is from the UCI machine learning repository which
has 30 features. This dataset consists of 2456 entries of phishing and non-phishing URLs. The second dataset consists
of 1353 URLs which has 10 features and three categories:
Phishing, non-Phishing, and suspicious. Authors in Pandey
et al. [50] used the dataset of a repository of the University of
California. The dataset has 10 features and 1353 instances.
They trained a hybrid model comprising RF and SVM which
they utilize to predict the accuracy.
3 Discussion
Phishing is a deceitful attempt to obtain sensitive data using
social networking approaches, for example, usernames and
passwords in an endeavor to deceive website users and getting their sensitive credentials [24]. Phishers prey on human
emotion and the urge to follow instructions in a flow. Phishing is so omnipresent in the internet world that it has become
a constant threat. In phishing, the biggest challenge is that
the attackers are continuously devising new approaches to
deceive clients such that they fall prey to their phishing traps.
A comparative study of previous works using different
approaches is discussed in the above section with details.
Machine learning based approaches, deep learning based
approaches, scenario-based approaches, and hybrid techniques are deployed in past to tackle this problem. A detailed
comparative analysis revealed that machine learning methods
are the most frequently used and effective methods to detect
a phishing attack. Different classification methods such as
SVM, RF, ANN, C4.5, k-NN, DT have been used. Techniques
with feature reduction give better performance. Classification is done through ELM, SVM, LR, C4.5, LC-ELM, kNN,
XGB, and feature selection with ANOVA detected phishing attack with 99.2% accuracy, which is highest among all
methods proposed so far but with trade-offs in terms of computational cost.
The RF method gives the best performance with the highest accuracy among any other classification methods on
different datasets. Several studies proved that more than 95%
attack detection accuracy can be achieved using a RF classification method. UCI machine learning dataset is the common
dataset that has been used by researchers for phishing attack
detection in past.
In various studies, the researchers also created a scenariobased environment to detect phishing attacks but these
solutions are only applicable for a particular environment.
Individual users in each organization exhibit different behaviors and individuals in the organization are sometimes aware
of the scenarios. The hybrid learning approach is another
way to detect phishing attacks as it occasionally gave better
accuracy than that of a RF. Researchers are of the view that
some ensemble models can further improve performance.
Nowadays phishing attacks defense is probably considered a hard job by system security experts. With low false
positives, a feasible detection system should be there to identify phishing attacks. The defense approaches talked about
123
150 A. Basit et al.
Table 5 Comparison table of
state-of-the-art studies focusing
on phishing techniques
Authors Classification Feature selection
technique
Accuracy
James et al. [36] J48, IBK, SVM, NB – 89.75%
Subasi et al. [57] ANN, kNN, RF,
SVM, C4.5, RF
– 97.36%
Abdelhamid et al. [9] eDRI – 93.5%
Mao et al. [44] SVM, DT – 93%
Jain and Gupta [34] – – 99.09%
Yao et al. [63] – – 98.3%
Patil et al. [53] LR, DT, RF – 96.58%
Jagadeesan et al. [33] RF, SVM – 95.11%
Hota et al. [29] CART, C4.5 RRFST 99.11%
Tyagi et al. [58] DT, RF, GBM PCA 98.40%
Curtis et al. [21]– – –
Sahingoz et al. [54] SVM, DT, RF,
kNN, KS, NB
NLP 97.98%
Parsons et al. [52]– – –
Joshi et al. [39] RF, RA RA 97.63%
Ubing et al. [59] EL – 95.4%
Mao et al. [45] SVM, RF, DT, AB – 97.31%
Williams et al. [62]– – –
Niranjan et al. [48] RC, kNN, IBK,
LR, PART
– 97.3%
Chen and Chen [17] ELM, SVM, LR,
C4.5, LC-ELM,
kNN, XGB
ANOVA 99.2%
Chiew et al. [19] RF, C4.5, PART,
SVM, NB
– 96.17%
Pandey et al. [50] SVM, RF – 94%
so far are based on machine learning and deep learning
algorithms. Besides having high computational costs, these
methods have high false-positive rates; however, better at
distinguishing phishing attacks. The machine learning techniques provide the best results when compared with other
different approaches. The most effective defense for phishing attacks is an educated and well aware employee. But
still, people are people with their built features of curiosity.
They have a thirst to explore and know more. To mitigate
the risks of falling victim to phishing tricks, organizations
should try to keep employees away from their inherent core
processes and make them develop a mindset that will abstain
from clicking suspicious links and webpages.
4 Current practices and future challenges
A phishing attack is still considered a fascinating form of
attack to lure a novice internet user to pass his/her private
confidential data to the attackers. There are different measures available, yet at whatever point a solution is proposed to
overcome these attacks, attackers consider the vulnerabilities
of that solution to continue with their attacks. Several solutions to control phishing attacks have been proposed in past.
A recent increase in the number of phishing attacks linked
to COVID-19 performed between March 1 and March 23,
2020, and attacks performed on online collaboration tools
(ZOOM, Microsoft Teams, etc.) has led researchers to pay
more attention in this research domain. Most of the working
be it at government or the corporate level, educational activities, businesses, as well as non-commercial activities, have
switched online from the traditional on-premises approach.
More users are relying on the web to perform their routine work. This has increased the importance of having a
comprehensive phishing attack detection solution with better accuracy and better response time [6–8].
The conventional approaches for phishing attack detection are not accurate and can recognize only about 20% of
phishing attacks. ML approaches give better results but with
scalability trade-off and time-consuming even on the smallsized datasets. Phishing detection by heuristics techniques
gives high false-positive rates. User cautiousness is a key
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 151
requirement to prevent phishing attacks. Besides educating
the client regarding safe browsing, some changes can be done
in the user interfaces such as giving dynamic warnings and
consequently identifying malicious emails. As the classified
resources are accessible to the IoT gadgets, but their security
architectures and features are not mature so far which makes
them an exceptionally obvious target for the attackers.
Phishing is a door for all kinds of malware and ransomware. Malware attacks on organizations use ransomware
and ransomware operators demand heavy amount as ransom in exchange for not disclosing stolen data which is a
recent trend in 2020. Phishing scams in 2020 are deliberately
impersonating COVID-19 and healthcare-related organizations and individuals by exploiting the unprepared users. It
is better to safeguard doors at our ends and be proactive in
defense rather than thinking about reactive strategies to combat once a phishing attack has happened.
Fake websites with phishing appear to be original but it
is hard to identify as attackers imitate the appearance and
functionality of real websites. Prevention is better than cure
so there is a need for anti-phishing frameworks or plug-ins
with web browsers. These plug-ins or frameworks may perform content filtering and identify as well as block suspected
phishing websites to proceed further. An automated reporting
feature can be added that can report phishing attacks to the
organization from the user’s end such as a bank, government
organization, etc. The time lost on remediation after a phishing attack can have a damaging impact on the productivity
and profitability of businesses. In the current scenario, organizations need to provide their employees with awareness
and feasible solutions to detect and report phishing attacks
proactively and promptly before it causes any harm.
In the future, an all-inclusive phishing attack detection
solution can be designed to identify, report, and block malicious web websites without the user’s involvement. If a
website is asking for login credentials or sensitive information, a framework or smart web plug-in solution should be
responsible to ensure the website is legitimate and inform the
owner (organization, business, etc.) beforehand. Web pages
health checking during user browsing has become a need of
the time and a scalable, as well as a robust solution, is needed.
5 Conclusion
This survey enables researchers to comprehend the various
methods, challenges, and trends for phishing attack detection.
Nowadays, prevention from phishing attacks is considered a
tough job in the system security domain. An efficient detection system ought to have the option to identify phishing
attacks with low false positives. The protection strategies
talked about in this paper are data mining and heuristics,
ML, and deep learning algorithms. With high computational
expenses, heuristic and data mining methods have high FP
rates, however better at distinguishing phishing attacks. The
ML procedures give the best outcomes when contrasted with
different strategies. A portion of the ML procedures can identify TP up to 99%. As malicious URLs are created every
other day and the attackers are using techniques to fool users
and modify the URLs to attack. Nowadays deep learning
and machine learning methods are used to detect a phishing attack. classification methods such as RF, SVM, C4.5,
DT, PCA, k-NN are also common. These methods are most
useful and effective for detecting the phishing attack. Future
research can be done for a more scalable and robust method
including the smart plugin solutions to tag/label if the website
is legitimate or leading towards a phishing attack.
References
1. (2016). Apwg trend report. http://docs.apwg.org/reports/apwg_
trends_report_q4_2016.pdf. Accessed from 20 July 2020
2. (2018) Phishing activity trends report. http://docs.apwg.org/
reports/apwg_trends_report_q2_2018.pdf. Accessed from 20 July
2020
3. (2019) Apwg trend report. https://docs.apwg.org/reports/apwg_
trends_report_q3_2019.pdf. Accessed from 20 July 2020
4. (2019) Fbi warns of dramatic increase in business e-mail
compromise (bec) schemes—fbi. https://www.fbi.gov/contactus/field-offices/memphis/news/press-releases/fbi-warns-ofdramatic-increase-in-business-e-mail-compromise-bec-schemes.
Accessed from 20 July 2020
5. (2019) What is phishing? https://www.phishing.org/what-isphishing. Accessed from 20 July 2020
6. (2020) Coronavirus-related spear phishing attacks see 667%
increase. https://www.securitymagazine.com/articles/92157-
coronavirus-related-spear-phishing-attacks-see-667-increase-inmarch-2020. Accessed from 20 July 2020
7. (2020) Cost of black market phishing kits soars 149% in 2019.
https://www.infosecurity-magazine.com/news/black-phishingkits/. Accessed from 20 July 2020
8. (2020) Recent phishing attacks. https://www.infosec.gov.hk/
english/anti/recent.html. Accessed from 20 July 2020
9. Abdelhamid, N., Thabtah, F., Abdel-jaber, H. (2017). Phishing
detection: A recent intelligent machine learning comparison based
on models content and features. In 2017 IEEE international conference on intelligence and security informatics (ISI) (pp. 72–77).
IEEE.
10. Adebowale, M. A., Lwin, K. T., Sanchez, E., & Hossain, M. A.
(2019). Intelligent web-phishing detection and protection scheme
using integrated features of images, frames and text.Expert Systems
with Applications, 115, 300–313.
11. Aleroud, A., & Zhou, L. (2017). Phishing environments, techniques, and countermeasures: A survey. Computers and Security,
68, 160–196.
12. Ali, W., & Malebary, S. (2020). Particle swarm optimization-based
feature weighting for improving intelligent phishing website detection. IEEE Access, 8, 116766–116780.
13. Alsariera, Y. A., Adeyemo, V. E., Balogun, A. O., & Alazzawi,
A. K. (2020). Ai meta-learners and extra-trees algorithm for the
detection of phishing websites. IEEE Access, 8, 142532–142542.
14. Begum, A., & Badugu, S. (2020). A study of malicious url detection
using machine learning and heuristic approaches. In Advances in
123
152 A. Basit et al.
decision sciences, security and computer vision, image processing
(pp. 587–597). Berlin: Springer.
15. Benavides, E., Fuertes, W., Sanchez, S., & Sanchez, M. (2020).
Classification of phishing attack solutions by employing deep
learning techniques: A systematic literature review. In Developments and advances in defense and security (pp. 51–64). Springer.
16. Cabaj, K., Domingos, D., Kotulski, Z., & Respício, A. (2018).
Cybersecurity education: Evolution of the discipline and analysis
of master programs. Computers and Security, 75, 24–35.
17. Chen, Y. H., & Chen, J. L. (2019). [email protected] ntiphish—machine learning mechanisms for cyber-phishing attack. IEICE Transactions on
Information and Systems, 102(5), 878–887.
18. Chiew, K. L., Yong, K. S. C., & Tan, C. L. (2018). A survey of
phishing attacks: Their types, vectors and technical approaches.
Expert Systems with Applications, 106, 1–20.
19. Chiew, K. L., Tan, C. L., Wong, K., Yong, K. S., & Tiong, W.
K. (2019). A new hybrid ensemble feature selection framework
for machine learning-based phishing detection system.Information
Sciences, 484, 153–166.
20. Conklin, W. A., Cline, R. E., & Roosa, T. (2014). Re-engineering
cybersecurity education in the us: An analysis of the critical factors.
In 2014 47th Hawaii international conference on system sciences
(pp. 2006–2014). IEEE.
21. Curtis, S. R., Rajivan, P., Jones, D. N., & Gonzalez, C. (2018).
Phishing attempts among the dark triad: Patterns of attack and vulnerability. Computers in Human Behavior, 87, 174–182.
22. El Aassal, A., Baki, S., Das, A., & Verma, R. M. (2020). An indepth benchmarking and evaluation of phishing detection research
for security needs. IEEE Access, 8, 22170–22192.
23. Fatima, R., Yasin, A., Liu, L., & Wang, J. (2019). How persuasive is
a phishing email? A phishing game for phishing awareness. Journal
of Computer Security, 27(6), 581–612.
24. Feng, Q., Tseng, K. K., Pan, J. S., Cheng, P., & Chen, C. (2011).
New anti-phishing method with two types of passwords in openid
system. In 2011 Fifth international conference on genetic and evolutionary computing (pp. 69–72). IEEE.
25. Ferrag, M. A., Maglaras, L., Moschoyiannis, S., & Janicke, H.
(2020). Deep learning for cyber security intrusion detection:
Approaches, datasets, and comparative study. Journal of Information Security and Applications, 50, 102419.
26. Forecast. (2017). Global fraud and cybercrime forecast. https://
rsa.com/en-us/blog/2016-12/2017-global-fraud-cybercrime-fore
cast. Accessed from 20 July 2020
27. Gupta, B. B., Tewari, A., Jain, A. K., & Agrawal, D. P. (2017).
Fighting against phishing attacks: State of the art and future challenges. Neural Computing and Applications, 28(12), 3629–3654.
28. Gupta, B. B., Arachchilage, N. A., & Psannis, K. E. (2018). Defending against phishing attacks: Taxonomy of methods, current issues
and future directions. Telecommunication Systems, 67(2), 247–
267.
29. Hota, H., Shrivas, A., & Hota, R. (2018). An ensemble model for
detecting phishing attack with proposed remove-replace feature
selection technique. Procedia Computer Science, 132, 900–907.
30. Hulten, G. J., Rehfuss, P. S., Rounthwaite, R., Goodman, J. T.,
Seshadrinathan, G., Penta, A. P., Mishra, M., Deyo, R. C., Haber,
E. J., & Snelling, D. A. W. et al. (2014). Finding phishing sites. US
Patent 8,839,418.
31. Hutchinson, S., Zhang, Z., & Liu, Q. (2018). Detecting phishing websites with random forest. In International conference on
machine learning and intelligent communications (pp. 470–479).
Springer.
32. Iwendi, C., Jalil, Z., Javed, A. R., Reddy, T., Kaluri, R., Srivastava,
G., et al. (2020). Keysplitwatermark: Zero watermarking algorithm for software protection against cyber-attacks. IEEE Access,
8, 72650–72660.
33. Jagadeesan, S., Chaturvedi, A., & Kumar, S. (2018). Url phishing
analysis using random forest. International Journal of Pure and
Applied Mathematics, 118(20), 4159–4163.
34. Jain, A. K., & Gupta, B. B. (2018). Towards detection of phishing
websites on client-side using machine learning based approach.
Telecommunication Systems, 68(4), 687–700.
35. Jain, A. K., Parashar, S., Katare, P., & Sharma, I. (2020). Phishskape: A content based approach to escape phishing attacks.
Procedia Computer Science, 171, 1102–1109.
36. James, J., Sandhya, L., & Thomas, C. (2013). Detection of phishing urls using machine learning techniques. In 2013 International
conference on control communication and computing (ICCC) (pp.
304–309). IEEE.
37. Javed, A. R., Jalil, Z., Moqurrab, S. A., Abbas, S., & Liu, X. (2020).
Ensemble adaboost classifier for accurate and fast detection of
botnet attacks in connected vehicles. Transactions on Emerging
Telecommunications Technologies.
38. Javed, A. R., Usman, M., Rehman, S. U., Khan, M. U., & Haghighi,
M. S. (2020). Anomaly detection in automated vehicles using
multistage attention-based convolutional neural network. IEEE
Transactions on Intelligent Transportation Systems, pp. 1–10.
39. Joshi, A., Pattanshetti, P., & Tanuja, R. (2019). Phishing attack
detection using feature selection techniques. In International conference on communication and information processing (ICCIP),
Nutan College of Engineering and Research.
40. Khonji, M., Iraqi, Y., & Jones, A. (2013). Phishing detection: A
literature survey. IEEE Communications Surveys and Tutorials,
15(4), 2091–2121.
41. Kumar, A., Chatterjee, J. M., & Díaz, V. G. (2020). A novel hybrid
approach of svm combined with nlp and probabilistic neural network for email phishing. International Journal of Electrical and
Computer Engineering, 10(1), 486.
42. Li, Y., Yang, Z., Chen, X., Yuan, H., & Liu, W. (2019). A stacking
model using url and html features for phishing webpage detection.
Future Generation Computer Systems, 94, 27–39.
43. Liew, S. W., Sani, N. F. M., Abdullah, M. T., Yaakob, R., & Sharum,
M. Y. (2019). An effective security alert mechanism for real-time
phishing tweet detection on twitter. Computers and Security, 83,
201–207.
44. Mao, J., Bian, J., Tian, W., Zhu, S., Wei, T., Li, A., et al. (2018).
Detecting phishing websites via aggregation analysis of page layouts. Procedia Computer Science, 129, 224–230.
45. Mao, J., Bian, J., Tian, W., Zhu, S., Wei, T., Li, A., et al. (2019).
Phishing page detection via learning classifiers from page layout
feature. EURASIP Journal on Wireless Communications and Networking, 2019(1), 43.
46. Maurya, S., & Jain, A. (2020). Deep learning to combat phishing.
Journal of Statistics and Management Systems, pp. 1–13.
47. Mittal, M., Iwendi, C., Khan, S., & Rehman Javed, A. (2020).
Analysis of security and energy efficiency for shortest route discovery in low-energy adaptive clustering hierarchy protocol using
Levenberg–Marquardt neural network and gated recurrent unit for
intrusion detection system. Transactions on Emerging Telecommunications Technologies, p. e3997.
48. Niranjan, A., Haripriya, D., Pooja, R., Sarah, S., Shenoy, P. D., &
Venugopal, K. (2019). Ekrv: Ensemble of knn and random committee using voting for efficient classification of phishing. In Progress
in advanced computing and intelligent engineering (pp. 403–414).
Springer.
49. Ollmann, G. (2004). The phishing guide understanding and preventing phishing attacks. NGS Software Insight Security Research.
50. Pandey, A., Gill, N., Nadendla, K. S. P., & Thaseen, I. S. (2018).
Identification of phishing attack in websites using random forestsvm hybrid model. In International conference on intelligent
systems design and applications (pp. 120–128). Springer.
123
A comprehensive survey of AI-enabled phishing attacks detection techniques 153
51. Parekh, S., Parikh, D., Kotak, S., & Sankhe, S. (2018). A new
method for detection of phishing websites: Url detection. In 2018
Second international conference on inventive communication and
computational technologies (ICICCT) (pp. 949–952). IEEE.
52. Parsons, K., Butavicius, M., Delfabbro, P., & Lillie, M. (2019).
Predicting susceptibility to social influence in phishing emails.
International Journal of Human-Computer Studies, 128, 17–26.
53. Patil, V., Thakkar, P., Shah, C., Bhat, T., & Godse, S. (2018). Detection and prevention of phishing websites using machine learning
approach. In 2018 Fourth international conference on computing communication control and automation (ICCUBEA) (pp. 1–5).
IEEE.
54. Sahingoz, O. K., Buber, E., Demir, O., & Diri, B. (2019). Machine
learning based phishing detection from urls. Expert Systems with
Applications, 117, 345–357.
55. Shie, E. W. S. (2020). Critical analysis of current research aimed
at improving detection of phishing attacks. Selected computing
research papers, p. 45.
56. Subasi, A., & Kremic, E. (2020). Comparison of adaboost with
multiboosting for phishing website detection. Procedia Computer
Science, 168, 272–278.
57. Subasi, A., Molah, E., Almkallawi, F., & Chaudhery, T. J. (2017).
Intelligent phishing website detection using random forest classifier. In 2017 International conference on electrical and computing
technologies and applications (ICECTA) (pp. 1–5). IEEE.
58. Tyagi, I., Shad, J., Sharma, S., Gaur, S., & Kaur, G. (2018). A novel
machine learning approach to detect phishing websites. In 2018
5th International conference on signal processing and integrated
networks (SPIN) (pp. 425–430). IEEE.
59. Ubing, A. A., Jasmi, S. K. B., Abdullah, A., Jhanjhi, N., & Supramaniam, M. (2019). Phishing website detection: An improved
accuracy through feature selection and ensemble learning. International Journal of Advanced Computer Science and Applications,
10(1), 252–257.
60. Volkamer, M., Renaud, K., Reinheimer, B., & Kunz, A. (2017).
User experiences of torpedo: Tooltip-powered phishing email
detection. Computers and Security, 71, 100–113.
61. Vrbanˇciˇc, G., Fister Jr, I., & Podgorelec, V. (2018). Swarm intelligence approaches for parameter setting of deep learning neural
network: Case study on phishing websites classification. In Proceedings of the 8th international conference on web intelligence,
mining and semantics (pp. 1–8).
62. Williams, E. J., Hinds, J., & Joinson, A. N. (2018). Exploring susceptibility to phishing in the workplace. International Journal of
Human-Computer Studies, 120, 1–13.
63. Yao, W., Ding Y., & Li, X. (2018). Logophish: A new twodimensional code phishing attack detection method. In 2018
IEEE international conference on parallel and distributed processing with applications, ubiquitous computing and communications, big data and cloud computing, social computing
and networking, sustainable computing and communications
(ISPA/IUCC/BDCloud/SocialCom/SustainCom) (pp. 231–236).
IEEE.
64. Yasin, A., Fatima, R., Liu, L., Yasin, A., & Wang, J. (2019).
Contemplating social engineering studies and attack scenarios: A
review study. Security and Privacy, 2(4), e73.
65. Zamir, A., Khan, H. U., Iqbal, T., Yousaf, N., Aslam, F., Anjum,
A., et al. (2020). Phishing web site detection using diverse machine
learning algorithms. The Electronic Library.
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Abdul Basit is a student at the Department of Computer Science, Air
University, Islamabad, Pakistan. He is currently pursuing his degree in
Masters of Science in Computer Science from Air University, Islamabad, Pakistan. His current research interests include but are not limited to cyber security, artificial intelligence, computer vision, network
security, IoT, smart city, and application development for smart living.
He aims to contribute to interdisciplinary research of computer science
and human-related disciplines.
Maham Zafar is a student at the Department of Computer Science, Air
University, Islamabad, Pakistan. She is currently pursuing his degree
in Masters of Science in Computer Science from Air University, Islamabad, Pakistan. Her current research interests include but are not limited to cyber security, artificial intelligence, computer vision, network
security, IoT, smart city, and application development for smart living.
Xuan Liu (MIEEE’17) graduated from Shandong University, China,
and received M.S. degree from Wuhan Polytechnic University, China
and Ph.D. degree in computer science and engineering from Southeast
University, China. Since 2020, he joined Yangzhou University, China.
He is serving as an Advisory Editor of Wiley Engineering Reports, an
Associate Editor of Springer Telecommunication Systems, IET Smart
Cities, Taylor and Francis International Journal of Computers and
Applications and KeAi International Journal of Intelligent Networks,
an Area Editor of EAI Endorsed Transactions on Internet of Things,
the Lead Guest Editor of Elsevier Internet of Things, Wiley Transactions on Emerging Telecommunications Technologies and Wiley Internet Technology Letters, and the Chair of CollaborateCom 2020 workshop. He served(s) as a TPC Member of ACM MobiCom 2020 workshop, IEEE INFOCOM 2020 workshop, IEEE ICC 2021/2020/2019,
IEEE GlobeCom 2020/2019, IEEE WCNC 2021, IFIP/IEEE IM 2021,
IEEE PIMRC 2020/2019, IEEE MSN 2020, IEEE VTC 2020/2019/
2018, IEEE ICIN2020, IEEE GIIS 2020, IEEE DASC 2019, APNOMS
2020/2019, AdHoc-Now2020, FNC 2020/2019, EAI CollaborateCom
2020/2019, and EAI ChinaCom 2019, etc. Furthermore, he served as
a Reviewer for 20+ reputable conferences/journals including IEEE
INFOCOM, IEEE ICC, IEEE GlobeCom, IEEE WCNC, IEEE PIMRC,
IEEE COMMAG, IEEE TII, IEEE IoT, IEEE CL, Elsevier JNCA,
Elsevier FGCS, Springer WINE, Springer TELS, IET SMC, EAI CollaborateCom, and Wiley IJCS, etc. His main research interests focus
on UAVs-enabled collaborative networking techniques.
Abdul Rehman Javed is currently a lecturer at the Department of
Cyber Security, Air University, Islamabad, Pakistan. He worked with
National Cyber Crimes and Forensics Laboratory, Air University,
Islamabad, Pakistan. He received his Master’s degree in Computer
Science from the National University of Computer and Emerging Sciences, Islamabad, Pakistan and bachelor’s degree in Computer Science from the COMSATS university Islamabad (Sahiwal campus).
He is a reviewer of many well-known journals, including, Sustainable cities and society (Elsevier), Journal of Information Security and
Applications (Elsevier), IEEE Internet of Things Magazine, Transactions on Internet Technology (ACM), Telecommunication Systems
(Springer), IEEE Access and International Journal of Ad Hoc and
Ubiquitous Computing (Inderscience). His current research interests
include but are not limited to mobile and ubiquitous computing, data
analysis, knowledge discovery, data mining, natural language processing, smart homes, and their applications in human activity analysis,
human motion analysis, and e-health. He aims to contribute to interdisciplinary research of computer science and human-related disciplines.
123
154 A. Basit et al.
He has authored more than over 10 peer–reviewed articles on topics
related to cybersecurity, mobile computing, and digital forensics.
Zunera Jalil is currently engaged as faculty with the Department of
Cyber Security, Faculty of Computing and Artificial Intelligence and
as an investigator with National Cybercrimes and Forensics Laboratory, Air University, Islamabad, Pakistan. She earned her PhD degree
in Computer Science from FAST National University of Computer and
Emerging Sciences, Islamabad, Pakistan in 2010 winning scholarship
from Higher Education Commission of Pakistan. She has been working as faculty with International Islamic University, Islamabad; Iqra
University, Islamabad; and Saudi Electronic University, Riyadh, Saudi
Arabia since then. She is reviewer and editor of multiple renowned
international journals in computing and cyber security domain. She
has delivered guest talks at numerous national and international forums
in past. Her current research interests include but are not limited to
computer forensics, cyber-attacks detection using deep learning, intelligent systems, criminal profiling, and data privacy protection.
Kashif Kifayat received the Ph. D. degree in cyber security from Liverpool John Moores University, Liverpool, U.K., in 2008. He is currently
a Professor and the Chair of the Cyber Security Department, Air University, Islamabad, Pakistan. He is highly skilled in Machine Learning,
Matlab, Deep Learning, Algorithms, Big Data Analytics, Data Science, C++, Python, R, and LaTeX. Being a part of National Center of
Cyber Security, he is highly engaged in Mobile forensics.
123


Get Professional Assignment Help Cheaply

Buy Custom Essay

Don't use plagiarized sources. Get Your Custom Essay on
A comprehensive survey of AI-enabled phishing attacks detection techniques
Just from $10/Page
Order Essay

Are you busy and do not have time to handle your assignment? Are you scared that your paper will not make the grade? Do you have responsibilities that may hinder you from turning in your assignment on time? Are you tired and can barely handle your assignment? Are your grades inconsistent?

Whichever your reason is, it is valid! You can get professional academic help from our service at affordable rates. We have a team of professional academic writers who can handle all your assignments.

Why Choose Our Academic Writing Service?

  • Plagiarism free papers
  • Timely delivery
  • Any deadline
  • Skilled, Experienced Native English Writers
  • Subject-relevant academic writer
  • Adherence to paper instructions
  • Ability to tackle bulk assignments
  • Reasonable prices
  • 24/7 Customer Support
  • Get superb grades consistently
 

Online Academic Help With Different Subjects

Literature

Students barely have time to read. We got you! Have your literature essay or book review written without having the hassle of reading the book. You can get your literature paper custom-written for you by our literature specialists.

Finance

Do you struggle with finance? No need to torture yourself if finance is not your cup of tea. You can order your finance paper from our academic writing service and get 100% original work from competent finance experts.

Computer science

Computer science is a tough subject. Fortunately, our computer science experts are up to the match. No need to stress and have sleepless nights. Our academic writers will tackle all your computer science assignments and deliver them on time. Let us handle all your python, java, ruby, JavaScript, php , C+ assignments!

Psychology

While psychology may be an interesting subject, you may lack sufficient time to handle your assignments. Don’t despair; by using our academic writing service, you can be assured of perfect grades. Moreover, your grades will be consistent.

Engineering

Engineering is quite a demanding subject. Students face a lot of pressure and barely have enough time to do what they love to do. Our academic writing service got you covered! Our engineering specialists follow the paper instructions and ensure timely delivery of the paper.

Nursing

In the nursing course, you may have difficulties with literature reviews, annotated bibliographies, critical essays, and other assignments. Our nursing assignment writers will offer you professional nursing paper help at low prices.

Sociology

Truth be told, sociology papers can be quite exhausting. Our academic writing service relieves you of fatigue, pressure, and stress. You can relax and have peace of mind as our academic writers handle your sociology assignment.

Business

We take pride in having some of the best business writers in the industry. Our business writers have a lot of experience in the field. They are reliable, and you can be assured of a high-grade paper. They are able to handle business papers of any subject, length, deadline, and difficulty!

Statistics

We boast of having some of the most experienced statistics experts in the industry. Our statistics experts have diverse skills, expertise, and knowledge to handle any kind of assignment. They have access to all kinds of software to get your assignment done.

Law

Writing a law essay may prove to be an insurmountable obstacle, especially when you need to know the peculiarities of the legislative framework. Take advantage of our top-notch law specialists and get superb grades and 100% satisfaction.

What discipline/subjects do you deal in?

We have highlighted some of the most popular subjects we handle above. Those are just a tip of the iceberg. We deal in all academic disciplines since our writers are as diverse. They have been drawn from across all disciplines, and orders are assigned to those writers believed to be the best in the field. In a nutshell, there is no task we cannot handle; all you need to do is place your order with us. As long as your instructions are clear, just trust we shall deliver irrespective of the discipline.

Are your writers competent enough to handle my paper?

Our essay writers are graduates with bachelor's, masters, Ph.D., and doctorate degrees in various subjects. The minimum requirement to be an essay writer with our essay writing service is to have a college degree. All our academic writers have a minimum of two years of academic writing. We have a stringent recruitment process to ensure that we get only the most competent essay writers in the industry. We also ensure that the writers are handsomely compensated for their value. The majority of our writers are native English speakers. As such, the fluency of language and grammar is impeccable.

What if I don’t like the paper?

There is a very low likelihood that you won’t like the paper.

Reasons being:

  • When assigning your order, we match the paper’s discipline with the writer’s field/specialization. Since all our writers are graduates, we match the paper’s subject with the field the writer studied. For instance, if it’s a nursing paper, only a nursing graduate and writer will handle it. Furthermore, all our writers have academic writing experience and top-notch research skills.
  • We have a quality assurance that reviews the paper before it gets to you. As such, we ensure that you get a paper that meets the required standard and will most definitely make the grade.

In the event that you don’t like your paper:

  • The writer will revise the paper up to your pleasing. You have unlimited revisions. You simply need to highlight what specifically you don’t like about the paper, and the writer will make the amendments. The paper will be revised until you are satisfied. Revisions are free of charge
  • We will have a different writer write the paper from scratch.
  • Last resort, if the above does not work, we will refund your money.

Will the professor find out I didn’t write the paper myself?

Not at all. All papers are written from scratch. There is no way your tutor or instructor will realize that you did not write the paper yourself. In fact, we recommend using our assignment help services for consistent results.

What if the paper is plagiarized?

We check all papers for plagiarism before we submit them. We use powerful plagiarism checking software such as SafeAssign, LopesWrite, and Turnitin. We also upload the plagiarism report so that you can review it. We understand that plagiarism is academic suicide. We would not take the risk of submitting plagiarized work and jeopardize your academic journey. Furthermore, we do not sell or use prewritten papers, and each paper is written from scratch.

When will I get my paper?

You determine when you get the paper by setting the deadline when placing the order. All papers are delivered within the deadline. We are well aware that we operate in a time-sensitive industry. As such, we have laid out strategies to ensure that the client receives the paper on time and they never miss the deadline. We understand that papers that are submitted late have some points deducted. We do not want you to miss any points due to late submission. We work on beating deadlines by huge margins in order to ensure that you have ample time to review the paper before you submit it.

Will anyone find out that I used your services?

We have a privacy and confidentiality policy that guides our work. We NEVER share any customer information with third parties. Noone will ever know that you used our assignment help services. It’s only between you and us. We are bound by our policies to protect the customer’s identity and information. All your information, such as your names, phone number, email, order information, and so on, are protected. We have robust security systems that ensure that your data is protected. Hacking our systems is close to impossible, and it has never happened.

How our Assignment  Help Service Works

1.      Place an order

You fill all the paper instructions in the order form. Make sure you include all the helpful materials so that our academic writers can deliver the perfect paper. It will also help to eliminate unnecessary revisions.

2.      Pay for the order

Proceed to pay for the paper so that it can be assigned to one of our expert academic writers. The paper subject is matched with the writer’s area of specialization.

3.      Track the progress

You communicate with the writer and know about the progress of the paper. The client can ask the writer for drafts of the paper. The client can upload extra material and include additional instructions from the lecturer. Receive a paper.

4.      Download the paper

The paper is sent to your email and uploaded to your personal account. You also get a plagiarism report attached to your paper.

smile and order essaysmile and order essay PLACE THIS ORDER OR A SIMILAR ORDER WITH US TODAY AND GET A PERFECT SCORE!!!

order custom essay paper