Phishing detection based Associative Classification data mining

Neda Abdelhamid; Aladdin Ayesh; Fadi Thabtah

doi:10.1016/j.eswa.2014.03.019

Phishing detection based Associative Classification data mining

Neda Abdelhamid^*, Aladdin Ayesh, Fadi Thabtah

^*Corresponding author for this work

Research output: Contribution to journal › Review article › peer-review

255 Citations (Scopus)

Abstract

Website phishing is considered one of the crucial security challenges for the online community due to the massive numbers of online transactions performed on a daily basis. Website phishing can be described as mimicking a trusted website to obtain sensitive information from online users such as usernames and passwords. Black lists, white lists and the utilisation of search methods are examples of solutions to minimise the risk of this problem. One intelligent approach based on data mining called Associative Classification (AC) seems a potential solution that may effectively detect phishing websites with high accuracy. According to experimental studies, AC often extracts classifiers containing simple "If-Then" rules with a high degree of predictive accuracy. In this paper, we investigate the problem of website phishing using a developed AC method called Multi-label Classifier based Associative Classification (MCAC) to seek its applicability to the phishing problem. We also want to identify features that distinguish phishing websites from legitimate ones. In addition, we survey intelligent approaches used to handle the phishing problem. Experimental results using real data collected from different sources show that AC particularly MCAC detects phishing websites with higher accuracy than other intelligent algorithms. Further, MCAC generates new hidden knowledge (rules) that other algorithms are unable to find and this has improved its classifiers predictive performance.

Original language	English
Pages (from-to)	5948-5959
Number of pages	12
Journal	Expert Systems with Applications
Volume	41
Issue number	13
Early online date	27 Mar 2014
DOIs	https://doi.org/10.1016/j.eswa.2014.03.019
Publication status	Published - 1 Oct 2014
Externally published	Yes

Keywords

Classification
Data mining
Forged websites
Internet security
Phishing

Access to Document

10.1016/j.eswa.2014.03.019

Cite this

@article{8fa04a70f4014c63bcbd12ed2366a755,

title = "Phishing detection based Associative Classification data mining",

abstract = "Website phishing is considered one of the crucial security challenges for the online community due to the massive numbers of online transactions performed on a daily basis. Website phishing can be described as mimicking a trusted website to obtain sensitive information from online users such as usernames and passwords. Black lists, white lists and the utilisation of search methods are examples of solutions to minimise the risk of this problem. One intelligent approach based on data mining called Associative Classification (AC) seems a potential solution that may effectively detect phishing websites with high accuracy. According to experimental studies, AC often extracts classifiers containing simple {"}If-Then{"} rules with a high degree of predictive accuracy. In this paper, we investigate the problem of website phishing using a developed AC method called Multi-label Classifier based Associative Classification (MCAC) to seek its applicability to the phishing problem. We also want to identify features that distinguish phishing websites from legitimate ones. In addition, we survey intelligent approaches used to handle the phishing problem. Experimental results using real data collected from different sources show that AC particularly MCAC detects phishing websites with higher accuracy than other intelligent algorithms. Further, MCAC generates new hidden knowledge (rules) that other algorithms are unable to find and this has improved its classifiers predictive performance.",

keywords = "Classification, Data mining, Forged websites, Internet security, Phishing",

author = "Neda Abdelhamid and Aladdin Ayesh and Fadi Thabtah",

year = "2014",

month = oct,

day = "1",

doi = "10.1016/j.eswa.2014.03.019",

language = "English",

volume = "41",

pages = "5948--5959",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "PERGAMON-ELSEVIER SCIENCE LTD",

number = "13",

}

TY - JOUR

T1 - Phishing detection based Associative Classification data mining

AU - Abdelhamid, Neda

AU - Ayesh, Aladdin

AU - Thabtah, Fadi

PY - 2014/10/1

Y1 - 2014/10/1

N2 - Website phishing is considered one of the crucial security challenges for the online community due to the massive numbers of online transactions performed on a daily basis. Website phishing can be described as mimicking a trusted website to obtain sensitive information from online users such as usernames and passwords. Black lists, white lists and the utilisation of search methods are examples of solutions to minimise the risk of this problem. One intelligent approach based on data mining called Associative Classification (AC) seems a potential solution that may effectively detect phishing websites with high accuracy. According to experimental studies, AC often extracts classifiers containing simple "If-Then" rules with a high degree of predictive accuracy. In this paper, we investigate the problem of website phishing using a developed AC method called Multi-label Classifier based Associative Classification (MCAC) to seek its applicability to the phishing problem. We also want to identify features that distinguish phishing websites from legitimate ones. In addition, we survey intelligent approaches used to handle the phishing problem. Experimental results using real data collected from different sources show that AC particularly MCAC detects phishing websites with higher accuracy than other intelligent algorithms. Further, MCAC generates new hidden knowledge (rules) that other algorithms are unable to find and this has improved its classifiers predictive performance.

AB - Website phishing is considered one of the crucial security challenges for the online community due to the massive numbers of online transactions performed on a daily basis. Website phishing can be described as mimicking a trusted website to obtain sensitive information from online users such as usernames and passwords. Black lists, white lists and the utilisation of search methods are examples of solutions to minimise the risk of this problem. One intelligent approach based on data mining called Associative Classification (AC) seems a potential solution that may effectively detect phishing websites with high accuracy. According to experimental studies, AC often extracts classifiers containing simple "If-Then" rules with a high degree of predictive accuracy. In this paper, we investigate the problem of website phishing using a developed AC method called Multi-label Classifier based Associative Classification (MCAC) to seek its applicability to the phishing problem. We also want to identify features that distinguish phishing websites from legitimate ones. In addition, we survey intelligent approaches used to handle the phishing problem. Experimental results using real data collected from different sources show that AC particularly MCAC detects phishing websites with higher accuracy than other intelligent algorithms. Further, MCAC generates new hidden knowledge (rules) that other algorithms are unable to find and this has improved its classifiers predictive performance.

KW - Classification

KW - Data mining

KW - Forged websites

KW - Internet security

KW - Phishing

UR - http://www.scopus.com/inward/record.url?scp=84899698551&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2014.03.019

DO - 10.1016/j.eswa.2014.03.019

M3 - Review article

AN - SCOPUS:84899698551

SN - 0957-4174

VL - 41

SP - 5948

EP - 5959

JO - Expert Systems with Applications

JF - Expert Systems with Applications

IS - 13

ER -

Phishing detection based Associative Classification data mining

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this