A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients

Feng, Jian; Zou, Lianyang; Nan, Tianzhu

doi:10.20532/cit.2019.1004702

Journal of computing and information technology, Vol. 27 No. 2, 2019.

Original scientific paper

https://doi.org/10.20532/cit.2019.1004702

A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients

Jian Feng ; College of Computer Science and Technology, Xi’an University of Science and Technology, Xi’an, China
Lianyang Zou ; College of Computer Science and Technology, Xi’an University of Science and Technology, Xi’an, China
Tianzhu Nan ; Xi’an Fenghuo Software Technology Co., Ltd., Xi’an, China

Full text: english pdf 977 Kb

page 41-54

downloads: 781

cite

APA 6th Edition

Feng, J., Zou, L. & Nan, T. (2019). A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients. Journal of computing and information technology, 27 (2), 41-54. https://doi.org/10.20532/cit.2019.1004702

MLA 8th Edition

Feng, Jian, et al. "A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients." Journal of computing and information technology, vol. 27, no. 2, 2019, pp. 41-54. https://doi.org/10.20532/cit.2019.1004702. Accessed 3 Jun. 2026.

Chicago 17th Edition

Feng, Jian, Lianyang Zou and Tianzhu Nan. "A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients." Journal of computing and information technology 27, no. 2 (2019): 41-54. https://doi.org/10.20532/cit.2019.1004702

Harvard

Feng, J., Zou, L., and Nan, T. (2019). 'A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients', Journal of computing and information technology, 27(2), pp. 41-54. https://doi.org/10.20532/cit.2019.1004702

Vancouver

Feng J, Zou L, Nan T. A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients. Journal of computing and information technology [Internet]. 2019 [cited 2026 June 03];27(2):41-54. https://doi.org/10.20532/cit.2019.1004702

IEEE

J. Feng, L. Zou and T. Nan, "A Phishing Webpage Detection Method Based on Stacked Autoencoder and Correlation Coefficients", Journal of computing and information technology, vol.27, no. 2, pp. 41-54, 2019. [Online]. https://doi.org/10.20532/cit.2019.1004702

Abstract

Phishing is a kind of cyber-attack that targets naive online users by tricking them into revealing sensitive information. There are many anti-phishing solutions proposed to date, such as blacklist or whitelist, heuristic-based and machine learning-based methods. However, online users are still being trapped into revealing sensitive information in phishing websites. In this paper, we propose a novel phishing webpage detection model, based on features that are extracted from URL, source codes of HTML, and the third-party services to represent the basic characters of phishing webpages, which uses a deep learning method – Stacked Autoencoder (SAE) to detect phishing webpages. To make features in the same order of magnitude, three kinds of normalization methods are adopted. In particular, a method to calculate correlation coefficients between weight matrixes of SAE is proposed to determine optimal width of hidden layers, which shows high computational efficiency and feasibility. Based on the testing of a set of phishing and benign webpages, the model using SAE achieves the best performance when compared to other algorithms such as Naive Bayes (NB), Support Vector Machine (SVM), Convolutional Neural Networks (CNN), and Recurrent Neural Networks (RNN). It indicates that the proposed detection model is promising and can be applied effectively to phishing detection.

Keywords

phishing; deep learning; correlation coefficient

Hrčak ID:

228265

URI

https://hrcak.srce.hr/228265

Publication date:

18.11.2019.

Visits: 1.878 *

Login and registration

Journal of computing and information technology, Vol. 27 No. 2, 2019.

Abstract

Keywords

Hrčak ID:

URI

Publication date: