Convolutional neural network optimization for phishing email classification

Authors
Salinas Monroy, Sergio A.
McGinley, Cameron
Issue Date
2021-12-15
Type
Conference paper
Keywords
Phishing, Emails, Deep learning, Convolutional neural networks, Word embedding, Natural language processing
Citation
C. McGinley and S. A. S. Monroy, "Convolutional Neural Network Optimization for Phishing Email Classification," 2021 IEEE International Conference on Big Data (Big Data), 2021, pp. 5609-5613, doi: 10.1109/BigData52589.2021.9671531.
Abstract

Phishing emails are one of the most common and effective tools that cybercriminals use to gain access to an organization’s network or personal information. To detect these attacks, email service providers use a variety of tools and indicators, such as the URLs that attackers include in their email messages. However, cybercriminals are able to bypass these detection techniques by omitting URLs in their messages and instead engaging victims in a conversation to advance their attacks. In this paper, we investigate the performance of convolutional neural network (CNN) models that identify phishing attacks by analyzing only the text in the email messages. The models take as input an embedding of the text in the email’s body and output a probability indicating the likelihood that the message is malicious. We evaluate several CNN architectures using real-world phishing emails and find that the best performing one can identify phishing attacks with an accuracy of 98.139%, recall of 98.125%, and precision of 98.269%.
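
As an illustration of the pipeline the abstract describes (text embedding in, phishing probability out), the following is a minimal sketch in plain Python. It is not the authors' architecture: the embedding size, filter width, number of filters, and the whitespace tokenizer are all assumptions chosen for brevity, and the weights are random and untrained, so the output only shows the data flow, not a meaningful prediction. The function names (embed, conv1d_max_pool, phishing_probability) are hypothetical.

```python
import math
import random


def embed(tokens, table, dim, rng):
    """Look up (or lazily create) a toy embedding vector for each token."""
    vectors = []
    for tok in tokens:
        if tok not in table:
            # Assumption: random vectors stand in for a trained word embedding.
            table[tok] = [rng.uniform(-0.5, 0.5) for _ in range(dim)]
        vectors.append(table[tok])
    return vectors


def conv1d_max_pool(vectors, filters, width):
    """Apply each filter to every window of `width` tokens, then ReLU and
    max-pool over time, giving one feature per filter."""
    pooled = []
    for filt in filters:
        best = 0.0  # starting at 0 makes the running max equal to max(ReLU(x))
        for start in range(len(vectors) - width + 1):
            window = [x for vec in vectors[start:start + width] for x in vec]
            activation = sum(w * x for w, x in zip(filt, window))
            best = max(best, activation)
        pooled.append(best)
    return pooled


def phishing_probability(text, seed=0, dim=8, width=3, n_filters=4):
    """Map an email body to a value in (0, 1) using untrained toy weights."""
    rng = random.Random(seed)
    tokens = text.lower().split()
    # Pad very short messages so at least one convolution window exists.
    tokens += ["<pad>"] * max(0, width - len(tokens))
    vectors = embed(tokens, {}, dim, rng)
    filters = [[rng.uniform(-0.5, 0.5) for _ in range(width * dim)]
               for _ in range(n_filters)]
    out_weights = [rng.uniform(-0.5, 0.5) for _ in range(n_filters)]
    pooled = conv1d_max_pool(vectors, filters, width)
    logit = sum(w * p for w, p in zip(out_weights, pooled))
    return 1.0 / (1.0 + math.exp(-logit))  # sigmoid


if __name__ == "__main__":
    print(phishing_probability("please confirm your account password today"))
```

In a real system the embedding table and filter weights would be learned from labeled emails, and the resulting probability would be thresholded to produce the classifications from which accuracy, recall, and precision are computed.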

Description
Click on the DOI link to view this conference paper (may not be free).
Publisher
IEEE
Series
2021 IEEE International Conference on Big Data (Big Data); 2021
DOI
10.1109/BigData52589.2021.9671531