Enhanced Intrusion Detection Systems Performance with UNSW-NB15 Data Analysis

More, Shweta and Idrissi, Moad and Mahmoud, Haitham and Asyhari, A. Taufiq (2024) Enhanced Intrusion Detection Systems Performance with UNSW-NB15 Data Analysis. Algorithms, 17 (2). p. 64. ISSN 1999-4893

[img]
Preview
Text
algorithms-17-00064.pdf - Published Version
Available under License Creative Commons Attribution.

Download (779kB)

Abstract

The rapid proliferation of new technologies such as Internet of Things (IoT), cloud computing, virtualization, and smart devices has led to a massive annual production of over 400 zettabytes of network traffic data. As a result, it is crucial for companies to implement robust cybersecurity measures to safeguard sensitive data from intrusion, which can lead to significant financial losses. Existing intrusion detection systems (IDS) require further enhancements to reduce false positives as well as enhance overall accuracy. To minimize security risks, data analytics and machine learning can be utilized to create data-driven recommendations and decisions based on the input data. This study focuses on developing machine learning models that can identify cyber-attacks and enhance IDS system performance. This paper employed logistic regression, support vector machine, decision tree, and random forest algorithms on the UNSW-NB15 network traffic dataset, utilizing in-depth exploratory data analysis, and feature selection using correlation analysis and random sampling to compare model accuracy and effectiveness. The performance and confusion matrix results indicate that the Random Forest model is the best option for identifying cyber-attacks, with a remarkable F1 score of 97.80%, accuracy of 98.63%, and low false alarm rate of 1.36%, and thus should be considered to improve IDS system security.

Item Type: Article
Identification Number: https://doi.org/10.3390/a17020064
Dates:
DateEvent
22 January 2024Accepted
1 February 2024Published Online
Uncontrolled Keywords: machine learning in cyber security, UNSW-NB15 dataset, logistic regression, support vector machine, decision tree, random forest
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Gemma Tonks
Date Deposited: 11 Mar 2024 16:53
Last Modified: 11 Mar 2024 16:53
URI: https://www.open-access.bcu.ac.uk/id/eprint/15332

Actions (login required)

View Item View Item

Research

In this section...