Fine-tuning Large Language Models for Domain-Specific Tasks: A Comparative Study on Supervised and Parame-ter-Efficient Fine-tuning Techniques for Cybersecurity and IT Support

Sai, Chaithanya and Elmitwally, Nouh and Rice, Iain and Mahmoud, Haitham and Vickers, Ian and Schmoor, Xavier (2024) Fine-tuning Large Language Models for Domain-Specific Tasks: A Comparative Study on Supervised and Parame-ter-Efficient Fine-tuning Techniques for Cybersecurity and IT Support. In: The 4th International Conference of Advanced Computing and Informatics, 16 - 17 December 2024, Birmingham City University. (In Press)

[thumbnail of ICACIN_2024_Camera_Ready_Paper_32.pdf] Text
ICACIN_2024_Camera_Ready_Paper_32.pdf - Accepted Version
Restricted to Repository staff only

Download (548kB) | Request a copy

Abstract

This study investigates the fine-tuning of open-source large language models (LLMs) for domain-specific tasks, such as question-answering in cybersecu-rity and IT support. It focuses on two fine-tuning techniques: Supervised Fi-ne-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT), specifically Low-Rank Adaptation (LoRA). The research compares the performance of 21 open-source LLMs, ranging from 2 to 9 billion parameters models like Llama2-7B, Llama3.1-7B, Mistral-7B, Falcon-7B, Phi-3.5, and Gemma2-9B. SFT consistently delivers high accuracy and low train-evaluation loss, while LoRA significantly reduces GPU memory usage and computational costs without compromising performance. The research findings emphasize the importance of selecting optimal fine-tuning techniques and model architec-tures for domain-specific tasks and also highlight advancements in fine-tuning LLMs for efficient and scalable AI solutions in production.

Item Type: Conference or Workshop Item (Paper)
Dates:
Date
Event
17 December 2024
Accepted
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions: Faculty of Computing, Engineering and the Built Environment > College of Computing
Depositing User: Nouh Elmitwally
Date Deposited: 13 Feb 2025 12:47
Last Modified: 13 Feb 2025 12:47
URI: https://www.open-access.bcu.ac.uk/id/eprint/16136

Actions (login required)

View Item View Item

Research

In this section...