Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models
By: Dinesh Srivasthav P, Bala Mallikarjunarao Garlapati
Potential Business Impact:
Removes private info from AI without retraining.
Large Language Models (LLMs) face significant privacy, ethics, and compliance challenges when sensitive or obsolete data must be selectively removed. Retraining these models from scratch is computationally infeasible, necessitating efficient alternatives. As part of SemEval-2025 Task 4, this work applies selective unlearning to LLMs to address this challenge. In this paper, we present our experiments and findings, primarily leveraging global weight modification to strike a balance among unlearning effectiveness, knowledge retention, and the target model's post-unlearning utility. We also detail the task-specific evaluation mechanism, results, and challenges. Our algorithms achieved aggregate scores of 0.409 and 0.389 on the test set for the 7B and 1B target models, respectively, demonstrating promising results in verifiable LLM unlearning.
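The abstract does not spell out the algorithm, but a common way to realize unlearning via global weight modification is a gradient-difference objective: ascend the language-modeling loss on the forget set while descending on a retain set, updating all model weights. The sketch below illustrates that idea under stated assumptions; the model name, the `alpha`/`beta` trade-off weights, and the toy forget/retain batches are illustrative stand-ins, not the authors' actual setup.

```python
# Minimal sketch of unlearning via global weight modification, assuming a
# gradient-difference objective (a common LLM unlearning baseline, not
# necessarily the paper's exact method).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder model; the task used 1B and 7B target models not shown here.
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
alpha, beta = 1.0, 1.0  # hypothetical forget/retain trade-off weights

# Toy stand-ins for the forget and retain sets.
forget_batches = [["The secret account number is 12345."]]
retain_batches = [["Paris is the capital of France."]]

def lm_loss(texts):
    """Standard causal-LM loss on a batch of strings."""
    enc = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    return model(**enc, labels=enc["input_ids"]).loss

model.train()
for forget_texts, retain_texts in zip(forget_batches, retain_batches):
    # Ascend on the forget set (negative sign) to erase it, descend on the
    # retain set to preserve utility; every weight is updated globally.
    loss = -alpha * lm_loss(forget_texts) + beta * lm_loss(retain_texts)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

Tuning `alpha` and `beta` is where the balancing act lives: too much ascent degrades overall utility, too little leaves the target data recoverable.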
Similar Papers
SHA256 at SemEval-2025 Task 4: Selective Amnesia -- Constrained Unlearning for Large Language Models via Knowledge Isolation
Computation and Language
Removes private info from AI without breaking it.
Towards Benchmarking Privacy Vulnerabilities in Selective Forgetting with Large Language Models
Machine Learning (CS)
Tests AI's ability to forget data safely.
A Survey on Unlearning in Large Language Models
Computation and Language
Lets AI forget private or bad information.