ReLearn is a novel pipeline for unlearning in large language models that enhances targeted forgetting while maintaining high-quality output. It addresses limitations of existing methods by introducing a comprehensive evaluation framework that includes new metrics for knowledge preservation and generation quality. Experiments demonstrate that ReLearn effectively mitigates the negative effects of reverse optimization on coherent text generation.
+ unlearning
language-models ✓
data-augmentation ✓
evaluation-metrics ✓
machine-learning ✓