Weed Out, Then Harvest: Dual Low-Rank Adaptation is an Effective Noisy Label Detector for Noise-Robust Learning

Yuan, Chen, Zhang

Parameter-efficient fine-tuning (PEFT) of large language models (LLMs) has shown impressive performance in various downstream tasks. However, in many real-world scenarios, the collected training data inevitably contains noisy labels. To learn from noisy labels, most solutions select samples with small losses for model training. The selected samples, in turn, affect the loss computation in the next iteration, so an inaccurate initial selection can create a vicious cycle that leads to suboptimal performance. To break this cycle, we propose Delora, a novel framework that decouples sample selection from model training. For sample selection, Delora establishes a noisy label detector by introducing a clean LoRA and a noisy LoRA. Benefiting from the memory effect, the clean LoRA is encouraged to memorize clean data, while the noisy LoRA is constrained to memorize mislabeled data, which serves as a learnable threshold for selecting clean and noisy samples. For model training, Delora can use the carefully selected samples to fine-tune language models seamlessly. Experimental results on synthetic and real-world noisy datasets demonstrate the effectiveness of Delora in noisy label detection and text classification.

Basic Information

Paper ID: 2510.10208
Title: Weed Out, Then Harvest: Dual Low-Rank Adaptation is an Effective Noisy Label Detector for Noise-Robust Learning
Authors: Bo Yuan, Yulin Chen, Yin Zhang (Zhejiang University)
Category: cs.CL (Computation and Language)
Published: October 11, 2025
Paper link: https://arxiv.org/abs/2510.10208v1

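Abstract

Parameter-efficient fine-tuning (PEFT) of large language models performs well on a variety of downstream tasks, but in real-world scenarios the training data inevitably contains noisy labels. Existing noisy-label learning methods typically select small-loss samples for training, yet this selection affects the loss computation in the next iteration, so an inaccurate initial selection creates a vicious cycle. This paper proposes the Delora framework, which breaks the cycle by decoupling sample selection from model training. The framework introduces a clean LoRA and a noisy LoRA to build a noisy label detector: exploiting the memory effect, the clean LoRA memorizes clean data and the noisy LoRA memorizes mislabeled data, which serves as a learnable threshold for selecting samples. Experimental results demonstrate the effectiveness of Delora in noisy label detection and text classification.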

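Research Background and Motivation

Problem Definition

Core problem: how to handle the unavoidable noisy labels in training data during parameter-efficient fine-tuning of large language models
Importance: annotation errors are unavoidable in real-world data collection, and they can seriously degrade model performance and generalization
Limitations of existing methods:
- Conventional small-loss selection suffers from a "vicious cycle": sample selection affects the loss computation, and the loss computation in turn affects sample selection
- Reliance on manually set thresholds limits practicality
- Performance is unstable in high-noise settings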
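
To make the small-loss selection baseline concrete, the following is a minimal sketch of the generic heuristic, not the Delora method and not code from the paper. The function name `select_small_loss`, the `keep_ratio` parameter, and the loss values are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def select_small_loss(losses: Sequence[float], keep_ratio: float) -> List[int]:
    """Return indices of the keep_ratio fraction of samples with the smallest loss.

    keep_ratio plays the role of the manually set threshold: it must be
    chosen by hand, typically from an estimate of the noise rate.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    n_keep = max(1, int(len(losses) * keep_ratio))
    # Rank sample indices by ascending loss and keep the first n_keep.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i])
    return sorted(ranked[:n_keep])


# Hypothetical per-sample losses from the current model (illustrative values).
# Under the small-loss heuristic, high-loss samples are treated as likely mislabeled.
losses = [0.12, 2.30, 0.08, 1.75, 0.40, 0.22]
kept = select_small_loss(losses, keep_ratio=0.5)
print(kept)  # [0, 2, 5]
```

In a training loop, the model would next be updated only on the `kept` samples, and the following round of losses would come from that updated model. This is the coupling behind the vicious cycle: a mislabeled sample that is wrongly kept early on gets fitted, its loss drops, and it becomes more likely to be kept again.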