Tag: Synthetic Data
All the articles with the tag "Synthetic Data".
-
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning
本文通过控制实验研究SFT和RL在增强LLM推理能力中的相互作用,发现短CoT预热对RL有中等贡献,回溯次数需与任务难度匹配,且RL对SFT数据正确性依赖较小而对结构一致性敏感。
-
Label-efficient Single Photon Images Classification via Active Learning
This paper proposes an active learning framework for single-photon image classification that uses imaging condition-aware synthetic augmentation and a diversity-guided uncertainty-inconsistency sampling strategy to achieve high accuracy (97% on synthetic, 90.63% on real-world data) with significantly fewer labeled samples (1.5% and 8%, respectively) compared to baselines.
-
Context-Free Synthetic Data Mitigates Forgetting
本文提出了一种上下文无关合成数据(CFS)方法,通过生成无条件样本并结合微调和预训练损失,缓解大型语言模型在数据不可知场景下的灾难性遗忘,实验在Olmo-1B和R1-Distill-Llama-8B模型上验证了其有效性。
-
本文提出Reasoning CPT方法,通过在持续预训练中加入合成隐藏思维数据,显著提升大型语言模型在跨领域推理、困难问题解决和推理效率方面的表现,特别是在MMLU基准上实现了最高3.3%的整体提升和困难问题上约8%的改进。
-
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks
本文提出PLAN-AND-ACT框架,通过分离规划和执行模块、利用合成数据训练和动态重规划,提高LLM代理在复杂长期任务中的性能,并在web导航基准上达到state-of-the-art结果。