Tag: Synthetic Data

All the articles with the tag "Synthetic Data".

Label-efficient Single Photon Images Classification via Active Learning

Published: 16 May, 2025 at 11:11 AM

95.02 🤔

This paper proposes an active learning framework for single-photon image classification that combines imaging-condition-aware synthetic augmentation with a diversity-guided uncertainty-inconsistency sampling strategy. It reaches high accuracy (97% on synthetic data, 90.63% on real-world data) with far fewer labeled samples (1.5% and 8%, respectively) than the baselines.

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

Published: 3 Jun, 2025 at 11:29 AM

87.38 🤔

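Through controlled experiments, this paper studies how SFT and RL interact in enhancing LLM reasoning. It finds that a short-CoT warm-up contributes moderately to RL, that the number of backtracking steps should match task difficulty, and that RL depends little on the correctness of the SFT data but is sensitive to its structural consistency.

Context-Free Synthetic Data Mitigates Forgetting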

Published: 23 May, 2025 at 11:15 AM

86.17 🤔

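This paper proposes a context-free synthetic data (CFS) method that generates unconditional samples and combines the fine-tuning and pretraining losses to mitigate catastrophic forgetting in large language models in data-agnostic settings. Experiments on the Olmo-1B and R1-Distill-Llama-8B models validate its effectiveness.

Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning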

Published: 18 May, 2025 at 11:14 AM

85.14 🤔

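This paper proposes Reasoning CPT, which adds synthetic hidden-thought data to continual pretraining and significantly improves large language models' cross-domain reasoning, hard-problem solving, and reasoning efficiency. On the MMLU benchmark it achieves an overall gain of up to 3.3% and an improvement of about 8% on hard problems.

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks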

Published: 4 May, 2025 at 04:26 PM

70.33 🤔

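This paper proposes the PLAN-AND-ACT framework, which separates the planning and execution modules, trains on synthetic data, and replans dynamically to improve the performance of LLM agents on complex long-horizon tasks. It achieves state-of-the-art results on web navigation benchmarks.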
