Tag: Reasoning

All the articles with the tag "Reasoning".

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

Published: 1 Jun, 2025 at 11:43 AM

88.88 🤔

本文提出动态思维模式优化框架（DTO），通过分割和优化大型推理模型的推理路径，显著减少计算开销并提升准确率，在数学推理基准上实现高达12%的准确率提升和47%的FLOPs减少。
Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models

Published: 28 May, 2025 at 11:26 AM

88.80 🤔

本文通过分析大型语言模型中长链式思维能力的激活模式，提出了一种训练无关的激活控制方法（EELo-CoT）和参数高效微调策略，在推理时动态调整激活值以显著提升自反思率和准确率。
Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs

Published: 23 May, 2025 at 11:09 AM

88.75 🤔

本文通过层级上下文掩码和跨任务补丁方法，验证了大型语言模型内部存在‘内部思维链’，即在不同网络深度学习并按序执行复合任务的子任务，从而提升了模型透明度并为指令级行为控制开辟了新路径。
When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

Published: 8 May, 2025 at 06:13 PM

88.68 🤔

This paper demonstrates that a 1.5B parameter reasoning model (Distill-R1) outperforms larger non-reasoning LLMs as a discriminator in a text-to-SQL planning framework by leveraging a novel soft score extraction method from chain-of-thought outputs, though it struggles significantly as a generator.
From Compression to Expansion: A Layerwise Analysis of In-Context Learning

Published: 30 May, 2025 at 11:15 AM

88.68 🤔

本文通过统计几何分析揭示了大型语言模型在上下文学习中的层级压缩-扩展现象，早期层压缩任务信息，后期层扩展生成预测，并探讨了模型大小、演示数量和噪声对性能的影响。

Tag: Reasoning

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models

Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs

When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

From Compression to Expansion: A Layerwise Analysis of In-Context Learning