Tag: Reasoning

All the articles with the tag "Reasoning".

RARE: Retrieval-Augmented Reasoning Modeling

Published: 21 May, 2025 at 11:28 AM

86.57 🤔

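RARE proposes a new paradigm that externalizes domain knowledge storage and optimizes reasoning ability, enabling lightweight models to achieve state-of-the-art performance on multi-domain benchmarks and surpass retrieval-augmented GPT-4 and DeepSeek-R1.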
ThinkSwitcher: When to Think Hard, When to Think Fast

Published: 24 May, 2025 at 11:12 AM

86.56 🤔

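ThinkSwitcher uses a lightweight adaptive framework that lets a single large reasoning model switch dynamically between long and short chain-of-thought modes according to task complexity, cutting computational cost by 20-30% on math reasoning benchmarks while maintaining high accuracy on complex tasks.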
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Published: 8 May, 2025 at 06:17 PM

86.55 🤔

This paper investigates zero RL training on diverse open base models, achieving significant accuracy and response length improvements while identifying key factors like reward design and data difficulty that influence the emergence of reasoning behaviors.
Putting It All into Context: Simplifying Agents with LCLMs

Published: 19 May, 2025 at 11:19 AM

86.55 🤔

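This paper proposes a "state-in-context" agent design based on long-context language models (LCLMs), which simplifies agent architectures for software engineering tasks by placing the entire environment state in the context, and achieves performance on SWE-bench Verified comparable to complex scaffolding methods (Gemini-2.5-Pro reaches 50.8% pass@1).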
Understanding Cross-Lingual Inconsistency in Large Language Models

Published: 26 May, 2025 at 11:22 AM

86.50 🤔

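This paper uses the *logit lens* to analyze cross-lingual inconsistency in large language models (LLMs), finds that large models tend to operate in individual language subspaces rather than a shared semantic space, and proposes a cross-lingual activation steering method to improve the multilingual reasoning performance and knowledge transfer of small models.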
