Tag: Inference Time

All the articles with the tag "Inference Time".

RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning

Published: 23 May, 2025 at 11:16 AM

90.48 🤔

本文提出RL-of-Thoughts (RLoT) 方法，通过强化学习训练轻量化导航模型，在推理时动态构建任务特定逻辑结构，显著提升大型语言模型在多领域推理任务中的表现，并展现出跨模型和任务的强迁移能力。
ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy

Published: 26 May, 2025 at 11:23 AM

85.33 🤔

ThinkLess 提出了一种无需训练的推理效率提升框架，通过注意力分析揭示 CoT 推理冗余并早期终止生成，结合轻量级输出调节机制，在保持准确率的同时显著降低 token 使用量和推理时间。