Tag: Agent

All the articles with the tag "Agent".

Putting It All into Context: Simplifying Agents with LCLMs

Published: 19 May, 2025 at 11:19 AM

86.55 🤔

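This paper proposes a "state-in-context" agent design based on long-context language models (LCLMs), which simplifies agent architectures for software engineering tasks by placing the entire environment state in the context. On SWE-bench Verified it achieves performance comparable to complex scaffolding approaches (Gemini-2.5-Pro reaches 50.8% pass@1).

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning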

Published: 13 May, 2025 at 11:12 AM

76.49 🤔

ARTIST, a novel framework unifying agentic reasoning, reinforcement learning, and tool integration, enables LLMs to autonomously orchestrate external tools within multi-turn reasoning, achieving up to 22% accuracy gains on complex math tasks and significant improvements in multi-turn function calling over baselines.

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

Published: 4 May, 2025 at 04:26 PM

70.33 🤔

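This paper proposes the PLAN-AND-ACT framework, which improves the performance of LLM agents on complex long-horizon tasks by separating the planning and execution modules, training on synthetic data, and replanning dynamically. It achieves state-of-the-art results on web navigation benchmarks.

VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making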

Published: 9 May, 2025 at 11:08 AM

70.12 🤔

This paper introduces VLM Q-Learning, an offline-to-online reinforcement learning method that fine-tunes Vision-Language Models for interactive decision-making by filtering suboptimal actions with a critic head, achieving significant performance improvements over supervised fine-tuning across multiple multimodal agent tasks.

LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics

Published: 7 May, 2025 at 09:32 AM

62.29 🤔

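This paper proposes an LLM-based, agent-orchestrated robotic system that autonomously executes long-horizon tasks in household environments through modular task planning and RAG-based memory retrieval. Across three scenarios it demonstrates high task-planning accuracy and improved memory recall.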
