Tag: Human-AI Interaction

All the articles with the tag "Human-AI Interaction".

Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving

Published: 4 May, 2025 at 04:30 PM

60.43 🤔

本文提出基于认知负载的适应性流式传输框架，用于优化 LLM 服务，通过动态调整输出速度减少计算资源消耗高达 16.8%，同时维持用户满意度。
EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

Published: 4 May, 2025 at 04:27 PM

60.29 🤔

本文提出EPO方法，通过强化学习优化一个专门的战略推理模型，辅助任意LLM代理在动态环境中实现长期目标对齐，提升战略推理能力。
A closer look at how large language models trust humans: patterns and biases

Published: 4 May, 2025 at 04:29 PM

56.91 🤔

本研究通过模拟实验首次揭示大型语言模型对人类的隐性信任模式，显示其类似于人类受可信度维度影响，但存在模型异质性和人口统计学偏差。
MARFT: Multi-Agent Reinforcement Fine-Tuning

Published: 4 May, 2025 at 04:28 PM

56.39 🤔

本文提出MARFT框架，通过序列决策和信任区域优化在LLM-based多代理系统中实现高效强化微调，提升代理协作能力并解决传统MARL的适用性问题。
Monte Carlo Planning with Large Language Model for Text-Based Game Agents

Published: 4 May, 2025 at 04:30 PM

55.97 🤔

本文提出MC-DML算法，通过整合大型语言模型的动态记忆机制与蒙特卡罗树搜索，提升文本-based游戏代理的规划效率和性能，实验结果显示其在初始阶段就优于需多次迭代的强基线。

Tag: Human-AI Interaction

Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

A closer look at how large language models trust humans: patterns and biases

MARFT: Multi-Agent Reinforcement Fine-Tuning

Monte Carlo Planning with Large Language Model for Text-Based Game Agents