Tag: Human-AI Interaction

All the articles with the tag "Human-AI Interaction".

The dynamic interplay between in-context and in-weight learning in humans and neural networks

Published: 6 May, 2025 at 11:20 PM

70.07 🤔

本文通过神经网络中上下文学习（ICL）与权重学习（IWL）的动态交互，统一解释了人类学习中的组合性泛化、课程效应及灵活性与保留性权衡，为认知科学双过程理论提供了新视角。
TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs

Published: 4 May, 2025 at 04:29 PM

70.02 🤔

本文提出TeLLMe，一种能量高效的三元LLM FPGA加速器，通过表查找矩阵引擎和反向注意力优化，支持预填充和解码阶段，在7W功率下实现高达9.51 tokens/s吞吐量和低预填充延迟。
Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning

Published: 19 May, 2025 at 11:19 AM

69.75 🤔

本文提出Nemotron-Research-Tool-N1，通过基于规则的强化学习和二元奖励函数训练工具调用语言模型，在不依赖标注推理轨迹的情况下显著提升工具调用能力，实验表明其在多个基准上超越GPT-4o等强基线。
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution

Published: 12 May, 2025 at 11:18 AM

69.15 🤔

This paper introduces Gaussian Concept Subspace (GCS), a framework to model concept representations in LLMs as Gaussian distributions, demonstrating improved robustness, faithfulness, and plausibility over single vector methods, with effective application in emotion steering tasks.
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Published: 10 May, 2025 at 10:59 AM

68.75 🤔

This paper introduces MM-Escape, a benchmark using the customizable 3D environment EscapeCraft to evaluate multimodal reasoning in MLLMs through room escape tasks, revealing that while models like GPT-4o achieve high success in simple scenarios, performance drops significantly with increased difficulty, exposing distinct limitations in reasoning and spatial awareness.

Tag: Human-AI Interaction

The dynamic interplay between in-context and in-weight learning in humans and neural networks

TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs

Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning

Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game