Tag: Robustness

All the articles with the tag "Robustness".

Pushing the boundary on Natural Language Inference

Published: 4 May, 2025 at 04:30 PM

56.51 🤔

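This paper proposes combining Group Relative Policy Optimization with Chain-of-Thought learning to improve performance on natural language inference. The method requires no annotated reasoning paths and achieves state-of-the-art results on adversarial benchmarks through parameter-efficient fine-tuning.

An Empirical Study of Evaluating Long-form Question Answering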

Published: 4 May, 2025 at 04:31 PM

55.78 🤔

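This paper empirically studies automatic evaluation metrics for long-form question answering, shows that LLM-based metrics are more accurate and more stable, and analyzes their biases and strategies for improving them.

CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks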

Published: 4 May, 2025 at 04:32 PM

55.01 🤔

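This paper proposes CachePrune, which uses feature attribution based on a DPO loss to identify and prune key neurons in the KV cache, defending against indirect prompt injection attacks while preserving the model's response quality.

ASIDE: Architectural Separation of Instructions and Data in Language Models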

Published: 4 May, 2025 at 04:27 PM

53.34 🤔

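This paper proposes ASIDE, which applies a fixed orthogonal rotation at the embedding level to architecturally separate instructions from data in large language models, improving safety and robustness to prompt injection attacks without sacrificing performance.

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review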

Published: 4 May, 2025 at 04:31 PM

50.24 🤔

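Through a systematic review and empirical benchmarking, this paper compares uncertainty quantification and calibration methods for LLMs, revealing their effectiveness and limitations and offering key insights for future research.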
