Tag: AI Ethics

All the articles with the tag "AI Ethics".

Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design

Published: 12 May, 2025 at 11:08 AM

64.27 🤔

This position paper advocates for redesigning Large Language Models as 'reasonable parrots' that integrate argumentation theory principles to foster critical thinking through multi-persona dialogues, challenging users with diverse perspectives rather than providing one-sided answers.
Evidence of conceptual mastery in the application of rules by Large Language Models

Published: 4 May, 2025 at 04:26 PM

62.79 🤔

本文通过心理实验证明大型语言模型在规则应用中表现出概念掌握能力，能够泛化到新情境并部分模仿人类对时间压力等语境的敏感性。
Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning

Published: 4 May, 2025 at 04:27 PM

58.67 🤔

本文提出Reason2Attack方法，通过基于Frame Semantics的CoT示例合成和带攻击过程奖励的强化学习，增强LLM的推理能力，以高效生成对抗性提示实现对T2I模型的越狱攻击。
A closer look at how large language models trust humans: patterns and biases

Published: 4 May, 2025 at 04:29 PM

56.91 🤔

本研究通过模拟实验首次揭示大型语言模型对人类的隐性信任模式，显示其类似于人类受可信度维度影响，但存在模型异质性和人口统计学偏差。
SuperARC: An Agnostic Test for Narrow, General, and Super Intelligence Based On the Principles of Recursive Compression and Algorithmic Probability

Published: 4 May, 2025 at 04:26 PM

54.84 🤔

本文提出SuperARC测试框架，通过算法概率和Kolmogorov复杂度的原理，设计了一个客观的AGI和ASI评估方法，证明递归压缩等价于预测，并展示了LLMs的局限性。

Tag: AI Ethics

Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design

Evidence of conceptual mastery in the application of rules by Large Language Models

Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning

A closer look at how large language models trust humans: patterns and biases

SuperARC: An Agnostic Test for Narrow, General, and Super Intelligence Based On the Principles of Recursive Compression and Algorithmic Probability