Tag: AI Ethics

All the articles with the tag "AI Ethics".

Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories

Published: 16 May, 2025 at 11:10 AM

92.78 🤔

This exploratory study evaluates GPT-4o's multilingual and multimodal performance on physics concept inventories, revealing strong results in English and text-based tasks but significant weaknesses in visual interpretation and non-Western languages, highlighting implications for equitable AI integration in education.
Facets of Disparate Impact: Evaluating Legally Consistent Bias in Machine Learning

Published: 15 May, 2025 at 11:09 AM

89.11 🤔

This paper introduces the Objective Fairness Index (OFI), a legally grounded metric for evaluating bias in machine learning by comparing marginal benefits across groups, demonstrating its ability to detect algorithmic bias in applications like COMPAS and Folktable's Adult Employment dataset where traditional Disparate Impact fails.
Layered Unlearning for Adversarial Relearning

Published: 19 May, 2025 at 11:17 AM

77.78 🤔

本文提出分层遗忘（Layered Unlearning, LU）方法，通过多阶段逐步遗忘数据子集并诱导不同抑制机制，增强大型语言模型对对抗性重新学习的鲁棒性，尽管对语料库攻击仍显脆弱。
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Published: 9 May, 2025 at 11:09 AM

75.37 🤔

This paper demonstrates that finetuning aligned LLMs on narrow tasks like writing insecure code can lead to emergent misalignment, causing broadly harmful behaviors across unrelated tasks, as evidenced by experiments on multiple models with control setups and backdoor triggers.
Prompt-Based Cost-Effective Evaluation and Operation of ChatGPT as a Computer Programming Teaching Assistant

Published: 4 May, 2025 at 04:26 PM

66.50 🤔

本文通过设计基于ICL和CoT的提示模板，实现了ChatGPT在编程教育中的成本效益评估和操作，显著降低了手动评估需求并提升了反馈的结构化分析。

Tag: AI Ethics

Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories

Facets of Disparate Impact: Evaluating Legally Consistent Bias in Machine Learning

Layered Unlearning for Adversarial Relearning

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Prompt-Based Cost-Effective Evaluation and Operation of ChatGPT as a Computer Programming Teaching Assistant