Posts

All the articles I've posted.

PennyLang: Pioneering LLM-Based Quantum Code Generation with a Novel PennyLane-Centric Dataset

Published: 4 May, 2025 at 04:26 PM

55.37 🤔

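This paper introduces the PennyLang dataset and a RAG/GraphRAG framework that improve the accuracy and correctness of LLM-generated PennyLane quantum code, filling a gap in AI-assisted quantum programming.

Learning Explainable Dense Reward Shapes via Bayesian Optimization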

Published: 4 May, 2025 at 04:30 PM

55.26 🤔

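This paper proposes learning explainable dense reward shapes via Bayesian Optimization to address reward sparsity in RLHF. The method optimizes token-level credit assignment, improving training efficiency and performance while leaving the optimal policy unchanged.

CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks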

Published: 4 May, 2025 at 04:32 PM

55.01 🤔

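This paper proposes CachePrune, which uses feature attribution based on a DPO loss to identify and prune key neurons in the KV cache, defending against indirect prompt injection attacks while preserving the model's response quality.

SuperARC: An Agnostic Test for Narrow, General, and Super Intelligence Based On the Principles of Recursive Compression and Algorithmic Probability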

Published: 4 May, 2025 at 04:26 PM

54.84 🤔

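This paper proposes the SuperARC test framework, which builds on the principles of algorithmic probability and Kolmogorov complexity to design an objective evaluation method for AGI and ASI. It proves that recursive compression is equivalent to prediction and demonstrates the limitations of LLMs.

Synergizing RAG and Reasoning: A Systematic Review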

Published: 4 May, 2025 at 04:28 PM

54.75 🤔

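This paper systematically reviews the synergistic integration of retrieval-augmented generation (RAG) with reasoning capabilities. It builds a multidimensional taxonomy, offers practical guidelines, and identifies future research directions for advancing the cognitive capabilities of RAG systems on complex tasks.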
