Posts

All the articles I've posted.

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Published: 4 May, 2025 at 04:31 PM

59.95 🤔

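This paper proposes Token-Shuffle, a method that exploits dimensional redundancy in the visual vocabulary to dynamically merge and restore image tokens, enabling efficient high-resolution text-to-image generation while maintaining strong performance within a unified autoregressive framework.

Latent Factor Models Meets Instructions: Goal-conditioned Latent Factor Discovery without Task Supervision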

Published: 4 May, 2025 at 04:27 PM

59.70 🤔

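This paper proposes Instruct-LF, which combines the instruction-following ability of LLMs with gradient-based statistical models to discover goal-oriented latent factors without task supervision. It improves downstream task performance and is preferred in human evaluation.

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference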

Published: 4 May, 2025 at 04:28 PM

59.39 🤔

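This work proposes SpargeAttn, a general-purpose sparse attention mechanism that accelerates inference for a variety of models through a two-stage online filter and quantization, with no loss in end-to-end performance.

Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning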

Published: 4 May, 2025 at 04:27 PM

58.67 🤔

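This paper proposes Reason2Attack, which strengthens an LLM's reasoning through Frame Semantics-based CoT example synthesis and reinforcement learning with an attack-process reward, so that it can efficiently generate adversarial prompts that jailbreak T2I models.

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition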

Published: 4 May, 2025 at 04:32 PM

56.98 🤔

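This paper proposes DeepSeek-Prover-V2, which unifies informal and formal mathematical reasoning through subgoal decomposition and reinforcement learning, significantly improving neural theorem proving and reaching state-of-the-art results on multiple benchmarks.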
