Tag: Efficiency
All the articles with the tag "Efficiency".
-
SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
SelfBudgeter通过自适应令牌预算预测和强化学习优化,在MATH数据集上实现74.47%响应长度压缩,同时保持接近原始准确性,显著提升大型推理模型的效率。
-
Deformable Beta Splatting
Deformable Beta Splatting (DBS) enhances real-time radiance field rendering by introducing deformable Beta Kernels for superior geometric fidelity, Spherical Beta for efficient color encoding, and kernel-agnostic MCMC optimization, achieving state-of-the-art visual quality with 45% fewer parameters and 1.5x faster rendering than 3DGS-MCMC.
-
UnifyFL: Enabling Decentralized Cross-Silo Federated Learning
UnifyFL proposes a decentralized cross-silo federated learning framework using Ethereum blockchain and IPFS to enable trust-based collaboration among organizations, achieving comparable accuracy to centralized FL with flexible aggregation policies and efficient handling of stragglers through synchronous and asynchronous modes.
-
Two Is Better Than One: Rotations Scale LoRAs
本文提出 *RadarGate*,一种基于几何的门控方法,通过旋转和拉伸操作增强 LoRA-MoE 的表达能力,在拟合、泛化和可扩展性方面显著优于现有方法,实验结果在 6 个基准数据集的 21 个任务上得到验证。
-
PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery
PASER提出了一种针对剪枝后大语言模型能力恢复的后训练数据选择方法,通过语义聚类、能力退化感知选择和负面效应缓解,在有限数据预算下显著提升恢复性能并降低计算成本。