Tag: Efficiency

All the articles with the tag "Efficiency".

An Efficient Sparse Kernel Generator for O(3)-Equivariant Deep Networks

Published: 15 May, 2025 at 11:06 AM

88.71 🤔

This paper introduces a GPU sparse kernel generator for the Clebsch-Gordan tensor product in O(3)-equivariant deep networks. Using JIT compilation, static analysis, and kernel fusion, it achieves speedups of up to 10x over e3nn and 1.3x-2.0x over cuEquivariance, with particular gains for computational chemistry models such as NequIP and MACE.

Nonparametric learning of covariate-based Markov jump processes using RKHS techniques

Published: 8 May, 2025 at 12:18 AM

88.71 🤔

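This paper proposes a nonparametric method based on reproducing kernel Hilbert spaces (RKHS) that models covariate-driven nonlinear transition rates in continuous-time Markov chains (CTMC) under both frequentist and Bayesian frameworks, significantly improving the accuracy of individualized state-transition predictions.

Parameter-Efficient Fine-Tuning with Column Space Projection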

Published: 31 May, 2025 at 11:16 AM

88.66 🤔

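This paper proposes PiCa, a parameter-efficient fine-tuning method grounded in spectral properties. By projecting gradients onto the low-rank column subspace of the pre-trained weights and combining this with weight sharing, it outperforms LoRA and SVFT while substantially reducing the number of parameters.

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning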

Published: 30 May, 2025 at 11:15 AM

88.65 🤔

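R1-Searcher++ uses a two-stage training strategy (SFT and RL), combined with a reward mechanism and a memory module, to let large language models adaptively balance internal knowledge against external retrieval, significantly improving accuracy and retrieval efficiency on multi-hop question answering tasks.

Making Small Language Models Efficient Reasoners: Intervention, Supervision, Reinforcement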

Published: 17 May, 2025 at 11:04 AM

88.64 🤔

This paper introduces Temperature Scaling (TS) and Trace Length Control for Dynamic Reasoning (TLDR) to enhance token efficiency in small language models, achieving up to 50% reduction in response length with minimal accuracy loss across multiple reasoning benchmarks.
