Tag: Parameter Efficiency

All the articles with the tag "Parameter Efficiency".

Param$\Delta$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

Published: 4 May, 2025 at 04:30 PM

86.83 😐

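This paper proposes ParamΔ, a method that transfers post-training knowledge to a new base model at zero cost by directly adding the parameter difference, achieving performance comparable to traditional post-training.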
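As a rough illustration of the idea in the summary above, the sketch below adds a parameter difference to a new base checkpoint. It assumes the difference is taken between a post-trained model and the base it was trained from, and that all three checkpoints share the same parameter names and shapes. The function name `apply_param_delta` and the toy checkpoints are hypothetical, and flat Python lists stand in for real weight tensors.

```python
def apply_param_delta(base_old, post_old, base_new):
    """Return base_new + (post_old - base_old), parameter by parameter.

    Each argument is a dict mapping a parameter name to a flat list of floats.
    This is an illustrative sketch, not the paper's released implementation.
    """
    if not (base_old.keys() == post_old.keys() == base_new.keys()):
        raise ValueError("all three checkpoints must share the same parameter names")

    merged = {}
    for name in base_new:
        old, post, new = base_old[name], post_old[name], base_new[name]
        if not (len(old) == len(post) == len(new)):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # The delta (post - old) is what post-training changed; add it to the new base.
        merged[name] = [n + (p - o) for o, p, n in zip(old, post, new)]
    return merged


# Toy checkpoints (hypothetical values chosen to be exact in floating point).
base_old = {"w": [1.0, 2.0], "b": [0.0]}
post_old = {"w": [1.5, 1.0], "b": [0.25]}
base_new = {"w": [2.0, 2.0], "b": [1.0]}

merged = apply_param_delta(base_old, post_old, base_new)
print(merged)
```

No gradient steps or training data are involved here: the merge is a single pass of element-wise arithmetic over the weights, which is the sense in which the summary calls the transfer zero cost.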