Tag: Interpretability

All the articles with the tag "Interpretability".

Empirical Evaluation of Progressive Coding for Sparse Autoencoders

Published: 4 May, 2025 at 04:33 PM

51.36 🤔

本文通过实证评估比较了Matryoshka SAEs和基于字典幂律修剪的方法，以实现SAEs的渐进式编码，提高计算效率、重建保真度和可解释性。