Tag: Efficiency

All the articles with the tag "Efficiency".

CB-cPIR: Code-Based Computational Private Information Retrieval

Published: 8 May, 2025 at 11:08 AM

93.98 🤔

CB-cPIR introduces a code-based single-server computational private information retrieval scheme that enhances security against subquery attacks by using high-weight secret vectors and dual queries, achieving lower communication and computational costs compared to lattice-based schemes like XPIR and SimplePIR.
Test-time Correlation Alignment

Published: 8 May, 2025 at 12:21 AM

93.31 🤔

本文提出测试时相关性对齐（TCA）范式，通过构建伪源域相关性并应用线性变换对齐测试数据特征，显著提升测试时适应（TTA）性能，同时保持高效性和源域知识。
Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models

Published: 8 May, 2025 at 06:12 PM

91.54 🤔

This paper introduces Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning (LS-Mixture SFT), which combines long and short CoT datasets to fine-tune non-reasoning LLMs, achieving a 2.3% average accuracy improvement and 47.61% response length reduction on reasoning benchmarks.
MoM: Linear Sequence Modeling with Mixture-of-Memories

Published: 8 May, 2025 at 06:19 PM

89.33 🤔

The Mixture-of-Memories (MoM) architecture introduces multiple independent memory states with a routing mechanism to enhance memory capacity and reduce interference in linear sequence modeling, achieving significant performance gains over other linear models on recall-intensive tasks and nearing Transformer performance at larger scales while maintaining efficiency.
Always Skip Attention

Published: 8 May, 2025 at 11:06 AM

89.20 🤔

This paper theoretically demonstrates the ill-conditioning of Self-Attention Blocks in Vision Transformers without skip connections, highlights their role as regularizers, and proposes Token Graying (SVD and DCT) to improve input token conditioning, achieving modest performance gains in supervised and self-supervised tasks.

Tag: Efficiency

CB-cPIR: Code-Based Computational Private Information Retrieval

Test-time Correlation Alignment

Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models

MoM: Linear Sequence Modeling with Mixture-of-Memories

Always Skip Attention