Tag: Speculative Decoding

All the articles with the tag "Speculative Decoding".

Accelerating Large Language Model Reasoning via Speculative Search

Published: 13 May, 2025 at 11:12 AM

78.41 🤔

Speculative Search (SpecSearch) accelerates LLM reasoning by up to 2.12× through a bi-level speculative thought generator that collaborates between small and large models, maintaining comparable reasoning quality via a quality-preserving rejection mechanism.