
DistillSpec: Speculative Decoding Method
Oct 22, 2025 · The paper introduces a framework that aligns a lightweight draft model with a large target model, improving token acceptance rates and achieving a 10–45% inference speedup. It …
[2503.07807] Training Domain Draft Models for Speculative Decoding ...
Mar 10, 2025 · However, when adapting speculative decoding to domain-specific target models, the acceptance rate of the generic draft model drops significantly due to domain shift. In this …
ABSTRACT Speculative decoding (SD) accelerates large language model inference by employing a faster draft model to generate multiple tokens, which are then verified in parallel by the …
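The snippets above describe the core speculative decoding loop: a draft model proposes several tokens and the target model verifies them in parallel. A minimal sketch of the standard accept/reject rule (accept each drafted token with probability min(1, p_target/p_draft), stopping at the first rejection); the toy tokens and probabilities are illustrative assumptions, not from any of the papers:

```python
import random

def verify_draft(draft_tokens, draft_p, target_p, rng=random.random):
    """Accept each drafted token with prob min(1, p_target/p_draft);
    stop at the first rejection. Returns the accepted prefix."""
    accepted = []
    for tok, dp, tp in zip(draft_tokens, draft_p, target_p):
        if rng() < min(1.0, tp / dp):
            accepted.append(tok)
        else:
            break  # first rejection ends this speculative run
    return accepted

# Hypothetical example: three drafted tokens with draft/target probabilities.
tokens = ["the", "cat", "sat"]
p_draft = [0.50, 0.40, 0.30]
p_target = [0.60, 0.10, 0.30]
print(verify_draft(tokens, p_draft, p_target, rng=lambda: 0.3))  # → ['the']
```

The acceptance rate the snippets keep referring to is exactly the fraction of drafted tokens that survive this check, which is why aligning the draft's distribution with the target's speeds up decoding.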
DistillSpec: Improving speculative decoding via knowledge distillation
Nonetheless, identifying an accurate, compact draft model that is well aligned with the target model is challenging. To address this, we propose leveraging white-box knowledge distillation, …
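White-box distillation, as mentioned above, trains the draft on the target's full next-token distribution rather than only its sampled outputs. A minimal sketch using a forward KL objective over one next-token distribution; the choice of forward KL and the toy logits are assumptions for illustration, not the paper's exact objective:

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def forward_kl(target_logits, draft_logits):
    """KL(p_target || p_draft) for one next-token distribution;
    minimizing this pulls the draft's distribution toward the target's."""
    p = softmax(target_logits)
    q = softmax(draft_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# Identical logits give zero divergence; mismatched logits give a positive loss.
print(round(forward_kl([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]), 6))  # → 0.0
```

In training one would average this per-token loss over a corpus and backpropagate through the draft model only.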
Training Domain Draft Models for Speculative Decoding: Best …
Jan 1, 2025 · Download paper here. Recommended citation: Fenglu Hong, Ravi Raju, Jonathan Lingjie Li, Bo Li, Urmish Thakker, Avinash Ravichandran, and …
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
Finally, in practical scenarios with models of varying sizes, one can first use distillation to boost the performance of the target model and then apply DistillSpec to train a well-aligned draft …
Training Domain Draft Models for Speculative Decoding: Best …
Mar 11, 2025 · The research investigates optimal methods for training draft models used in speculative decoding. The authors experiment with various training methods, focusing on …
AdaSPEC: Selective Knowledge Distillation for Efficient Spec
Adaptive selective distillation for faster speculative decoding. At first glance, the efficiency trick called speculative decoding looks straightforward: a smaller draft model proposes tokens and …
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative ...
Oct 22, 2025 · AdaSPEC is a novel method that enhances speculative decoding by selectively filtering difficult tokens during knowledge distillation, resulting in improved token acceptance …
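The AdaSPEC snippet describes selectively filtering difficult tokens out of the distillation loss. A minimal sketch of that idea under stated assumptions: given per-token distillation losses, keep only the easiest fraction and mask the rest; the keep ratio and loss values are illustrative, not the paper's settings:

```python
def select_tokens(per_token_loss, keep_ratio=0.8):
    """Return a 0/1 mask keeping the keep_ratio easiest tokens
    (lowest distillation loss); hard tokens are masked out of training."""
    k = max(1, int(len(per_token_loss) * keep_ratio))
    cutoff = sorted(per_token_loss)[k - 1]
    return [1 if loss <= cutoff else 0 for loss in per_token_loss]

# Hypothetical per-token distillation losses; the two hardest are dropped.
losses = [0.1, 2.5, 0.3, 0.2, 5.0]
print(select_tokens(losses, keep_ratio=0.6))  # → [1, 0, 1, 1, 0]
```

The intuition matching the snippet: a small draft model cannot match the target everywhere, so spending its capacity on learnable tokens raises the overall acceptance rate more than forcing it to fit the hardest ones.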