No, speculative decoding has exactly the same accuracy as the target model. With greedy verification, its output is token-for-token identical to greedy decoding with the target model alone.
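You will see that any draft token the target model would not itself have picked under greedy decoding is rejected and replaced with the target model's own choice. Ergo, the final sequence is exactly the one the target model would have produced on its own; the draft model only affects speed.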
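The rejection rule can be sketched with toy models. This is only an illustration under stated assumptions: `target_next` and `draft_next` are hypothetical stand-ins for "argmax over the model's logits", not a real library API, and a real system would verify all drafted positions in one batched forward pass of the target model rather than one call per position.

```python
VOCAB = 50  # toy vocabulary size (assumption for this sketch)

def target_next(context):
    # Hypothetical stand-in for the target model's greedy (argmax) next token.
    return (7 * sum((i + 1) * t for i, t in enumerate(context)) + 3) % VOCAB

def draft_next(context):
    # Hypothetical cheaper draft model: agrees with the target except
    # when the context length is a multiple of 3, where it guesses wrong.
    guess = target_next(context)
    return guess if len(context) % 3 else (guess + 1) % VOCAB

def greedy_decode(prompt, n_new):
    # Plain greedy decoding with the target model only.
    tokens = list(prompt)
    for _ in range(n_new):
        tokens.append(target_next(tokens))
    return tokens

def speculative_decode(prompt, n_new, k=4):
    # Greedy speculative decoding: draft k tokens, keep the prefix the
    # target agrees with, and take the target's token at the first mismatch.
    tokens = list(prompt)
    limit = len(prompt) + n_new
    while len(tokens) < limit:
        # 1. The draft model proposes k tokens autoregressively.
        proposal = []
        for _ in range(k):
            proposal.append(draft_next(tokens + proposal))
        # 2. The target model checks each proposed position.
        for i, tok in enumerate(proposal):
            expected = target_next(tokens + proposal[:i])
            if tok != expected:
                # Reject this token and everything after it;
                # use the target's own token instead.
                tokens.extend(proposal[:i])
                tokens.append(expected)
                break
        else:
            # Every draft token was accepted; the target adds one more.
            tokens.extend(proposal)
            tokens.append(target_next(tokens))
    return tokens[:limit]

prompt = [1, 2, 3]
same = speculative_decode(prompt, 20) == greedy_decode(prompt, 20)
print(same)
```

Every token that ends up in the output is either a draft token the target agreed with or the target's own token, so the two sequences match regardless of how good or bad the draft model is.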