mlx-community/speculative-decoding

Releases: 0
Stars: 8
Native speculative decoding implementation for fast LLM inference on Apple Silicon using MLX-Swift.
0 releases
Version | Date | Stability
Stability is determined by the version string and may be inaccurate.