
mlx-community/speculative-decoding
Releases0
Stars8
Native speculative decoding implementation for fast LLM inference on Apple Silicon using MLX-Swift.
Subscribe above to receive notifications when new versions are released.
0 releases
| Version | Date | Stability Stability is determined by the version string and my be inaccurate. |
|---|