mlx-community/speculative-decoding

Releases: 0

Stars: 8

Native speculative decoding implementation for fast LLM inference on Apple Silicon using MLX-Swift.

0 releases

	Version	Date	Stability

Note: Stability is determined by the version string and may be inaccurate.