
mlx-community/speculative-decoding
Releases0
Stars8
Native speculative decoding implementation for fast LLM inference on Apple Silicon using MLX-Swift.
Collections containing this project
Showing collections based on your access.
This project is not in any collections you can view.