mlx-community/speculative-decoding

mlx-community/speculative-decoding

Releases0
Stars8
Native speculative decoding implementation for fast LLM inference on Apple Silicon using MLX-Swift.

Collections containing this project

Showing collections based on your access.

This project is not in any collections you can view.