
Michael-A-Kuykendall/shimmy
Releases39
Frequency1 week 2 hours
Last Release
Stars5.32K
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
Collections containing this project
Showing collections based on your access.
This project is not in any collections you can view.