Michael-A-Kuykendall/shimmy

Releases: 39
Frequency: 1 week 2 hours
Last Release
Stars: 5.32K
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
