
Michael-A-Kuykendall/shimmy
Releases39
Frequency1 week 2 hours
Last Release
Stars5.32K
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
Subscribe above to receive notifications when new versions are released.
| Version | Date | Stability Stability is determined by the version string and my be inaccurate. | |
|---|---|---|---|
| v2.1.0 | Stable | ||
| v2.0.1 | Stable | ||
| v2.0.0 | Stable | ||
| archive/pre-v2.0.0-history | RC | ||
| archive/llama-cpp-era-v1.9.0 | RC | ||
| v1.9.0 | Stable | ||
| v1.8.2 | Stable | ||
| v1.8.1 | Stable | ||
| v1.8.0 | Stable | ||
| v1.7.4 | Stable | ||
| v1.7.3 | Stable | ||
| v1.7.2 | Stable | ||
| v1.7.2-test6 | Unknown | ||
| v1.7.2-test5 | Unknown | ||
| v1.7.2-test4 | Unknown | ||
| v1.7.0 | Stable | ||
| v1.6.0 | Stable | ||
| v1.5.6 | Stable | ||
| v1.5.5 | Stable | ||
| v1.4.2 | Stable | ||
| v1.5.4 | Stable | ||
| v1.5.3 | Stable | ||
| v1.5.2 | Stable | ||
| v1.5.1 | Stable | ||
| v1.5.0 | Stable |