Michael-A-Kuykendall/shimmy

Michael-A-Kuykendall/shimmy

Releases39
Frequency1 week 2 hours
Last Release
Stars5.32K
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
v2.1.0 Stable
v2.0.1 Stable
v2.0.0 Stable
archive/pre-v2.0.0-history RC
archive/llama-cpp-era-v1.9.0 RC
v1.9.0 Stable
v1.8.2 Stable
v1.8.1 Stable
v1.8.0 Stable
v1.7.4 Stable
v1.7.3 Stable
v1.7.2 Stable
v1.7.2-test6 Unknown
v1.7.2-test5 Unknown
v1.7.2-test4 Unknown
v1.7.0 Stable
v1.6.0 Stable
v1.5.6 Stable
v1.5.5 Stable
v1.4.2 Stable
v1.5.4 Stable
v1.5.3 Stable
v1.5.2 Stable
v1.5.1 Stable
v1.5.0 Stable
Previous12Next