Michael-A-Kuykendall/shimmy
GitHub (github.com)
Releases: 39
Frequency: 1 week 2 hours
Last Release: 3 days ago (Monday, 1 June 2026, 18:50)
Stars: 5.32K
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
Linked projects
shimmy
Crates.io
Lightweight Ollama-compatible inference server with native SafeTensors support. No Python dependencies, cross-platform WebGPU acceleration via Airframe.