vllm-project/vllm

vllm-project/vllm

Releases167
Frequency6 days 23 hours
Last Release
Stars83.5K
A high-throughput and memory-efficient inference and serving engine for LLMs
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
v0.19.0 Stable
v0.19.0rc1 RC
v0.19.0rc0 RC
v0.18.2rc0 RC
v0.18.2 Stable
v0.18.1 Stable
v0.18.1rc0 RC
v0.18.0 Stable
v0.18.0rc2 RC
v0.18.0rc1 RC
v0.17.2rc0 RC
v0.18.0rc0 RC
v0.17.1 Stable
v0.17.1rc0 RC
v0.17.0 Stable
v0.17.0rc1 RC
v0.17.0rc0 RC
v0.16.1rc0 RC
v0.16.0 Stable
qwen3_5 Stable
v0.16.0rc3 RC
v0.16.0rc2 RC
v0.16.0rc1 RC
v0.15.2rc0 RC
v0.15.1 Stable