vllm-project/vllm

vllm-project/vllm

Releases167
Frequency6 days 23 hours
Last Release
Stars83.5K
A high-throughput and memory-efficient inference and serving engine for LLMs
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
v0.15.1rc1 RC
v0.15.1rc0 RC
v0.16.0rc0 RC
v0.15.0 Stable
v0.15.0rc3 RC
v0.15.0rc2 RC
v0.15.0rc1 RC
v0.15.0rc0 RC
v0.14.1 Stable
v0.14.0 Stable
v0.14.0rc2 RC
v0.14.0rc1 RC
v0.14.0rc0 RC
v0.13.0 Stable
v0.13.0rc4 RC
v0.13.0rc3 RC
v0.13.0rc2 RC
v0.13.0rc1 RC
v0.12.0 Stable
v0.11.2 Stable
v0.11.1.1 Stable
v0.11.1 Stable
v0.11.1rc7 RC
v0.11.1rc6 RC
v0.11.1rc5 RC