vllm-project/vllm

Releases: 167
Frequency: 6 days 23 hours
Last Release
Stars: 83.5K
A high-throughput and memory-efficient inference and serving engine for LLMs
Stability is determined by the version string and may be inaccurate.
Version Stability
v0.9.2rc1 RC
v0.9.1 Stable
v0.9.1rc2 RC
v0.9.1rc1 RC
v0.9.0.1 Stable
v0.9.0 Stable
v0.8.5.post1 Unknown
v0.8.5 Stable
v0.8.4 Stable
v0.8.3 Stable
v0.8.3rc1 RC
v0.8.2 Stable
v0.8.1 Stable
v0.8.0 Stable
v0.8.0rc2 RC
v0.8.0rc1 RC
v0.7.3 Stable
v0.7.2 Stable
v0.7.1 Stable
v0.7.0 Stable
v0.6.6 Stable
v0.6.6.post1 Unknown
v0.6.5 Stable
v0.6.4 Stable
v0.6.4.post1 Unknown
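
The labels in the list above are consistent with a simple rule on the version string: a plain dotted numeric version is Stable, an `rcN` suffix is RC, and anything else (such as `.post1`) is Unknown. The following is a minimal sketch of such a rule, inferred only from the entries shown here. The function name `classify_stability` and the regular expressions are assumptions for illustration, not the tracker's actual implementation.

```python
import re

# Hypothetical helper: the rule is inferred from the listed versions,
# not taken from the tracker's real code.
def classify_stability(version: str) -> str:
    """Guess a stability label from a version string such as 'v0.9.1rc2'."""
    v = version.lstrip("v")  # drop the leading 'v' prefix
    if re.fullmatch(r"\d+(\.\d+)*rc\d+", v):
        return "RC"  # e.g. 0.9.2rc1
    if re.fullmatch(r"\d+(\.\d+)*", v):
        return "Stable"  # e.g. 0.9.1 or 0.9.0.1
    return "Unknown"  # e.g. 0.8.5.post1


if __name__ == "__main__":
    for tag in ["v0.9.2rc1", "v0.9.0.1", "v0.8.5.post1"]:
        print(tag, classify_stability(tag))
```

Under this assumed rule, post-releases fall through to Unknown because they match neither pattern, which is one way the label can be inaccurate for a release that is in practice stable.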