vllm

Python Package Index

Releases: 93

Frequency: 1 week 5 days

Last Release: about 15 hours ago

A high-throughput and memory-efficient inference and serving engine for LLMs

Linked projects

vllm-project/vllm

Python Package Index
