vllm

vllm

Python Package Index

Releases93

Frequency1 week 5 days

Last Releaseabout 15 hours ago

A high-throughput and memory-efficient inference and serving engine for LLMs

Log in to subscribe

Collections containing this project

Showing collections based on your access.

This project is not in any collections you can view.