Releases0
A high-throughput and memory-efficient inference and serving engine for LLMs
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
PreviousNext