
xorbitsai/inference
Releases138
Frequency1 week 15 hours
Last Release
Stars9.33K
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
Collections containing this project
Showing collections based on your access.
This project is not in any collections you can view.