The Open Source AI Orchestration Framework
What is MLRun?
Integrations
Resources
MLRun Hub
Docs
Blog
Get Started
Vllm module
modules
Use module
GitHub
View source
Documentation
Example Notebook
View source
Updated 11 days ago
Description
Deploys a vLLM OpenAI-compatible LLM server as an MLRun application runtime, with configurable GPU usage, node selection, tensor parallelism, and runtime flags.
module Version
latest
latest
1.0.0
MLRun Version
1.10.0
Kind
generic
Tags
genai
Publisher
Iguazio