Preview Flyte 2 for productionHosted on Union.ai
vLLM

vLLM

flyteplugins-vllm

Flyte SDK (v2)Model Servingvllminferencellmservinggpu

Serve large language models using vLLM with Flyte Apps.

Install
pip install flyteplugins-vllm

Quick Start(example, may need adjustment)

pip install flyteplugins-vllm

from flytekit import task, workflow
from flyteplugins.vllm import DEFAULT_VLLM_IMAGE, VLLMAppEnvironment

config = DEFAULT_VLLM_IMAGE(...)

@task
def my_task() -> None:
    ...

@workflow
def my_workflow() -> None:
    my_task()

Available Imports (2)

typeDEFAULT_VLLM_IMAGE

Configuration type for vLLM.

from flyteplugins.vllm import DEFAULT_VLLM_IMAGE

configVLLMAppEnvironment

App environment backed by vLLM for serving large language models.

extends dataclass — configuration or data structure for plugin setup

from flyteplugins.vllm import VLLMAppEnvironment

Related Plugins

Package Info

Min Flyte SDK
Modules2

Downloads

Last day58
Last week265
Last month893

Contributors