Model Serving
Model inference, serving, and deployment · 9 plugins
Dgxc-lepton
Flytekitflytekitplugins-dgxc-lepton
A professional Flytekit plugin that enables seamless deployment and management of AI inference endpoints using Lepton AI infrastructure within Flyte workflows.
Inference
Flytekitflytekitplugins-inference
Serve models natively in Flyte tasks using inference providers like NIM, Ollama, and others.
ONNX PyTorch
Flytekitflytekitplugins-onnxpytorch
This plugin allows you to generate ONNX models from your PyTorch models.
ONNX ScikitLearn
Flytekitflytekitplugins-onnxscikitlearn
This plugin allows you to generate ONNX models from your ScikitLearn models.
ONNX TensorFlow
Flytekitflytekitplugins-onnxtensorflow
This plugin allows you to generate ONNX models from your TensorFlow Keras models.
OpenAI
Flytekitflytekitplugins-openai
The plugin currently features ChatGPT and Batch API connectors.
OpenAI
v2Flyte SDK (v2)flyteplugins-openai
This plugin provides a drop-in replacement for OpenAI packages. It provides
SGLang
v2Flyte SDK (v2)flyteplugins-sglang
Serve large language models using SGLang with Flyte Apps.
vLLM
v2Flyte SDK (v2)flyteplugins-vllm
Serve large language models using vLLM with Flyte Apps.