Preview Flyte 2 for productionHosted on Union.ai
Back to all plugins

ML Training

Distributed training frameworks and compute engines · 10 plugins

Dask

Dask

Flytekit

flytekitplugins-dask

Flyte can execute dask jobs natively on a Kubernetes Cluster, which manages the virtual dask cluster's lifecycle

3 types
3 modules
1.2K/mo
1.3.0+
Dask

Dask

v2Flyte SDK (v2)

flyteplugins-dask

Flyte can execute dask jobs natively on a Kubernetes Cluster, which manages the virtual dask cluster's lifecycle

3 types
3 modules
1.8K/mo
Kubeflow MPI

Kubeflow MPI

Flytekit

flytekitplugins-kfmpi

This plugin uses the Kubeflow MPI Operator and provides an extremely simplified interface for executing distributed training.

5 types
2 tasks
7 modules
28.9K/mo
1.6.1+
Kubeflow PyTorch

Kubeflow PyTorch

Flytekit

flytekitplugins-kfpytorch

This plugin uses the Kubeflow PyTorch Operator and provides an extremely simplified interface for executing distributed training using various PyTorch backends.

7 types
7 modules
26.0K/mo
1.6.1+
Kubeflow TensorFlow

Kubeflow TensorFlow

Flytekit

flytekitplugins-kftensorflow

This plugin uses the Kubeflow TensorFlow Operator and provides an extremely simplified interface for executing distributed training using various TensorFlow backends.

7 types
1 task
8 modules
1.4K/mo
1.6.1+
pytorch

pytorch

v2Flyte SDK (v2)

flyteplugins-pytorch

Union can execute PyTorch distributed training jobs natively on a Kubernetes Cluster, which manages the lifecycle of worker pods, rendezvous coordination, spin-up, and tear down. It leverages the open-sourced TorchElastic (torch.distributed.elastic) launcher and the Kubeflow PyTorch Operator, enabling fault-tolerant and elastic training across multiple nodes.

1 type
1 module
945/mo
Ray

Ray

Flytekit

flytekitplugins-ray

Flyte backend can be connected with Ray. Once enabled, it allows you to run flyte task on Ray cluster

2 types
1 task
3 modules
84.1K/mo
Ray

Ray

v2Flyte SDK (v2)

flyteplugins-ray

Union can execute Ray jobs natively on a Kubernetes Cluster,

2 types
1 task
3 modules
2.0K/mo
Spark

Spark

Flytekit

flytekitplugins-spark

Flyte can execute Spark jobs natively on a Kubernetes Cluster, which manages a virtual cluster’s lifecycle, spin-up, and tear down. It leverages the open-sourced Spark On K8s Operator and can be enabled without signing up for any service. This is like running a transient spark cluster — a type of cluster spun up for a specific Spark job and torn down after completion.

10 types
1 agent
1 task
12 modules
80.6K/mo
1.15.1+
Spark

Spark

v2Flyte SDK (v2)

flyteplugins-spark

Union can execute Spark jobs natively on a Kubernetes Cluster, which manages a virtual cluster’s lifecycle, spin-up, and tear down. It leverages the open-sourced Spark On K8s Operator and can be enabled without signing up for any service. This is like running a transient spark cluster — a type of cluster spun up for a specific Spark job and torn down after completion.

1 type
1 module
2.9K/mo