Modin
flytekitplugins-modin
Modin is a pandas-accelerator that helps handle large datasets. It is a light-weight extension that is similar to the pandas API. It uses the concept of parallelism to reduce overhead, and improve the performance of pandas operations by leveraging the compute resources available.
pip install flytekitplugins-modinQuick Start(example, may need adjustment)
See full examplespip install flytekitplugins-modin
from flytekit import task, workflow
from flytekitplugins.modin import ModinPandasDataFrameTransformer
config = ModinPandasDataFrameTransformer(...)
@task
def my_task() -> None:
...
@workflow
def my_workflow() -> None:
my_task()Available Imports (1)
Transforms ModinPandas DataFrame's to and from a Schema (typed/untyped).
extends TypeTransformer — converts python types to/from flyte-native types
from flytekitplugins.modin import ModinPandasDataFrameTransformer
Dependencies
Related Plugins
polars
This plugin provides native support for Polars DataFrames and LazyFrames in Flyte, enabling efficient data processing with Polars' high-performance DataFrame library.
Polars
Polars is a blazingly fast DataFrames library implemented in Rust using Apache Arrow Columnar Format as memory model.
Vaex
Vaex is a high-performance Python library for lazy out-of-core DataFrames
Async FSSpec
The Flyte async fsspec plugin is a powerful addition to the Flyte ecosystem designed to optimize the performance of object transmission. This plugin focuses on overriding key methods of the file systems in fsspec to introduce efficiency improvements, resulting in accelerated data transfers between Flyte workflows and object storage.