Vaex
flytekitplugins-vaex
Vaex is a high-performance Python library for lazy out-of-core DataFrames
pip install flytekitplugins-vaexQuick Start(example, may need adjustment)
pip install flytekitplugins-vaex
from flytekit import task, workflow
from flytekitplugins.vaex import ParquetToVaexDataFrameDecodingHandler, VaexDataFrameToParquetEncodingHandler
config = ParquetToVaexDataFrameDecodingHandler(...)
@task
def my_task() -> None:
...
@workflow
def my_workflow() -> None:
my_task()Available Imports (2)
Configuration type for Vaex.
from flytekitplugins.vaex import ParquetToVaexDataFrameDecodingHandler
Configuration type for Vaex.
from flytekitplugins.vaex import VaexDataFrameToParquetEncodingHandler
Dependencies
Related Plugins
Modin
Modin is a pandas-accelerator that helps handle large datasets. It is a light-weight extension that is similar to the pandas API. It uses the concept of parallelism to reduce overhead, and improve the performance of pandas operations by leveraging the compute resources available.
polars
This plugin provides native support for Polars DataFrames and LazyFrames in Flyte, enabling efficient data processing with Polars' high-performance DataFrame library.
Polars
Polars is a blazingly fast DataFrames library implemented in Rust using Apache Arrow Columnar Format as memory model.
Async FSSpec
The Flyte async fsspec plugin is a powerful addition to the Flyte ecosystem designed to optimize the performance of object transmission. This plugin focuses on overriding key methods of the file systems in fsspec to introduce efficiency improvements, resulting in accelerated data transfers between Flyte workflows and object storage.