Data & DataFrame
DataFrame libraries and data processing tools · 8 plugins
Async FSSpec
Flytekitflytekitplugins-async-fsspec
The Flyte async fsspec plugin is a powerful addition to the Flyte ecosystem designed to optimize the performance of object transmission. This plugin focuses on overriding key methods of the file systems in fsspec to introduce efficiency improvements, resulting in accelerated data transfers between Flyte workflows and object storage.
FSSpec
Flytekitflytekitplugins-data-fsspec
This plugin provides an implementation of the data persistence layer in Flytekit that uses fsspec. Once this plugin
Geopandas
Flytekitflytekitplugins-geopandas
GeoPandas GeoPandas is an open source project to make working with geospatial data in python easier.
Hugging Face
Flytekitflytekitplugins-huggingface
Hugging Face is a community and data science platform that provides: Tools that enable users to build, train and deploy ML models based on open source (OS) code and technologies
Modin
Flytekitflytekitplugins-modin
Modin is a pandas-accelerator that helps handle large datasets. It is a light-weight extension that is similar to the pandas API. It uses the concept of parallelism to reduce overhead, and improve the performance of pandas operations by leveraging the compute resources available.
polars
v2Flyte SDK (v2)flyteplugins-polars
This plugin provides native support for Polars DataFrames and LazyFrames in Flyte, enabling efficient data processing with Polars' high-performance DataFrame library.
Polars
Flytekitflytekitplugins-polars
Polars is a blazingly fast DataFrames library implemented in Rust using Apache Arrow Columnar Format as memory model.
Vaex
Flytekitflytekitplugins-vaex
Vaex is a high-performance Python library for lazy out-of-core DataFrames