Hive
flytekitplugins-hive
The Flyte backend can be connected to various Hive services. Once enabled, the plugin lets you query a Hive service (e.g., Qubole) and optionally retrieve a typed schema.
pip install flytekitplugins-hive
Quick Start (example, may need adjustment)
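from flytekit import workflow
from flytekitplugins.hive import HiveConfig, HiveTask

# HiveTask is instantiated directly; it is not passed to @task as task_config.
# The task name, cluster label, and query below are illustrative placeholders.
my_task = HiveTask(
    name="my_hive_task",
    query_template="SELECT 1",
    task_config=HiveConfig(cluster_label="my-cluster"),
)

@workflow
def my_workflow() -> None:
    my_task()
Available Imports (3)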
HiveConfig should be used to configure a Hive Task.
extends dataclass — configuration or data structure for plugin setup
from flytekitplugins.hive import HiveConfig
A Hive task specialized for SELECT queries.
from flytekitplugins.hive import HiveSelectTask
This is the simplest form of a Hive Task, that can be used even for tasks that do not produce any output.
extends SQLTask — a flyte task that can be used in workflows
from flytekitplugins.hive import HiveTask
Related Plugins
Bigquery
BigQuery enables us to build data-intensive applications without operational burden. The Flyte backend can be connected to the BigQuery service. Once enabled, it allows you to query a BigQuery table.
BigQuery
This plugin provides BigQuery integration for Flyte, enabling you to run BigQuery queries as Flyte tasks.
Dolt
The DoltTable plugin is a wrapper that uses Dolt to move data between pandas.DataFrame objects at execution time and database tables at rest.
DuckDB
Run analytical workloads with ease using DuckDB.