Parallel computing with task scheduling
- Updated Mar 12, 2026
- Python
cuDF - GPU DataFrame Library
N-D labeled arrays and datasets in Python
STUMPY is a powerful and scalable Python library for modern time series analysis
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
A distributed task scheduler for Dask
Lightweight and extensible compatibility layer between dataframe libraries!
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Scalable machine 🤖 learning for time series forecasting.
Python package for earth-observing satellite data processing
Eliot: the logging system that tells you *why* it happened
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Fast data store for Pandas time-series data
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Distributed SQL Engine in Python using Dask
Library of derived climate variables, i.e. climate indicators, based on xarray.
Geospatial image resampling in Python
A full pipeline AutoML tool for tabular data