51 Open Source Dask Software Projects
Free and open source dask code projects including engines, APIs, generators, and tools.
Stumpy 1479 ⭐
STUMPY is a powerful and scalable Python library for computing a Matrix Profile, which can be used for a variety of time series data mining tasks
Swifter 1400 ⭐
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Ironmussa Optimus 939 ⭐
:truck: Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Dask Knit 53 ⭐
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Jsybrandt Agatha 41 ⭐
AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach
Opendataanalytics Gaia 29 ⭐
Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
Daskperiment 27 ⭐
Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.
Cesm Lens Aws 20 ⭐
Examples of analysis of CESM LENS data publicly available on Amazon S3 (us-west-2 region) using xarray and dask
Arboreto 20 ⭐
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Dvc_dask_use_case 15 ⭐
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
Mpes 16 ⭐
Distributed data processing routines for multidimensional photoemission spectroscopy (MPES)
Mercat 12 ⭐
MerCat: python code for versatile k-mer counting and diversity estimation for database independent property analysis for meta -ome data
Bumblebee 75 ⭐
🚕 A spreadsheet-like data preparation web app that works over Optimus (pandas, dask, cuDF, dask-cuDF and PySpark)