55 Open Source Data Wrangling Software Projects
Free and open source data wrangling code projects including engines, APIs, generators, and tools.
Openrefine 7651 ⭐
OpenRefine is a free, open source power tool for working with messy data and improving it
Ironmussa Optimus 939 ⭐
:truck: Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Data Forge Ts 899 ⭐
Prose 441 ⭐
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Cracking The Data Science Interview 304 ⭐
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Data Forge JS 140 ⭐
Data Analysis Using Python 75 ⭐
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
Uc R.github.io 74 ⭐
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
Data Wrangling With Python 41 ⭐
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Data Science 101 17 ⭐
Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Pranavsuri Data Analyst Nanodegree 12 ⭐
This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Xplore 13 ⭐
A python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
Null 13 ⭐
Functions matter! Prosto is a data processing toolkit radically changing how data is processed by relying on both set and function operations. No join-groupby, No map-reduce.