47 Open Source Data Transformation Software Projects
Free and open source data transformation code projects including engines, APIs, generators, and tools.
Glom 1450 ⭐
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
Ironmussa Optimus 1173 ⭐
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Transformerkit 847 ⭐
A block-based API for NSValueTransformer, with a growing collection of useful examples.
Scriptfusion Porter 553 ⭐
:lipstick: Scalable and durable all-purpose data import abstraction for publishing testable APIs and SDKs.
Prose 525 ⭐
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Pglogical 554 ⭐
Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Big Data Mapreduce Course 107 ⭐
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Php Serializer 45 ⭐
Serialize PHP variables, including objects, in any format. Support to unserialize it too.
Jim Schwoebel Allie 90 ⭐
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Daany 45 ⭐
Daany - .NET DAta ANalYtics .NET 5 library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.
Datapackage M 26 ⭐
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
Richflow 16 ⭐
Opportus Object Mapper 16 ⭐
Maps generically data from source to target object via extensible strategies and controls
Odpf Optimus 576 ⭐
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Jupyter Naas Naas 155 ⭐
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Clojure Dsl Resources 96 ⭐
A curated list of Clojure resources for dealing with domain-specific languages.
Fastverse 87 ⭐
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Hk Atm Locator 10 ⭐
:atm: 香港自動櫃員機定位器 :atm: Centralising Automated Teller Machine (ATM) Data in Hong Kong in a well-defined yet standardised format and display in a web portal for public use