80 Open Source Data Analytics Software Projects
Free and open source data analytics code projects including engines, APIs, generators, and tools.
Awesome Bigdata 9344 ⭐
A curated list of awesome big data frameworks, resources and other awesomeness.
Arx Deidentifier Arx 357 ⭐
ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness and differential privacy.
Flashx 219 ⭐
FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.
Koolreport 204 ⭐
An open source PHP reporting framework that you can use to write data reports or build dashboards in PHP.
Gspread Pandas 197 ⭐
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Iamsivab Data Science Resources 136 ⭐
Learn what data science is and why it's important in today's world. Are you interested in data science?
Featurelabs Compose 143 ⭐
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Data Analysis Using Python 75 ⭐
Exploratory data analysis in Python of a used car database taken from Kaggle
Basketball_analytics 74 ⭐
A repository containing various scripts and other work with basketball statistics
Dbiir Rainbow 62 ⭐
A data layout optimization framework for wide tables stored on HDFS. See rainbow's webpage
Data Wrangling With Python 41 ⭐
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Kdd2018 Tutorial 36 ⭐
Companion repository for the KDD'18 hands-on tutorial on Higher-Order Data Analytics for Temporal Network Data
Skytrax Data Warehouse 33 ⭐
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
Time Series Analysis Of Inflation Rates Using Shinydashboard 26 ⭐
This project aims to study and analyze the inflation rates of countries globally. The dataset is a public dataset downloaded from the International Monetary Fund (IMF), which consists of the inflation rates of countries from 1980 to 2017 and the projected inflation rates of the countries until 2022. Finally, I will produce a dashboard built in R to visualize and analyze the inflation rates.
Dev Decal Spring 2020 25 ⭐
CS 198-077 Blockchain for Developers DeCal, taught at UC Berkeley in spring 2020 by the Blockchain at Berkeley Education Department.
Fraud Detection In Online Transactions 24 ⭐
Detecting fraud in online transactions using anomaly detection techniques such as oversampling and undersampling. Because the ratio of frauds is less than 0.00005, simply applying a classification algorithm may result in overfitting.
Datapackage M 22 ⭐
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
The Applied Sql Data Analytics Workshop 19 ⭐
A Quick, Interactive Approach to Learning Analytics with SQL
Tauferlab Mimir 18 ⭐
Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI, while redesigning the execution model to incorporate a number of sophisticated optimization techniques that achieve similar or better performance with significant reduction in the amount of memory used.
Analytics With Kafka Redshift Metabase 17 ⭐
An example system that captures a large stream of product usage data, or events, and provides both real-time data visualization and SQL-based data analytics.
Pyplan Ide 16 ⭐
Pyplan is a graphical Integrated Development Environment for creating and sharing Data Analytics Apps.
Tirthajyoti Mlr 12 ⭐
Multiple linear regression with statistical inference, residual analysis, direct CSV loading, and other features
Churn Modelling Dataset 13 ⭐
Predicting which customers are going to churn from the organization by looking at some of the important attributes and applying machine learning and deep learning to them.
Awesome Qlik 11 ⭐
A curated list of awesome Qlik extensions and resources for Qlik Sense and QlikView
Coe Industry Day 11 ⭐
Information on the Phase II Industry Day for the Centers of Excellence at USDA.
Vit_university 10 ⭐
This repository contains teaching materials for Data Analytics at VIT University in Vellore India
Whatscloud 10 ⭐
WhatsCloud is an Android app that lets you analyze your WhatsApp chat history on the fly with only one click
Data Scince Ml Project 10 ⭐
This repository contains many data science and machine learning projects (such as Deep Dream, weather prediction, and a movie recommender system) with code and datasets
Danfo.js 978 ⭐