Open Source Libs
Find Open Source Packages
Open Source Libraries
👉
Exploratory Data Analysis
113 Open Source Exploratory Data Analysis Software Projects
Free and open source exploratory data analysis code projects including engines, APIs, generators, and tools.
Pandas Profiling
8434 ⭐
Create HTML profiling reports from pandas DataFrame objects
Great_expectations
5987 ⭐
Always know what to expect from your data.
Scattertext
1745 ⭐
Beautiful visualizations of how language differs among document types.
Sweetviz
1897 ⭐
Visualize and compare datasets, target values and associations, with one line of code.
Data Science Your Way
552 ⭐
Ways of doing Data Science Engineering and Machine Learning in R and Python
Dataprep
1138 ⭐
DataPrep — The easiest way to prepare data in Python
Musicmood
391 ⭐
A machine learning approach to classify songs by mood.
Visdat
392 ⭐
Preliminary Exploratory Visualisation of Data
Datavisualization
241 ⭐
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Kdepy
333 ⭐
Kernel Density Estimation in Python
Lotteryprediction
216 ⭐
:full_moon_with_face: Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to change" is called The Gambler's Fallacy" existed.
Mirador Mirador
187 ⭐
Tool for visual exploration of complex data.
Inspectdf
229 ⭐
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Autoeda Resources
347 ⭐
A list of software and papers related to automatic and fast Exploratory Data Analysis
Harunurrashid97 100 Days Of Ml Code
184 ⭐
A day to day plan for this challenge. Covers both theoritical and practical aspects
Ditching Excel For Python
195 ⭐
Functionalities in Excel translated to Python
Handyspark
165 ⭐
HandySpark - bringing pandas-like capabilities to Spark dataframes
Xda
112 ⭐
R package for exploratory data analysis
Spark R Notebooks
114 ⭐
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
How To Score 0.8134 In Titanic Kaggle Challenge
117 ⭐
Solution of the Titanic Kaggle competition
Impy
113 ⭐
Impy is a Python3 library with features that help you in your computer vision tasks.
Hn_so_analysis
97 ⭐
Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Expandar
108 ⭐
R Package for Interactive Panel Data Exploration
Data Analysis Using Python
114 ⭐
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
Edator
75 ⭐
A python package that performs exploratory data analysis for users. Additionally, it generates 3 types of output files (cleaned CSV, plots and a text report).
Data Journalism
76 ⭐
Data journalism and easy to replicate notebooks using Python, R, and Web visualisations
Breast Cancer Risk Prediction
109 ⭐
Classification of Breast Cancer diagnosis Using Support Vector Machines
Correlationfunnel
98 ⭐
Speed Up Exploratory Data Analysis (EDA)
Edarf
64 ⭐
exploratory data analysis using random forests
Ben519 Mltools
66 ⭐
Exploratory and diagnostic machine learning tools for R
Densenet Mura Pytorch
59 ⭐
Implementation of DenseNet model on Standford's MURA dataset using PyTorch
Kushner_eb5_census
49 ⭐
Jared Kushner and his partners used a program meant for job-starved areas to build a luxury skyscraper
Vtree
60 ⭐
An R package for calculating and drawing variable trees
Leila
56 ⭐
Librería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Sequence Mining
45 ⭐
Probabilistic Sequence Mining
Lux Org Lux
3279 ⭐
Automatically visualize your pandas dataframe via a single print! 📊 💡
Machine_learning_and_deep_learning
240 ⭐
Must Read Papers For Ml
162 ⭐
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Predicting Football Match Outcome Using Machine Learning
52 ⭐
Football Match prediction using machine learning algorithms in jupyter notebook
Learnr
56 ⭐
Exploratory, Inferential and Predictive data analysis. Feel free to show your :heart: by giving a star :star:
Exploratory_data_analysis_visualization_python
65 ⭐
Data analysis and visualization with PyData ecosystem: Pandas, Matplotlib Numpy, and Seaborn
Furniture
42 ⭐
The furniture R package contains table1 for publication-ready simple and stratified descriptive statistics, tableC for publication-ready correlation matrixes, and other tables #rstats
Great Northern Diver Loon
35 ⭐
A Toolkit for Interactive Statistical Data Visualization
Human Resource Analytics And Employee Churn Prediction
31 ⭐
A Data science and Analytics project with the main aim of doing some Descriptive and Exploratory Data Analysis and then applying predictive modelling for predicting why and which are the best and most experienced employees leaving prematurely?
Exploripy
37 ⭐
Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.
Nostradamus
30 ⭐
🧠 An open-source machine learning application for analyzing software defect reports extracted from bug tracking systems.
Walmart Sales Prediction
25 ⭐
Exploration
30 ⭐
Science des Données Saison 2: Exploration statistique multidimensionnelle, ACP, AFC, AFD, Classification non supervisée
Fsharpgephistreamer
23 ⭐
F# functions for streaming any kind of graph/network data to the network visualization tool gephi
Metaomgraph
28 ⭐
MetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets
Mandliya Ml
28 ⭐
A 60 days+ streak of daily learning of ML/DL/Maths concepts through projects
Kaggle
23 ⭐
Kaggle Kernels (Python, R, Jupyter Notebooks)
Msds593
21 ⭐
MSDS593 -- Exploratory data analysis (EDA) at the University of San Francisco
Campaign Critic
16 ⭐
A web app that helps Kickstarter creators increase their chances of being funded
Automobile Dataset Analysis
23 ⭐
This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price
Archaic
17 ⭐
Exploration, clustering, visualization and classification of DNA damage patterns
Data Science 101
19 ⭐
Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Fraud Analysis
33 ⭐
Insurance fraud claims analysis project
Inspector Lestrade
15 ⭐
Neural network to predict eye gaze location from front-facing camera
Itemset Mining
18 ⭐
Probabilistic Itemset Mining
Adenine
15 ⭐
ADENINE: A Data ExploratioN PipelINE
Smarteda
25 ⭐
a R package for data exploratory analysis
Ibm Ml With Python Capstone Project
13 ⭐
My capstone project for the course IBM Data Science ML with Python .
Pcaworkshop
30 ⭐
An introduction to matrix factorization and PCA and SVD.
Tdf
17 ⭐
🚴🏅📊Tour de France winners and stages data
Tidytab
14 ⭐
Create tidyverse-friendly tables of frequencies
End To End Lead Scoring
17 ⭐
An end-to-end enterprise-grade example of working a data science problem.
Eda_python
12 ⭐
Sample Jupyter book about basics of Exploratory Data Analysis
Pranavsuri Data Analyst Nanodegree
13 ⭐
This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Open Data Explorer
12 ⭐
To promote exploration and use of open data - currently in beta
Edapy
16 ⭐
Exploratory Data Analysis with Python
Lending_loan_prediction
13 ⭐
The purpose of this project is to process the dataset, analyze it, do some feature engineering and finally make a predictive loan model for an applicant.
Exploratory Data Analysis Tools
13 ⭐
A survey of tools that make EDA more automated.
Exploratory Data Analysis With Python
10 ⭐
Exploratory Data Analysis
Mmpf
10 ⭐
Monte-Carlo methods for prediction functions
Kuntal G Books
10 ⭐
This repository contains code and bonus content which will be added from time to time for the books "Learning Generative Adversarial Network- GAN" and "R Data Analysis Cookbook - 2nd Edition" by Packt
Complete Life Cycle Of A Data Science Project
255 ⭐
Complete-Life-Cycle-of-a-Data-Science-Project
Exploratory_data_analysis Wine_quality_dataset
20 ⭐
read my blog @ https://medium.com/@theprasadpatil/exploratory-data-analysis-8fc1cb20fd15
Laderast Burro
12 ⭐
Exploring data together using shiny (burro(w) into the data)
Dataprofessor Code
534 ⭐
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Data Describe
289 ⭐
data⎰describe: Pythonic EDA Accelerator for Data Science
Skimpy
146 ⭐
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
The Sparks Foundation
77 ⭐
📌 This repo. Contains Basic - Advance level Machine learning / business analysis Projects. 👨💻
Olliepy
46 ⭐
OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Mebauer Data Analysis Using Python
64 ⭐
Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data
Data Inspector
36 ⭐
Data Inspector is an open-source python library that brings 15++ types of different functions to make EDA, data cleaning easier.
Data Science Series
32 ⭐
For all those who're struggling to find a good hands-on resource (with case studies) to master their Data Science skills, Here's all what you need!
The Data Analysis Workshop
34 ⭐
A New Interactive Approach to Learning Data Analysis
Whatsapp Chat Data Analysis
20 ⭐
An Exhaustive WhatsApp Chat Data Analysis.
Shaildeliwala Experiments
18 ⭐
Code and notebooks containing my experiments in data science, EDA, visualization, and machine learning
Dqlab Career Track
17 ⭐
A collection of scripts written to complete DQLab Data Analyst Career Track 📊
Tseuler
15 ⭐
A library for Time-Series exploration, analysis & modelling.
Market_basket_analysis
15 ⭐
Market Basket Analysis with Recommendation Algorithms & Shiny App Implementation of a Product Recommendation System for an Online Retailer
Data Science End To End
14 ⭐
A Respository to get you job ready as a Data Scientist
Sominw Kaggle
14 ⭐
Data Analysis using datasets from Kaggle
Edge2guard
14 ⭐
Code for PerCom Workshop paper title 'Edge2Guard: Botnet Attacks Detecting Offline Models for Resource-Constrained IoT Devices'
Nycbuildingenergyuse
14 ⭐
Creating Regression Models Of Building Emissions On Google Cloud
Data Purifier
12 ⭐
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing For Machine Learning and Natural Language Processing Applications in Python.
Lastancientone Data Science
12 ⭐
Using Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Bangalore House Prediction App
11 ⭐
Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.