271 Open Source Datascience Software Projects
Free and open source datascience code projects including engines, APIs, generators, and tools.
Industry Machine Learning 6108 ⭐
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Clevercsv 957 ⭐
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Datastream.io 857 ⭐
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Ai Series 714 ⭐
:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc. 💫 人工智能与深度学习实战，数理统计篇 | 机器学习篇 | 深度学习篇 | 自然语言处理篇 | 工具实践 Scikit & Tensoflow & PyTorch 篇 | 行业应用 & 课程笔记
Business Machine Learning 626 ⭐
A curated list of practical business machine learning (BML) and business data science (BDS) applications for Accounting, Customer, Employee, Legal, Management and Operations (by @firmai)
SocIOS Brasil 500 ⭐
Captura os dados de sócios das empresas brasileiras na Receita Federal e exporta para um formato legível por humanos
Dataframe JS 405 ⭐
Notebooks Statistics And Machinelearning 279 ⭐
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
SalarIOS Magistrados 247 ⭐
Baixa as planilhas de salários de magistrados, extrai os contracheques, limpa e exporta pra CSV
Introduction Datascience Python Book 323 ⭐
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications
My Awesome Ai Bookmarks 239 ⭐
Curated list of my reads, implementations and core concepts of Artificial Intelligence, Deep Learning, Machine Learning by best folk in the world.
Oie Resources 365 ⭐
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Datacamp Python Data Science Track 401 ⭐
All the slides, accompanying code and exercises all stored in this repo. 🎈
Melusine 268 ⭐
Melusine is a high-level library for emails classification and feature extraction "dédiée aux courriels français".
Harunurrashid97 100 Days Of Ml Code 184 ⭐
A day to day plan for this challenge. Covers both theoritical and practical aspects
Wikipedia Mirror 202 ⭐
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
Awesome Shiny Apps For Statistics 140 ⭐
🌟 A curated list of Awesome Shiny Apps for Statistics (ASAS)🌟
Blockchain2graph 138 ⭐
Blockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Renku 169 ⭐
The Renku Project provides a platform and tools for reproducible and collaborative data analysis.
Iamsivab Data Science Resources 184 ⭐
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Emotion Classification From Audio Files 263 ⭐
Understanding emotions from audio files using neural networks and multiple datasets.
Khuyentran1401 Machine Learning Articles 141 ⭐
List of interesting articles on different topics of machine learning and deep learning
Repo2docker Action 106 ⭐
A GitHub action to build data science environment images with repo2docker and push them to registries.
Data_science_blogs 201 ⭐
A repository to keep track of all the code that I end up writing for my blog posts.
Climate Change Data 287 ⭐
:earth_africa: A curated list of APIs, open data and ML/AI projects on climate change
Openuba 202 ⭐
A robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Covid19 Dashboard 148 ⭐
🦠 Django + Plotly Coronavirus dashboard. Powerful data driven Python web-app, with an awesome UI. Contributions welcomed! Featured on 🕶Awesome-list
Predictive Maintenance 97 ⭐
A notebook tutorial series for performing predictive maintenance using machine learning
Datasciencecampus Mobius 48 ⭐
Scripts to extract data from the COVID-19 Google Community Mobility Reports
Xgboost Smote Detect Fraud 53 ⭐
Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Mass Ts 70 ⭐
MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.
Sars2pack 55 ⭐
An R package with over 50 highly cited, read-to-use, up-to-date COVID-19 pandemic data resources
D20datascience 47 ⭐
Data science investigations into the mechanics of the world's greatest role playing game
Hackyhourhandbook 42 ⭐
A handbook for those who want to start coordinating Hacky Hour events in their University/Institute
⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Bike Sharing Demand Kaggle 33 ⭐
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Predict Opioid Prescribers 35 ⭐
A pattern focusing on how to use scikit learn and python in Watson Studio to predict opioid prescribers based off of a 2014 kaggle dataset.
Data Science Template 38 ⭐
A template for Data Science projects with a solid Software Engineering Architecture
Jenkins Ci 27 ⭐
Minimal example to setup a Jenkins-CI pipeline for data science projects on OpenShift in a couple of minutes.
Datasciencetutorials.jl 83 ⭐
A set of tutorials to show how to use Julia for data science (DataFrames, MLJ, ...)
Python For Datascience Machine Learning Bootcamp Udemy 31 ⭐
Repository for the course on Udemy - Python for Data Science and Machine Learning Bootcamp , Jose Portilla
Machinelearning_breastcancer_python 26 ⭐
Machine Learning Applications using Sklearn, matplotlib, pandas, and seaborn
Human Resource Analytics And Employee Churn Prediction 31 ⭐
A Data science and Analytics project with the main aim of doing some Descriptive and Exploratory Data Analysis and then applying predictive modelling for predicting why and which are the best and most experienced employees leaving prematurely?
Leonjessen Talks 28 ⭐
Repository of publicly available talks by Leon Eyrich Jessen, PhD. Talks cover Data Science and R in the context of research
Open Data Lab 26 ⭐
an initiative to provide infrastructure for reproducible workflows around open data