191 Open Source Datascience Software Projects
Free and open source datascience code projects including engines, APIs, generators, and tools.
Industry Machine Learning 5671 ⭐
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Clevercsv 852 ⭐
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Datastream.io 791 ⭐
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Ai Series 685 ⭐
:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc. 💫 人工智能与深度学习实战，数理统计篇 | 机器学习篇 | 深度学习篇 | 自然语言处理篇 | 工具实践 Scikit & Tensoflow & PyTorch 篇 | 行业应用 & 课程笔记
Business Machine Learning 547 ⭐
A curated list of practical business machine learning (BML) and business data science (BDS) applications for Accounting, Customer, Employee, Legal, Management and Operations (by @firmai)
SocIOS Brasil 414 ⭐
Captura os dados de sócios das empresas brasileiras na Receita Federal e exporta para um formato legível por humanos
Dataframe JS 353 ⭐
Notebooks Statistics And Machinelearning 264 ⭐
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
SalarIOS Magistrados 248 ⭐
Baixa as planilhas de salários de magistrados, extrai os contracheques, limpa e exporta pra CSV
Introduction Datascience Python Book 241 ⭐
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications
My Awesome Ai Bookmarks 216 ⭐
Curated list of my reads, implementations and core concepts of Artificial Intelligence, Deep Learning, Machine Learning by best folk in the world.
Oie Resources 226 ⭐
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Datacamp Python Data Science Track 204 ⭐
All the slides, accompanying code and exercises all stored in this repo. 🎈
Melusine 198 ⭐
Melusine is a high-level library for emails classification and feature extraction "dédiée aux courriels français".
Harunurrashid97 100 Days Of Ml Code 166 ⭐
A day to day plan for this challenge. Covers both theoritical and practical aspects
Tech.ml.dataset 159 ⭐
Clojure dataframe library and pipeline for data processing and machine learning
Wikipedia Mirror 140 ⭐
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kimix + ZIM dump, and MediaWiki/XOWA + XML dump
Awesome Shiny Apps For Statistics 123 ⭐
🌟 A curated list of Awesome Shiny Apps for Statistics (ASAS)🌟
Blockchain2graph 124 ⭐
Blockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Renku 127 ⭐
The Renku Project provides a platform and tools for reproducible and collaborative data analysis.
Iamsivab Data Science Resources 136 ⭐
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Emotion Classification From Audio Files 131 ⭐
Understanding emotions from audio files using neural networks and multiple datasets.
Khuyentran1401 Machine Learning Articles 106 ⭐
List of interesting articles on different topics of machine learning and deep learning
Data_science_blogs 94 ⭐
A repository to keep track of all the code that I end up writing for my blog posts.
Climate Change Data 79 ⭐
:earth_africa: A curated list of APIs, open data and ML/AI projects on climate change
Openuba 70 ⭐
A robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Covid19 Dashboard 68 ⭐
🦠 Django + Plotly Coronavirus dashboard. Powerful data driven Python web-app, with an awesome UI. Contributions welcomed! Found on 🕶Awesome-list
Predictive Maintenance 57 ⭐
A notebook tutorial series for performing predictive maintenance using machine learning
Datasciencecampus Mobius 45 ⭐
Scripts to extract data from the COVID-19 Google Community Mobility Reports
Xgboost Smote Detect Fraud 42 ⭐
Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Mass Ts 43 ⭐
MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.
Sars2pack 41 ⭐
An R package with nearly 40 highly cited, read-to-use, up-to-date COVID-19 pandemic data resources
D20datascience 40 ⭐
Data science investigations into the mechanics of the world's greatest role playing game
Hackyhourhandbook 36 ⭐
A handbook for those who want to start coordinating Hacky Hour events in their University/Institute
Commons 34 ⭐
⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Bike Sharing Demand Kaggle 33 ⭐
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Predict Opioid Prescribers 31 ⭐
A pattern focusing on how to use scikit learn and python in Watson Studio to predict opioid prescribers based off of a 2014 kaggle dataset.
Data Science Template 30 ⭐
A template for Data Science projects with a solid Software Engineering Architecture
Jenkins Ci 28 ⭐
Minimal example to setup a Jenkins-CI pipeline for data science projects on OpenShift in a couple of minutes.
Datasciencetutorials.jl 33 ⭐
A set of tutorials to show how to use Julia for data science (DataFrames, MLJ, ...)
Python For Datascience Machine Learning Bootcamp Udemy 26 ⭐
Repository for the course on Udemy - Python for Data Science and Machine Learning Bootcamp , Jose Portilla
Machinelearning_breastcancer_python 26 ⭐
Machine Learning Applications using Sklearn, matplotlib, pandas, and seaborn
Human Resource Analytics And Employee Churn Prediction 25 ⭐
A Data science and Analytics project with the main aim of doing some Descriptive and Exploratory Data Analysis and then applying predictive modelling for predicting why and which are the best and most experienced employees leaving prematurely?
Leonjessen Talks 25 ⭐
Repository of publicly available talks by Leon Eyrich Jessen, PhD. Talks cover Data Science and R in the context of research
Open Data Lab 24 ⭐
an initiative to provide infrastructure for reproducible workflows around open data