115 Open Source Data Augmentation Software Projects
Free and open source data augmentation code projects including engines, APIs, generators, and tools.
Dali 2862 ⭐
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
Textattack 941 ⭐
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP
Nlp_xiaojiang 803 ⭐
自然语言处理（nlp），小姜机器人（闲聊检索式chatbot），BERT句向量-相似度（Sentence Similarity），XLNET句向量-相似度（text xlnet embedding），文本分类（Text classification）， 实体提取（ner，bert+bilstm+crf），数据增强（text augment, data enhance），同义句同义词生成，句子主干提取（mainpart），中文汉语短文本相似度，文本特征工程，keras-http-service调用
Inltk 661 ⭐
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Eda_nlp_for_chinese 548 ⭐
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
Data Augmentation Review 582 ⭐
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
Random Erasing 458 ⭐
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
Demiseom Specaugment 363 ⭐
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Amazon Forest Computer Vision 340 ⭐
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
Audiomentations 236 ⭐
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Torchsat 206 ⭐
🔥TorchSat 🌏 is an open-source deep learning framework for satellite imagery analysis based on PyTorch.
Syndata Generation 188 ⭐
Code used to generate synthetic scenes and bounding box annotations for object detection. This was used to generate data used in the Cut, Paste and Learn paper
Tensorflow Mnist Cnn 174 ⭐
MNIST classification using Convolutional NeuralNetwork. Various techniques such as data augmentation, dropout, batchnormalization, etc are implemented.
Stylealign 167 ⭐
[ICCV 2019]Aggregation via Separation: Boosting Facial Landmark Detector with Semi-Supervised Style Transition
Featurelabs Compose 143 ⭐
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
All Conv Keras 114 ⭐
All Convolutional Network: (https://arxiv.org/abs/1412.6806#) implementation in Keras
Semsegpipeline 117 ⭐
A simpler way of reading and augmenting image segmentation data into TensorFlow
Unsupervised Data Augmentation 109 ⭐
Unofficial PyTorch Implementation of Unsupervised Data Augmentation.
Aaltd18 97 ⭐
Data augmentation using synthetic data for time series classification with deep residual networks
Pose Adv Aug 83 ⭐
Code for "Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation" (CVPR 2018)
Ghost Free Shadow Removal 96 ⭐
[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN
Cutmix 82 ⭐
a Ready-to-use PyTorch Extension of Unofficial CutMix Implementations with more improved performance.
Pedestrian Synthesis Gan 67 ⭐
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
How Far Can We Go With Mnist 60 ⭐
A collection of codes for 'how far can we go with MNIST' challenge
Mrnet 60 ⭐
Implementation of the paper: Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet
Wb_color_augmenter 59 ⭐
WB color augmenter improves the accuracy of image classification and image semantic segmentation methods by emulating different WB effects (ICCV 2019) [Python & Matlab].
Dips 54 ⭐
NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
Evoskeleton 76 ⭐
Official project website for the CVPR 2020 paper (Oral Presentation) "Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data"
Handwriting_recogition_using_adversarial_learning 47 ⭐
"Handwriting Recognition in Low-resource Scripts using Adversarial Learning ”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
Machine_learning_course 43 ⭐
Artificial intelligence/machine learning course at UCF in Spring 2020 (Fall 2019 and Spring 2019)
Fastai_sparse 42 ⭐
3D augmentation and transforms of 2D/3D sparse data, such as 3D triangle meshes or point clouds in Euclidean space. Extension of the Fast.ai library to train Sub-manifold Sparse Convolution Networks
Skin Data Augmentation 43 ⭐
Source code for the paper 'Data Augmentation for Skin Lesion Analysis' - Best Paper Award at the ISIC Skin Image Analysis Workshop @ MICCAI 2018
Emotionalconversionstargan 42 ⭐
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Jim Schwoebel Allie 44 ⭐
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Doccreator 40 ⭐
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
Bird_species_classification 35 ⭐
Supervised Classification of bird species :bird: in high resolution images, especially for, Himalayan birds, having diverse species with fairly low amount of labelled data
Audio_degrader 34 ⭐
Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
Bankcard Recognizer 33 ⭐
💳 Extracting numbers from bankcard, based on Deep Learning. 基于深度学习的银行卡号识别与定位系统。
Veri Artirma Data Augmentation 22 ⭐
Bu repoda veri artırma (data augmentation) ile ilgili pratik uygulamalara ulaşabilirsiniz.
All Classifiers 2019 21 ⭐
A collection of computer vision projects for Acute Lymphoblastic Leukemia classification/early detection.
Learning From Rules 22 ⭐
Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net/forum?id=SkeuexBtDr)
Awesome Mixed Sample Data Augmentation 23 ⭐
A collection of awesome things about mixed sample data augmentation
Deepsentipers 16 ⭐
DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus
Data Augmentation For Object Detection 21 ⭐
Data Augmentation For Object Detection using Pytorch and PIL
Semisupervised_timeseries_infogan 14 ⭐
A tensorflow implementation of informative generative adversarial network (InfoGAN ) to one dimensional ( 1D ) time series data with a supervised loss function. So it's called semisupervised Info GAN.
Social_distancing_with_ai 21 ⭐
Monitor people violating Social Distancing or not wearing Face Masks in public through CCTV footage.
Image Rotation And Cropping Tensorflow 12 ⭐
Image rotation and cropping out the black borders in TensorFlow
Data Augmentation 11 ⭐
These functions will randomly distort image data to amplify total amount of images for use in machine learning algorithms.