124 Open Source Image Captioning Software Projects
Free and open source image captioning code projects including engines, APIs, generators, and tools.
A Pytorch Tutorial To Image Captioning 1904 ⭐
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Cameramanager 1205 ⭐
Simple Swift class to provide all the configurations you need to create custom camera view in your app
Bottom Up Attention 1175 ⭐
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Self Critical.pytorch 823 ⭐
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
Omninet 476 ⭐
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Adaptiveattention 320 ⭐
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Kuanghuei Scan 378 ⭐
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Caption_generator 251 ⭐
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
Show Control And Tell 254 ⭐
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Up Down Captioner 221 ⭐
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Show Adapt And Tell 148 ⭐
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Dataturks 220 ⭐
ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
Neural Nuts Image Caption Generator 143 ⭐
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
Jdai Cv Image Captioning 206 ⭐
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Sightseq 119 ⭐
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Image Caption Generator 160 ⭐
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Arnet 96 ⭐
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Adaptive 97 ⭐
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Medical Report Generation 131 ⭐
A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.
Video2description 142 ⭐
Video to Text: Natural language description generator for some given video. [Video Captioning]
Stylenet 57 ⭐
A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
Updown Baseline 69 ⭐
Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".
Image_captioning 56 ⭐
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset
Cam2caption 50 ⭐
[DEPRECATED] An Android application which converts camera feed to captions in real time
Cs231n_assignments 46 ⭐
[Assignments] CS231N: Convolutional Neural Networks for Visual Recognition (2016 & 2017)
Fenglinliu98 Mia 55 ⭐
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
Cavp 46 ⭐
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)
Aat 44 ⭐
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
Udacity Cvnd Projects 37 ⭐
My solutions to the projects assigned for the Udacity Computer Vision Nanodegree
Anubhavshrimal Machine Learning 48 ⭐
The projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Punny_captions 32 ⭐
An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".
Optimization_of_image_description_metrics_using_policy_gradient_methods 27 ⭐
Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods
Wentong Dst Up Down Captioner 29 ⭐
Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
Show Attend And Tell 57 ⭐
A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Attn Gan 21 ⭐
Pytorch implementation of paper: AttnGAN Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
Srinadhu Cs231n 31 ⭐
My solutions for Assignments of CS231n: Convolutional Neural Networks for Visual Recognition
Stt 17 ⭐
A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
Im2p 15 ⭐
Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs
Deep Learning Image Caption Generator 23 ⭐
Deep CNN-LSTM for Generating Image Descriptions :smiling_imp:
Neural Image Captioning 12 ⭐
Implementation of Neural Image Captioning model using Keras with Theano backend
Miteshputhranneu Image Caption Generator 32 ⭐
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
Introduction_to_deep_learning_coursera 32 ⭐
Intro to Deep Learning by National Research University Higher School of Economics
Show_attend_and_tell.keras 14 ⭐
A keras implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Fine Grained Image Captioning 16 ⭐
The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective”
Tika Dockers 15 ⭐
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video
Self Critical 17 ⭐
PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"
Yuanxiaosc Image Captioning 21 ⭐
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述
A Neural Compositional Paradigm For Image Captioning 11 ⭐
Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin
Troboto Udacity 44 ⭐
This repo includes all the projects I have finished in the Udacity Nanodegree programs
Kyfafyd Mirrorgan 23 ⭐
Reproduction of the paper MirrorGAN: Learning Text-to-image Generation by Redescription
Diverse_and_specific_image_captioning 13 ⭐
Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions.
Image Captioining 17 ⭐
The objective is to process by generating textual description from an image – based on the objects and actions in the image. Using generative models so that it creates novel sentences. Pipeline type models uses two separate learning process, one for language modelling and other for image recognition. It first identifies objects in image and provides the result to the Inception-v3 model to convert into word embedding vector than into series of LSTM cells to get desired captions.
Xmodaler 831 ⭐
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Image Captioning Dlct 119 ⭐
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
Deep_learning_in_python_2018 80 ⭐
Deep Learning workshop including image classification, face recognition, Object detection, language modelling, image captioning and neural machine translation.