51 Open Source Language Modeling Software Projects
Free and open source language modeling code projects including engines, APIs, generators, and tools.
Implement modern LSTM cell by tensorflow and test them by language modeling task for PTB. Highway State Gating, Hypernets, Recurrent Highway, Attention, Layer norm, Recurrent dropout, Variational dropout.
Language Modeling15 ⭐
Language modeling on the Penn Treebank (PTB) corpus using a trigram model with linear interpolation, a neural probabilistic language model, and a regularized LSTM.
Relational Rnn Pytorch239 ⭐
An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.
Tape Neurips2019116 ⭐
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)
Pretraining For Language Understanding73 ⭐
Pre-training of Language Models for Language Understanding
Pytorch Translm21 ⭐
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)
Comparatively Finetuning Bert95 ⭐
Comparatively fine-tuning pretrained BERT models on downstream, text classification tasks with different architectural configurations in PyTorch.
Repository for the lectures taught in the course named "Natural Language Processing" at the University of Guilan, Department of Computer Engineering.
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
Neurips Micronet34 ⭐
[JMLR 2020] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
Songlab Cal Tape392 ⭐
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
Lingua Go384 ⭐
👄 The most accurate natural language detection library for Go, suitable for long and short text alike
Group Transformer17 ⭐
Official code for Group-Transformer (Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model, COLING-2020).
Codemixed Text Generator16 ⭐
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Recurrent Fwp28 ⭐
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"
Incontext Learning11 ⭐
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"
Theano Recurrence40 ⭐
Recurrent Neural Networks (RNN, GRU, LSTM) and their Bidirectional versions (BiRNN, BiGRU, BiLSTM) for word & character level language modelling in Theano
Deep Lyrics133 ⭐
Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
This repository holds some of the notebooks I use to study some data for the CMU Data Science Club.
:round_pushpin: :office: :bank: :post_office: :convenience_store: :department_store: LNEx: Location Name Extractor