35 Open Source Lemmatizer Software Projects
Free and open source lemmatizer code projects including engines, APIs, generators, and tools.
Word_forms 518 ⭐
Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
Pymystem3 248 ⭐
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.
Lemmatizer 100 ⭐
Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy
Uralicnlp 41 ⭐
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
Cstlemma 24 ⭐
Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.
Libmorph 15 ⭐
libmorph rus/ukr - fast & accurate morphological analyzer/analyses for Russian and Ukrainian
Banglakit Lemmatizer 16 ⭐
A rule-based lemmatizer for Bengali / Bangla based written in Python. Under active development.
Wordless 455 ⭐
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Johnsnowlabs Nlu 436 ⭐
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Simplemma 14 ⭐
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency