Open Source Libs
67 Open Source Chinese NLP Software Projects
Free and open source Chinese NLP code projects, including engines, APIs, generators, and tools.
Chinese Xinhua
8839 ⭐
Chinese Xinhua Dictionary (中华新华字典) database, including xiehouyu (two-part allegorical sayings), idioms, words, and characters.
Awesome Chinese Nlp
6656 ⭐
A curated list of resources for Chinese NLP
Nlp_chinese_corpus
6750 ⭐
Large Scale Chinese Corpus for NLP
Ltp
3702 ⭐
Language Technology Platform
Lac
2849 ⭐
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, and word importance
Fastnlp
2484 ⭐
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Information Extraction Chinese
1906 ⭐
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT
Thulac Python
1635 ⭐
An Efficient Lexical Analyzer for Chinese
Didi Chinesenlp
1471 ⭐
Datasets and SOTA results for every field of Chinese NLP
Jcseg
781 ⭐
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
Thulac
676 ⭐
An Efficient Lexical Analyzer for Chinese
Chinese Chatbot Pytorch Implementation
634 ⭐
Another Chinese chatbot implemented in PyTorch, which is a sub-module of an intelligent work order processing robot.
Chinese_models_for_spacy
570 ⭐
Models for spaCy that support Chinese
Small Chinese Corpus
481 ⭐
Some useful small Chinese corpus datasets
Weixin_public_corpus
488 ⭐
WeChat official accounts corpus
Zhparser
471 ⭐
zhparser is a PostgreSQL extension for full-text search of the Chinese language
Chinese Nlp Corpus
610 ⭐
Collections of Chinese NLP corpus
Ddparser
719 ⭐
Baidu's open source dependency parsing system
Chineseaddress_ocr
333 ⭐
OCR for Chinese addresses in photographed documents, implemented using CTPN+CTC+Address Correction.
Thulac Java
298 ⭐
An Efficient Lexical Analyzer for Chinese
Thuctc
187 ⭐
An Efficient Chinese Text Classifier
Weatherbot
209 ⭐
A Rasa-based Chinese weather inquiry chatbot with a Web UI
G2pc
183 ⭐
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
Gossiping Chinese Corpus
157 ⭐
Chinese question-answer corpus from the PTT Gossiping board
Segmentit
165 ⭐
A Chinese word segmentation package usable in any JS environment, forked from leizongmin/node-segment
Chinese Automatic Speech Recognition
251 ⭐
Chinese speech recognition
Microtokenizer
124 ⭐
A micro tokenizer for Chinese: a small word segmentation engine covering a full range of algorithms
Chinese_nlu_by_using_rasa_nlu
116 ⭐
Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Zhopenie
109 ⭐
Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module)
Thucke
110 ⭐
THU Chinese Keyphrase Extraction Toolkit
Chinese Chatbot
184 ⭐
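A Chinese chatbot trained on 100,000 dialogue pairs with an attention mechanism; it generates a meaningful reply to most general questions. The model has been uploaded and can be run directly; if it doesn't run, I'll eat my keyboard on a livestream.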
Fancy Nlp
253 ⭐
NLP for humans. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
Chinese_semantic_role_labeling
71 ⭐
Chinese semantic role labeling based on Bi-LSTM and CRF
Douban Dushu Dataset
46 ⭐
A dataset containing 37 million Douban Dushu comments
Tensorflow_nlp
34 ⭐
Deep natural language processing toolkit
Chinesener
35 ⭐
Named entity recognition for Chinese.
Thulac.so
34 ⭐
An Efficient Lexical Analyzer for Chinese
Deepdivechineseapps
34 ⭐
DeepDive Tutorial with Chinese Support
Cnn Question Classification Keras
29 ⭐
Chinese Question Classifier (Keras Implementation) on BQuLD
Classic_chinese_punctuate
29 ⭐
Classical Chinese punctuation experiment with Keras using the daizhige (殆知阁古代文献藏书) dataset
Nlp4kec
28 ⭐
This package provides Korean, English, and Chinese morphological analyzers for R.
Chinesebert
23 ⭐
This is a Chinese BERT model specifically for question answering
Bert_tokenization_for_java
38 ⭐
This is a Java version of the Chinese tokenization described in BERT.
Samurais Chop
17 ⭐
Chinese Tokenizer module for Python
Chinesenounphraseextraction
15 ⭐
Extracts noun phrases from Chinese corpora using part-of-speech templates
Berserker
16 ⭐
Berserker - BERt chineSE woRd toKenizER
Stanfordcorenlp Chinese
23 ⭐
Chinese implementation of the official Python interface to the Stanford CoreNLP Java server application, for parsing, tokenizing, part-of-speech tagging, etc. of Chinese texts.
Chinese Nlp Ner
14 ⭐
A BLSTM-CRF solution for Chinese named entity recognition
Electra_with_tensorflow
13 ⭐
This is an implementation of ELECTRA according to the paper "ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators"
Rasa_nlu_chinese_example
13 ⭐
Chinese examples for rasa_nlu
Esapp
12 ⭐
An unsupervised Chinese word segmentation tool.
Cnn_chinese_text_classification
13 ⭐
Chinese text classification using a CNN + highway network architecture
Chinese Char Lm
14 ⭐
Explores Chinese language models with sub-character level visual information
Ai Resources Zh
10 ⭐
A Chinese-language index of artificial intelligence and data science resources
Awesome Word Segmentation
11 ⭐
A curated list of resources dedicated to word segmentation
Zi Dataset
32 ⭐
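A Chinese character dataset with related information for each character, such as stroke count, radical, pinyin, and English definitions/synonyms.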
Machinetranslation Transformer
25 ⭐
Chinese-to-English machine translation, based entirely on keras-transformer. The model has been uploaded and can be run directly.
Chinese Chat Title Ner Bert Bilstm Crf
13 ⭐
This is a Chinese chat title NER task using a BERT-BiLSTM-CRF model.
Rime Cantonese
263 ⭐
Rime Cantonese input schema
Chinese Minority Plm
79 ⭐
CINO: Pre-trained Language Models for Chinese Minority Languages
Chinese Sentence Pair Modeling
47 ⭐
Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCNLI, CMNLI
Chinese_medical_words
48 ⭐
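A manually curated corpus of medical industry vocabulary and terminology. It can be used to train NLP models of all kinds, such as speech recognition and dialogue systems.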
Cacl2
50 ⭐
Lexicon for Chinese lexical analysis and word segmentation
Abner Wong Textrank
32 ⭐
Keyword extraction and summarization for Chinese text using TextRank
Jastfkjg Chinese Relation Extraction
11 ⭐
Rule-based Chinese relation extraction
Pnlp
11 ⭐
Pre-Processing NLP.
Punctuator
14 ⭐
A small seq2seq punctuator tool based on DistilBERT