164 Open Source Information Extraction Software Projects
Free and open source information extraction code projects including engines, APIs, generators, and tools.
Information Extraction Chinese 1906 ⭐
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取
Annotated Semantic Relationships Datasets 615 ⭐
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Nlp Cube 452 ⭐
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
Nlp Projects 399 ⭐
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Open Entity Relation Extraction 394 ⭐
Knowledge triples extraction and knowledge base construction based on dependency syntax for open domain text.
Gcn Over Pruned Trees 345 ⭐
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)
Aggcn 373 ⭐
Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)
Tacred Relation 307 ⭐
PyTorch implementation of the position-aware attention model for relation extraction
Multiple Relations Extraction Only Look Once 309 ⭐
Multiple-Relations-Extraction-Only-Look-Once. Just look at the sentence once and extract the multiple pairs of entities and their corresponding relations. 端到端联合多关系抽取模型，可用于 http://lic2019.ccf.org.cn/kg 信息抽取。
Ner Bert Pytorch 305 ⭐
PyTorch solution of named entity recognition task Using Google AI's pre-trained BERT model.
Casrel 495 ⭐
A Novel Cascade Binary Tagging Framework for Relational Triple Extraction. Accepted by ACL 2020.
Oie Resources 365 ⭐
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Holmes Extractor 338 ⭐
Information extraction from English and German texts based on predicate logic
Event Registry Python 198 ⭐
Python package for API access to news articles and events in the Event Registry
Open Ie Papers 154 ⭐
Open Information Extraction (OpenIE) and Open Relation Extraction (ORE) papers and data.
Davidsbatista Snowball 152 ⭐
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Dan Jurafsky Chris Manning Natural Language Processing 146 ⭐
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Wandora 114 ⭐
Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.
Pytorch_multi_head_selection_re 121 ⭐
BERT + reproduce "Joint entity recognition and relation extraction as a multi-head selection problem" for Chinese and English IE
Triggerner 162 ⭐
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Saber 96 ⭐
Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.
Prediction_API 87 ⭐
AMiner Prediction API is a toolkit for science data prediction, such as scholar portrait property prediction.
Distre 80 ⭐
[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Awesome Bioie 180 ⭐
🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)
Abbreviation Extraction 69 ⭐
Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs
Neural_name_tagging 37 ⭐
Code for "Reliability-aware Dynamic Feature Composition for Name Tagging" (ACL2019)
Re Cnn Pytorch 43 ⭐
PyTorch implementation of relation extraction via convolutional neural network with multi-size convolution kernels.
Alter Nlu 40 ⭐
Natural language understanding library for chatbots with intent recognition and entity extraction.
Knowledge Graph Nlp In Action 57 ⭐
从模型训练到部署，实战知识图谱(Knowledge Graph)&自然语言处理(NLP)。涉及 Tensorflow, Bert+Bi-LSTM+CRF,Neo4j等 涵盖 Named Entity Recognition,Text Classify,Information Extraction,Relation Extraction 等任务。
Understanding Financial Reports Using Natural Language Processing 40 ⭐
Investigate how mutual funds leverage credit derivatives by studying their routine filings to the SEC using NLP techniques 📈🤑
Shifted Label Distribution 36 ⭐
Source code for paper "Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction" (EMNLP 2019)
Xponents 38 ⭐
Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.
Informationextractionsystem 28 ⭐
Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.
Mlmt 26 ⭐
Code for the paper "A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling" (ACL2018)
Knowledgegraph 29 ⭐
This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.
Pico_parser 27 ⭐
A clinical BERT-based NLP tool for parsing clinical trial abstracts following the PICO framework
Whour 24 ⭐
Tool for information gathering, IPReverse, AdminFInder, DNS, WHOIS, SQLi Scanner with google.
Lnex 21 ⭐
:round_pushpin: :office: :bank: :post_office: :convenience_store: :department_store: LNEx: Location Name Extractor
Hungarian Text Mining Workshop 18 ⭐
Materials for the Text Mining workshop held in the HuNLP meetup, June 2017
Palladian 28 ⭐
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Irel Reading Group 15 ⭐
This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.
Serritor 23 ⭐
Oncotext 19 ⭐
OncoText is an information extraction service for breast pathology reports. It supports over 20 categories including DCIS, includes pretrained models, and supports flexible addition of new categories, new training data, and parsing new reports.
Tabledisentangler 19 ⭐
Functional and structural analysis of tables in research papers (Table disentangling)