33 Open Source Record Linkage Software Projects
Free and open source record linkage code projects including engines, APIs, generators, and tools.
Libpostal 3297 ⭐
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Dedupe 3227 ⭐
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Yomguithereal Talisman 620 ⭐
Record Linkage Resources 65 ⭐
Resources for tackling record linkage / deduplication / data matching problems
Splink 127 ⭐
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Senzing Awesome 32 ⭐
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Entity Embed 86 ⭐
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Cleanzr Fasthash 11 ⭐
Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).
Ngmarchant Oasis 10 ⭐
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).