31 Open Source Lsh Software Projects
Free and open source lsh code projects including engines, APIs, generators, and tools.
Datasketch 1648 ⭐
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble
Falconn 994 ⭐
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
James Bowman Nlp 352 ⭐
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Tarsoslsh 185 ⭐
A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It implements Locality-sensitive Hashing (LSH) and multi index hashing for hamming space.
Mattilyra Lsh 212 ⭐
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Vectorsinsearch 77 ⭐
Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Snapy 25 ⭐
SnaPy is a Python library for detecting near duplicate texts using Locality Sensitive Hashing.
Product Quantization 37 ⭐
🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
H2_alsh 15 ⭐
Accurate and Fast Locality-Sensitive Hashing Scheme for Maximum Inner Product Search (KDD 2018)
Drsy Motis 39 ⭐
Mobile(iOS) Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
Qalsh 12 ⭐
Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search (PVLDB 2015 and VLDBJ 2017)
Neural Scam Artist 15 ⭐
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.