Open Source Libs
Find Open Source Packages
Open Source Libraries
👉
Avx2
82 Open Source Avx2 Software Projects
Free and open source avx2 code projects including engines, APIs, generators, and tools.
SimdJSON
15250 ⭐
Parsing gigabytes of JSON per second
Asm Dude
3909 ⭐
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Onednn
2665 ⭐
oneAPI Deep Neural Network Library (oneDNN)
Simd
1475 ⭐
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
Mind Wheels
889 ⭐
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Vcdevel Vc
1163 ⭐
SIMD Vector Classes for C++
Libsimdpp
990 ⭐
Portable header-only C++ low level SIMD library
Simde
1347 ⭐
Implementations of SIMD instruction sets for systems which don't natively support them.
Directxmath
1050 ⭐
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
Minio Highwayhash
746 ⭐
Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
Fastnoisesimd
570 ⭐
C++ SIMD Noise Library
Turbopfor Integer Compression
588 ⭐
Fastest Integer Compression
SimdJSONsharp
568 ⭐
C# bindings for lemire/simdjson (and full C# port)
Libxsmm
597 ⭐
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Fastbase64
344 ⭐
SIMD-accelerated base64 codecs
Boost.simd
232 ⭐
Boost SIMD
Sse Popcount
249 ⭐
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Turbo Run Length Encoding
227 ⭐
TurboRLE-Fastest Run Length Encoding
Libpopcnt
244 ⭐
🚀 Fast C/C++ bit population count library
Hybridizer Basic Samples
199 ⭐
Examples of C# code compiled to GPU by hybridizer
Highwayhash
199 ⭐
Node.js implementation of HighwayHash, Google's fast and strong hash function
Wojciechmula Toys
220 ⭐
Storage for my snippets, toy programs, etc.
Tensorflow Optimized Wheels
120 ⭐
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Sse4 Strstr
138 ⭐
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Corrfunc
133 ⭐
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Base64simd
119 ⭐
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Nsimd
218 ⭐
Agenium Scale vectorization library for CPUs and GPUs
Ctranslate2
229 ⭐
Fast inference engine for Transformer models
Turbo Base64
123 ⭐
Turbo Base64 - Fastest Base64 SIMD/Neon/Altivec
Umesimd
79 ⭐
UME::SIMD A library for explicit simd vectorization.
Fcml Lib
72 ⭐
General purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).
Yandex Argon2
66 ⭐
Implementation of argon2 (i, d, id) algorithms with CPU dispatching
Unisimd Assembler
71 ⭐
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Rv
78 ⭐
RV: A Unified Region Vectorizer for LLVM
Md5 Simd
99 ⭐
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Cute Nucleotides
63 ⭐
Cute tricks for SIMD vectorized binary encoding and decoding of nucleotides, in Rust.
Positional Popcount
46 ⭐
Fast C functions for the computing the positional popcount (pospopcnt).
Op_rbf
57 ⭐
Optimized Recursive Bilateral Filter
Utf8
57 ⭐
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Turbo Histogram
42 ⭐
Fastest Histogram Construction
Sixtyfour
44 ⭐
How fast can we brute force a 64-bit comparison?
Turbo Transpose
39 ⭐
Transpose: SIMD Integer+Floating Point Compression Filter
Libalgebra
33 ⭐
Fast C header-only library for popcnt, pospopcnt, and set algebraic operations
Asmc
59 ⭐
Asmc Macro Assembler
Cex
43 ⭐
The CEX Cryptographic library in C++
Docker Nginx Boringssl
29 ⭐
NGINX Docker built top of rolling release BoringSSL
Fast Hex
37 ⭐
Fast, SIMD hex string encoder and decoder C++ lib and Node.js module
Cpuwhat
25 ⭐
Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Ultra Sort
26 ⭐
DSL for SIMD Sorting on AVX2 & AVX512
Triple_accel
56 ⭐
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
SimdJSON Rs
17 ⭐
Rust version of lemire's SimdJson
Quadray Engine
18 ⭐
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Sol
18 ⭐
A fast C99 vector library with Nim bindings.
Parsing Int Series
19 ⭐
Parse multiple decimal integers separated by arbitrary number of delimiters
Simd Byte Lookup
20 ⭐
SIMDized check which bytes are in a set
Libflagstats
14 ⭐
Efficient C functions to compute the summary statistics (flagstats) for sequencing read sets.
Stormbitmaps
12 ⭐
Fast algorithms for computing XX^T for binary matrices
Ksim
13 ⭐
The little simulator that could.
Ternary Logic
19 ⭐
Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Circlecollision
32 ⭐
Implementing Different Methods of Circle to Circle Collision Detection using variety of new Technologies: Vulkan Graphics/Compute API, AVX2/AVX-512
Fld Ecc Vec
17 ⭐
Vectorized implementation of Ed25519 and Ed448
Sliceslice Rs
55 ⭐
A fast implementation of single-pattern substring search using SIMD acceleration.
Google Highway
567 ⭐
Performance-portable, length-agnostic SIMD with runtime dispatch
Simdutf8
400 ⭐
SIMD-accelerated UTF-8 validation for Rust.
Jfalcou Eve
315 ⭐
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Osaca
200 ⭐
Open Source Architecture Code Analyzer
Cpurasterizer
98 ⭐
CPU Based Rasterizer Engine
Simdutf
106 ⭐
Unicode routines (UTF8, UTF16): billions of characters per second.
Chromium_clang
71 ⭐
Chromium browser compiled with the Clang/LLVM compiler.
Maxmath
43 ⭐
A C# SIMD math library for use with Unity only, substantially extending Unity.Mathematics by new types and functions, using Unity.Burst.
Dd Avx_v3
35 ⭐
Library of High Precision Sparse Matrix Operations Accelerated by SIMD
Block Aligner
53 ⭐
SIMD-accelerated library for computing global and X-drop affine gap sequence alignments using an adaptive block-based algorithm.
Awesome Simd
37 ⭐
A curated list of awesome SIMD frameworks, libraries and software
Simd Adler32
19 ⭐
A SIMD-accelerated Adler-32 rolling hash algorithm implementation.
Smolscale
13 ⭐
Fast, embeddable C code for smooth image scaling and pixel format conversion
Rakau
14 ⭐
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Fastcode
12 ⭐
A list of fast libraries, primarily x86/64 C++ and Node.js C++ extensions
Bson To JSon
10 ⭐
Fast BSON to JSON string transcoder
Wonder93 Argon2
10 ⭐
A multi-arch library implementing the Argon2 password hashing algorithm.
Guided_missile_simulation
16 ⭐
Air-to-Air Missile Simulation written in Fortran
Std_find_simd
19 ⭐
std::find simd version
Ericgrange Dwscript
75 ⭐
Delphi Web Script general purpose scripting engine