Vpu Count
24 ⭐
Information about AVX-512 support on recent Intel processors
Quadray Engine
18 ⭐
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Unisimd Assembler
71 ⭐
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Parsing Int Series
19 ⭐
Parse multiple decimal integers separated by an arbitrary number of delimiters
Fastcode
12 ⭐
A list of fast libraries, primarily x86/64 C++ and Node.js C++ extensions
Ultra Sort
26 ⭐
DSL for SIMD Sorting on AVX2 & AVX512
Rakau
14 ⭐
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Wojciechmula Toys
220 ⭐
Storage for my snippets, toy programs, etc.
Avx512counters
39 ⭐
AVX-512 hardware counters collector written in Go, based on Go toolchain
Simd Byte Lookup
20 ⭐
SIMDized check which bytes are in a set
Base64 Avx512
175 ⭐
Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Vcdevel Vc
1163 ⭐
SIMD Vector Classes for C++
Positional Popcount
46 ⭐
Fast C functions for computing the positional popcount (pospopcnt).
Stormbitmaps
12 ⭐
Fast algorithms for computing XX^T for binary matrices
Nsimd
218 ⭐
Agenium Scale vectorization library for CPUs and GPUs
Libflagstats
14 ⭐
Efficient C functions to compute the summary statistics (flagstats) for sequencing read sets.
Std Simd
387 ⭐
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Libalgebra
33 ⭐
Fast C header-only library for popcnt, pospopcnt, and set algebraic operations
Google Highway
567 ⭐
Performance-portable, length-agnostic SIMD with runtime dispatch
Libxsmm
597 ⭐
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Md5 Simd
99 ⭐
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Boost.simd
232 ⭐
Boost SIMD
Md5 Optimisation
34 ⭐
The fastest MD5 implementation using x86 assembly
Simd
1475 ⭐
C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512 for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, NEON for ARM.
Sse Popcount
249 ⭐
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Sse4 Strstr
138 ⭐
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Univdisasm
77 ⭐
x86 Disassembler and Analyzer
Std_find_simd
19 ⭐
SIMD version of std::find
Corrfunc
133 ⭐
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Sleef
419 ⭐
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Cex
43 ⭐
The CEX Cryptographic library in C++
Yask
76 ⭐
YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Asm Dude
3909 ⭐
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Xsimd
1233 ⭐
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Onednn
2665 ⭐
oneAPI Deep Neural Network Library (oneDNN)
Kfr
1177 ⭐
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Sha256 Simd
719 ⭐
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides up to an 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
Wonder93 Argon2
10 ⭐
A multi-arch library implementing the Argon2 password hashing algorithm.
Base64simd
119 ⭐
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Umesimd
79 ⭐
UME::SIMD A library for explicit simd vectorization.
Ternary Logic
19 ⭐
Support for ternary logic in SSE, XOP, AVX2 and x86 programs
Rv
78 ⭐
RV: A Unified Region Vectorizer for LLVM
Libpopcnt
244 ⭐
🚀 Fast C/C++ bit population count library
Osaca
200 ⭐
Open Source Architecture Code Analyzer
Hybridizer Basic Samples
199 ⭐
Examples of C# code compiled to GPU by hybridizer
Simde
1347 ⭐
Implementations of SIMD instruction sets for systems which don't natively support them.
Libsimdpp
990 ⭐
Portable header-only C++ low level SIMD library