40 Open Source AVX-512 Software Projects
Free and open source AVX-512 code projects including engines, APIs, generators, and tools.
Asm Dude 3816 ⭐
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Simd 1136 ⭐
C++ image processing and machine learning library using SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX (AltiVec) and VSX (Power7), and NEON for ARM.
Kfr 893 ⭐
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Xsimd 877 ⭐
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)
Sha256 Simd 593 ⭐
Accelerate SHA256 computations in pure Go using AVX512, SHA256 and AVX2 for Intel and ARM64 for ARM. With AVX512 it provides up to an 8x improvement (over 3 GB/s per core) compared to AVX2. With SHA256, speedups of about 4x compared to AVX2 have been observed.
Libxsmm 485 ⭐
Library targeting Intel Architecture for specialized dense and sparse matrix operations, and deep learning primitives.
Base64 Avx512 149 ⭐
Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"
Base64simd 103 ⭐
Base64 encoding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM NEON)
Yask 64 ⭐
YASK (Yet Another Stencil Kit): a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Md5 Simd 57 ⭐
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Libflagstats 12 ⭐
Efficient C functions to compute the summary statistics (flagstats) for sequencing read sets.