wickedfoo - Overview

Jeff Johnson wickedfoo

SIMD + GPU + FPGA + ASIC stuff for AI/ML. I wrote the original PyTorch GPU backend, GPU Faiss, and many other AI GPU things broadly in use across the industry.

Pinned

  1. An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.

    SystemVerilog · 400 stars · 40 forks

  2. A library for efficient similarity search and clustering of dense vectors.

    C++ · 39.6k stars · 4.3k forks

  3. Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python · 98.8k stars · 27.4k forks

  4. GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.

    Cuda · 380 stars · 33 forks

  5. Quantize transformers to any learned arbitrary 4-bit numeric format

    Python · 53 stars · 5 forks