wujack - Overview

Skip to content

Navigation Menu

Sign in

Appearance settings

View wujack's full-sized avatar

Block or report wujack

Popular repositories Loading

  1. Forked from FMInference/FlexLLMGen

    Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.

    Python