zhink - Overview

View zhink's full-sized avatar

Block or report zhink

Pinned Loading

  1. Forked from PaddlePaddle/Paddle

    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

    C++

  2. Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++

  3. Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  4. Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python

  5. High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

    Python 3.7k 735