yinghai - Overview
Pinned Loading
-
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python 1
-
Forked from apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Python
-
Forked from NVIDIA/TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
C++
-
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python
-
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python