jerryzh168 - Overview
Pinned Loading
-
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
Forked from pytorch/ao
torchao: PyTorch Architecture Optimization (AO). A repository to host AO techniques and performant kernels that work with PyTorch.
Python 1
-
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python
-
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
-
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python