candanzg - Overview
Popular repositories Loading
-
vllm-ascend vllm-ascend Public
Forked from vllm-project/vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
C++
-
vllm vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
-
recsys-examples recsys-examples Public
Forked from NVIDIA/recsys-examples
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
Python
-
static-constraint-decoding static-constraint-decoding Public
Forked from youtube/static-constraint-decoding
Sparse Transition Matrix-Accelerated Trie Index for Constrained Decoding (https://arxiv.org/abs/2602.22647)
Python