kevinch-nv - Overview
Popular repositories Loading
-
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ 1
-
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
C++
-
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance scoring engine for ML models
C++