shifan3 - Overview
Popular repositories Loading
-
TensorRT-LLM-qwen2-vl TensorRT-LLM-qwen2-vl Public
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ 1
-
symengine symengine Public
Forked from symengine/symengine
SymEngine is a fast symbolic manipulation library, written in C++
C++
-
sentence-transformers sentence-transformers Public
Forked from huggingface/sentence-transformers
Sentence Embeddings with BERT & XLNet
Python
-
pt-search-deploy pt-search-deploy Public
-
fast-autocomplete fast-autocomplete Public
Forked from seperman/fast-autocomplete
Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough
Python