Lyaction - Overview
Pinned Loading
-
A tool designed for llm offline distributed inference from Odps datasource.
Python 4
-
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
-
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python