A2S - Der Bildungseinrichter (The Learning Furnisher)
Pinned Loading
-
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
-
Forked from vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Python