WangErXiao - Overview
A high-throughput and memory-efficient inference and serving engine for LLMs
Python, 71.9k stars, 13.9k forks
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Python, 2.8k stars, 418 forks