raullenchai - Overview
Rapid-MLX Public
Forked from waybarrios/vllm-mlx
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI repl…