raullenchai - Overview

Rapid-MLX Rapid-MLX Public

Forked from waybarrios/vllm-mlx

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI repl…

Python 128 64