CopilotKit/llmock: Deterministic mock LLM server for testing across processes, with fixture-based routing and SSE streaming

Deterministic mock LLM server for testing. A real HTTP server on a real port — not an in-process interceptor — so every process in your stack (Playwright, Next.js, agent workers, microservices) can point at it via OPENAI_BASE_URL / ANTHROPIC_BASE_URL and get reproducible, instant responses. Streams SSE in real OpenAI, Claude, Gemini, Bedrock, Azure, Vertex AI, Ollama, and Cohere API formats, driven entirely by fixtures. Zero runtime dependencies.
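Because the mock is reached over HTTP, each process only needs to know which base URL to call. The sketch below shows one way a service might resolve that URL from its environment. `resolveOpenAIBaseUrl` and the `Env` type are hypothetical helpers for illustration, not part of llmock, and the `/v1` suffix on the mock address is an assumption about how the client expects its base URL to look.

```typescript
// Hypothetical helper (not part of llmock): pick the base URL a service
// should call, preferring an override from the environment.
type Env = Record<string, string | undefined>;

// Default base URL used by the official OpenAI SDKs.
const DEFAULT_OPENAI_BASE_URL = "https://api.openai.com/v1";

function resolveOpenAIBaseUrl(env: Env): string {
  const override = env["OPENAI_BASE_URL"]?.trim();
  const base =
    override && override.length > 0 ? override : DEFAULT_OPENAI_BASE_URL;
  // Strip trailing slashes so request paths can be appended safely.
  return base.replace(/\/+$/, "");
}

// In tests, the mock's address is injected into every child process.
// The port and the "/v1" suffix here are assumptions for illustration.
const testBaseUrl = resolveOpenAIBaseUrl({
  OPENAI_BASE_URL: "http://127.0.0.1:5555/v1/",
});

// With no override, the real API is used.
const prodBaseUrl = resolveOpenAIBaseUrl({});
```

The official OpenAI and Anthropic SDKs already read OPENAI_BASE_URL and ANTHROPIC_BASE_URL themselves, so a helper like this is only needed for hand-rolled HTTP clients.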

Quick Start

npm install @copilotkit/llmock

import { LLMock } from "@copilotkit/llmock";

const mock = new LLMock({ port: 5555 });

mock.onMessage("hello", { content: "Hi there!" });

const url = await mock.start();
// Point your OpenAI client at `url` instead of https://api.openai.com

// ... run your tests ...

await mock.stop();
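To make the "point your client at `url`" step concrete, here is a minimal sketch of the request a test might send to trigger the "hello" fixture without using an SDK. It assumes that `url` is the server origin and that the mock serves the OpenAI-style path `/v1/chat/completions`. `buildChatRequest`, the model name, and the API key are placeholders, not part of llmock.

```typescript
// Subset of the OpenAI chat completions request shape.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Hypothetical helper: build (but do not send) a chat completion request.
function buildChatRequest(baseUrl: string, userText: string): ChatRequest {
  const messages: ChatMessage[] = [{ role: "user", content: userText }];
  return {
    // Assumption: the mock exposes the same path as the real OpenAI API.
    url: `${baseUrl.replace(/\/+$/, "")}/v1/chat/completions`,
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      // Placeholder key; a mock server has no real credentials to check.
      Authorization: "Bearer test-key",
    },
    // Placeholder model name; the user message is what the fixture matches.
    body: JSON.stringify({ model: "gpt-4o-mini", messages, stream: false }),
  };
}

const chatRequest = buildChatRequest("http://127.0.0.1:5555", "hello");
// With the mock running, the request could be sent with:
//   const res = await fetch(chatRequest.url, chatRequest);
```

Keeping request construction separate from sending makes it easy to assert on the exact payload a fixture is expected to match.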

Features

CLI Quick Reference

| Option | Short | Default | Description |
| --- | --- | --- | --- |
| --port | -p | 4010 | Port to listen on |
| --host | -h | 127.0.0.1 | Host to bind to |
| --fixtures | -f | ./fixtures | Path to fixtures directory or file |
| --latency | -l | 0 | Latency between SSE chunks (ms) |
| --chunk-size | -c | 20 | Characters per SSE chunk |
| --watch | -w | | Watch fixture path for changes and reload |
| --log-level | | info | Log verbosity: silent, info, debug |
| --validate-on-load | | | Validate fixture schemas at startup |
| --chaos-drop | | 0 | Chaos: probability of 500 errors (0-1) |
| --chaos-malformed | | 0 | Chaos: probability of malformed JSON (0-1) |
| --chaos-disconnect | | 0 | Chaos: probability of disconnect (0-1) |
| --metrics | | | Enable Prometheus metrics at /metrics |
| --record | | | Record mode: proxy unmatched to real APIs |
| --strict | | | Strict mode: fail on unmatched requests |
| --provider-* | | | Upstream URL per provider (with --record) |
| --help | | | Show help |

# Start with bundled example fixtures
llmock

# Custom fixtures on a specific port
llmock -p 8080 -f ./my-fixtures

# Simulate slow responses
llmock --latency 100 --chunk-size 5

# Record mode: proxy unmatched requests to real APIs and save as fixtures
llmock --record --provider-openai https://api.openai.com --provider-anthropic https://api.anthropic.com

# Strict mode in CI: fail if any request doesn't match a fixture
llmock --strict -f ./fixtures
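The --chunk-size and --latency options above shape how a fixture's content is streamed. As a rough illustration, the sketch below splits a response into fixed-size chunks, frames each one as an OpenAI-style SSE event, and then reassembles the text on the client side. All function names are hypothetical, the event payload is a reduced subset of the real chunk format, and llmock's actual implementation may differ. Latency is not modeled here; it would be a delay between frames.

```typescript
// Illustrative only: split text into chunks of `chunkSize` characters
// (UTF-16 code units), as a --chunk-size style option might.
function chunkText(text: string, chunkSize: number): string[] {
  if (!Number.isInteger(chunkSize) || chunkSize <= 0) {
    throw new RangeError("chunkSize must be a positive integer");
  }
  const chunks: string[] = [];
  for (let i = 0; i < text.length; i += chunkSize) {
    chunks.push(text.slice(i, i + chunkSize));
  }
  return chunks;
}

// Frame each chunk as a reduced OpenAI-style SSE event, then terminate
// the stream with the [DONE] sentinel.
function toSseFrames(chunks: string[]): string[] {
  const frames = chunks.map(
    (c) => `data: ${JSON.stringify({ choices: [{ delta: { content: c } }] })}\n\n`,
  );
  frames.push("data: [DONE]\n\n");
  return frames;
}

// Client side: read the stream back and concatenate the content deltas.
function readSseContent(stream: string): string {
  let out = "";
  for (const event of stream.split("\n\n")) {
    if (!event.startsWith("data: ")) continue; // skip blanks and other fields
    const payload = event.slice("data: ".length);
    if (payload === "[DONE]") break;
    out += JSON.parse(payload).choices[0].delta.content ?? "";
  }
  return out;
}

const sseFrames = toSseFrames(chunkText("Hi there!", 5));
const reassembled = readSseContent(sseFrames.join(""));
```

A smaller chunk size produces more frames for the same content, which is useful for exercising incremental rendering in a UI under test.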

Documentation

Full API reference, fixture format, E2E patterns, and provider-specific guides:

https://llmock.copilotkit.dev/docs.html

Real-World Usage

CopilotKit uses llmock throughout its test suite to verify AI agent behavior across multiple LLM providers without calling real APIs.

License

MIT