# Deployment Overview
Choose the right deployment mode based on your needs and available infrastructure.
## 🎯 Deployment Modes

### Minimal - Code Graph Only

**Perfect for:** developers who want code intelligence without LLM overhead.
**Requirements:**

- Neo4j database
- Docker & docker-compose
- No LLM needed ✅
- No embedding model needed ✅
**Resources:**

- Image size: ~500MB
- Memory: ~1GB RAM
- Startup time: ~5 seconds
**Available Features:**

- ✅ Repository ingestion and code parsing
- ✅ File relationship discovery (imports, dependencies)
- ✅ Impact analysis (who depends on this file?)
- ✅ Context packing for AI assistants
- ✅ Full-text search on file paths and content
- ❌ Memory Store
- ❌ Knowledge RAG
- ❌ Auto-extraction
**Use When:**

- You want code navigation and analysis only
- You don't need LLM-powered features
- You're working in air-gapped environments
- You want minimal resource usage
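If you prefer driving docker-compose directly rather than the Makefile targets, a Minimal-mode stack can be sketched roughly as follows. The service names, image tags, and environment variable names here are illustrative assumptions; check the project's own compose file for the real ones.

```yaml
# Hypothetical minimal-mode stack; image names and variables are illustrative.
services:
  neo4j:
    image: neo4j:5
    ports:
      - "7474:7474"   # Neo4j browser (HTTP)
      - "7687:7687"   # Bolt protocol
    environment:
      NEO4J_AUTH: neo4j/changeme
    volumes:
      - neo4j_data:/data

  mcp-server:
    image: your-org/mcp-server:latest   # placeholder image name
    ports:
      - "8000:8000"                     # MCP/REST endpoint
    environment:
      DEPLOYMENT_MODE: minimal          # illustrative variable name
      NEO4J_URI: bolt://neo4j:7687
    depends_on:
      - neo4j

volumes:
  neo4j_data:
```

The key point: in Minimal mode only two containers are needed, and no LLM or embedding endpoint has to be reachable.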
### Standard - Code Graph + Memory

**Perfect for:** teams building project knowledge bases.

**Requirements:**

- Neo4j database
- Docker & docker-compose
- Embedding model (Ollama/OpenAI/Gemini) ✅
- No LLM needed ✅
**Resources:**

- Image size: ~600MB
- Memory: ~2GB RAM
- Startup time: ~8 seconds
**Available Features:**

- ✅ All Minimal features
- ✅ Manual memory management (add/update/delete)
- ✅ Vector-based memory search
- ✅ Project memory summaries
- ✅ Memory superseding (track decision changes)
- ❌ Auto-extraction from git/conversations
- ❌ Knowledge RAG
**Use When:**

- You want to maintain project decision logs
- You need searchable team knowledge
- You have access to an embedding service
- You prefer manual curation over auto-extraction
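Standard mode only adds an embedding dependency on top of Minimal. A `.env` fragment for it might look like the sketch below; the variable names are assumptions for illustration, so map them to the project's actual configuration reference.

```ini
# Illustrative .env fragment for Standard mode (variable names are assumptions).

# Option A: local Ollama embeddings
EMBEDDING_PROVIDER=ollama
OLLAMA_BASE_URL=http://localhost:11434
EMBEDDING_MODEL=nomic-embed-text

# Option B: OpenAI embeddings
# EMBEDDING_PROVIDER=openai
# OPENAI_API_KEY=sk-...
# EMBEDDING_MODEL=text-embedding-3-small
```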
### Full - All Features

**Perfect for:** teams wanting complete AI-powered capabilities.

**Requirements:**

- Neo4j database
- Docker & docker-compose
- LLM (Ollama/OpenAI/Gemini/OpenRouter) ✅
- Embedding model ✅
**Resources:**

- Image size: ~800MB
- Memory: ~4GB RAM (+ LLM requirements)
- Startup time: ~15 seconds
**Available Features:**

- ✅ All Standard features
- ✅ Automatic memory extraction from:
    - Git commits
    - AI conversations
    - Code comments (TODO/FIXME/NOTE)
    - Q&A sessions
- ✅ Knowledge base RAG:
    - Document ingestion
    - Intelligent Q&A
    - Multi-format support
- ✅ Batch repository analysis
**Use When:**

- You want fully automated knowledge extraction
- You need document Q&A capabilities
- You have LLM infrastructure available
- You want maximum AI assistance
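Full mode needs both an LLM and an embedding provider configured. A hedged `.env` sketch (again, variable names are assumptions, not the project's confirmed configuration keys):

```ini
# Illustrative .env fragment for Full mode (variable names are assumptions).

# LLM provider: ollama | openai | gemini | openrouter
LLM_PROVIDER=openai
OPENAI_API_KEY=sk-...
LLM_MODEL=gpt-4o-mini

# Embedding provider (same options as Standard mode)
EMBEDDING_PROVIDER=openai
EMBEDDING_MODEL=text-embedding-3-small
```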
## 📊 Mode Comparison Matrix

| Feature Category | Minimal | Standard | Full |
|---|---|---|---|
| **Code Graph** | | | |
| Repository ingestion | ✅ | ✅ | ✅ |
| Incremental updates | ✅ | ✅ | ✅ |
| File search | ✅ | ✅ | ✅ |
| Impact analysis | ✅ | ✅ | ✅ |
| Context packing | ✅ | ✅ | ✅ |
| **Memory Store** | | | |
| Add memory | ❌ | ✅ | ✅ |
| Search memories | ❌ | ✅ (vector) | ✅ (vector) |
| Update/delete | ❌ | ✅ | ✅ |
| Supersede | ❌ | ✅ | ✅ |
| Extract from git | ❌ | ❌ | ✅ (LLM) |
| Extract from chat | ❌ | ❌ | ✅ (LLM) |
| Extract from code | ❌ | ❌ | ✅ (LLM) |
| **Knowledge RAG** | | | |
| Add documents | ❌ | ❌ | ✅ |
| Query knowledge | ❌ | ❌ | ✅ (LLM) |
| Vector search | ❌ | ❌ | ✅ |
| **Infrastructure** | | | |
| Neo4j | Required | Required | Required |
| Embedding | - | Required | Required |
| LLM | - | - | Required |
| **Performance** | | | |
| Image size | ~500MB | ~600MB | ~800MB |
| RAM usage | ~1GB | ~2GB | ~4GB+ |
| Startup time | ~5s | ~8s | ~15s |
## 🏗️ Architecture Diagrams

### Minimal Mode Architecture

```mermaid
graph TB
    subgraph "Client"
        A[Claude Desktop / API Client]
    end
    subgraph "Docker Network"
        B[MCP Server<br/>Minimal]
        C[(Neo4j<br/>Graph DB)]
    end
    subgraph "Code Graph Services"
        D[Code Ingestor]
        E[Graph Service]
        F[Ranker]
        G[Pack Builder]
    end
    A -->|MCP/REST| B
    B --> D
    B --> E
    B --> F
    B --> G
    D -->|Store| C
    E -->|Query| C
    style B fill:#90EE90
    style C fill:#87CEEB
```

### Standard Mode Architecture
```mermaid
graph TB
    subgraph "Client"
        A[Claude Desktop / API Client]
    end
    subgraph "Docker Network"
        B[MCP Server<br/>Standard]
        C[(Neo4j<br/>Graph DB)]
    end
    subgraph "Code Graph Services"
        D[Code Ingestor]
        E[Graph Service]
    end
    subgraph "Memory Services"
        F[Memory Store]
    end
    subgraph "External"
        G[Embedding Service<br/>Ollama/OpenAI]
    end
    A -->|MCP/REST| B
    B --> D
    B --> E
    B --> F
    D -->|Store| C
    E -->|Query| C
    F -->|Store/Search| C
    F -->|Vectorize| G
    style B fill:#FFD700
    style C fill:#87CEEB
    style G fill:#FFA07A
```

### Full Mode Architecture
```mermaid
graph TB
    subgraph "Client"
        A[Claude Desktop / API Client]
    end
    subgraph "Docker Network"
        B[MCP Server<br/>Full]
        C[(Neo4j<br/>Graph DB)]
        D[Ollama<br/>Optional]
    end
    subgraph "All Services"
        E[Code Graph]
        F[Memory Store]
        G[Knowledge RAG]
        H[Memory Extractor]
    end
    subgraph "External/Optional"
        I[LLM Service<br/>OpenAI/Gemini]
        J[Embedding Service]
    end
    A -->|MCP/REST| B
    B --> E
    B --> F
    B --> G
    B --> H
    E -->|Store| C
    F -->|Store/Search| C
    G -->|Store/Query| C
    F -->|Vectorize| J
    G -->|Generate| I
    H -->|Analyze| I
    D -.->|Local LLM| I
    D -.->|Local Embed| J
    style B fill:#FF6347
    style C fill:#87CEEB
    style D fill:#DDA0DD
```

## 🚀 Quick Decision Guide
Use this flowchart to choose your deployment mode:
```mermaid
graph TD
    A[Start] --> B{Do you need<br/>LLM features?}
    B -->|No| C{Do you need<br/>memory search?}
    B -->|Yes| D[Full Mode]
    C -->|No| E[Minimal Mode]
    C -->|Yes| F{Can you provide<br/>embedding service?}
    F -->|Yes| G[Standard Mode]
    F -->|No| E
    E --> H[✅ Code Graph only<br/>✅ No external deps<br/>✅ Fast & lightweight]
    G --> I[✅ Code Graph<br/>✅ Memory Store<br/>⚠️ Needs embedding]
    D --> J{Do you have<br/>local GPU?}
    J -->|Yes| K[Use with-ollama profile]
    J -->|No| L[Use cloud LLM]
    K --> M[✅ All features<br/>✅ Self-hosted<br/>⚠️ High resources]
    L --> N[✅ All features<br/>✅ Lower resources<br/>⚠️ API costs]
    style E fill:#90EE90
    style G fill:#FFD700
    style K fill:#FF6347
    style L fill:#FF6347
```

## 📋 Pre-Deployment Checklist
### For All Modes

- Docker installed (version 20.10+)
- docker-compose installed (version 1.29+)
- At least 4GB of free disk space
- Ports 7474, 7687, and 8000 available
- `.env` file configured
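The common checklist items can be verified with a small pre-flight script. This is an informal sketch (it assumes `lsof` is available for the port check and skips it otherwise), not a tool shipped with the project.

```shell
#!/bin/sh
# Pre-flight sketch: check that the ports this stack needs look free
# and that an .env file exists. Purely informational output.
for port in 7474 7687 8000; do
  if command -v lsof >/dev/null 2>&1 && lsof -i ":$port" >/dev/null 2>&1; then
    echo "WARN: port $port is already in use"
  else
    echo "OK: port $port looks free"
  fi
done
if [ -f .env ]; then
  echo "OK: .env present"
else
  echo "WARN: .env missing"
fi
```

Run it from the repository root before `make docker-*` to catch port collisions early.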
### Additional for Standard Mode

- Embedding service available:
    - Local Ollama running, or
    - OpenAI API key, or
    - Google API key for Gemini
### Additional for Full Mode

- LLM service available:
    - Local Ollama running, or
    - OpenAI API key, or
    - Google API key, or
    - OpenRouter API key
- Embedding service (same as Standard)
- For local Ollama: GPU with 8GB+ VRAM (optional but recommended)
## 🔄 Switching Between Modes
You can switch deployment modes at any time. Data in Neo4j is preserved.
```bash
# Stop current deployment
make docker-stop

# Start different mode
make docker-minimal     # or
make docker-standard    # or
make docker-full
```
**Configuration Required:** When switching to Standard or Full mode, update your `.env` file with the required API keys and service URLs.
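The stop-then-start sequence above can be wrapped in a tiny helper that maps a mode name to the corresponding make target. Only the target names come from the commands above; the wrapper itself is illustrative.

```shell
#!/bin/sh
# Illustrative wrapper around the make targets shown above.
MODE="${1:-standard}"    # minimal | standard | full
case "$MODE" in
  minimal)  TARGET=docker-minimal ;;
  standard) TARGET=docker-standard ;;
  full)     TARGET=docker-full ;;
  *)        echo "unknown mode: $MODE" >&2; TARGET="" ;;
esac
if [ -n "$TARGET" ]; then
  # Print the command rather than running it, so it can be reviewed first.
  echo "make docker-stop && make $TARGET"
fi
```

Because Neo4j data lives in its own volume, switching targets this way preserves your graph, memories, and knowledge base.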