# FractalMind

AI system with evolutionary memory and autonomous learning. It combines graph precision with vector-embedding flexibility to create a knowledge engine that mimics human cognition.
## Features
- ✅ Free Models: Use Ollama Cloud's nemotron-3-nano (30B) and nomic-embed-text for free
- ✅ Local Embeddings: FastEmbed generates embeddings locally at ingestion time
- ✅ Sovereign Data: All data stored in your own SurrealDB database
- ✅ Auto-Learning: REM phase consolidates memory and learns from web search
- ✅ Fractal Graph: RAPTOR algorithm creates self-similar knowledge hierarchy
## Architecture

### Operating Phases
- Vigilia (Wakefulness): Fast responses using fractal graph navigation with SSSP
- REM Phase (Sleep): Asynchronous learning, memory consolidation, and web search
### Core Components
- Storage Engine: SurrealDB with SSD persistence and HNSW indices for vectors
- Fractal Graph: RAPTOR recursive structure with semantic clustering
- Multi-User: Strict separation with NAMESPACE (global/personal) and SCOPE per user
- Optimized Navigation: O(m log^(2/3) n) using hopsets for shortest paths
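The O(m log^(2/3) n) bound comes from augmenting the graph with hopsets (precomputed shortcut edges that shorten paths without changing distances). As a reference point, here is a sketch of the plain Dijkstra baseline that such a construction accelerates, using the project's 1/similarity edge weights; this is illustrative only, not the actual code in `src/graph/`:

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

/// Classic Dijkstra over an adjacency list: `adj[u]` holds `(v, weight)` pairs.
/// Edge weights are 1/similarity, so a shorter path means more relevant hops.
/// Returns the shortest distance from `start` to every node.
fn dijkstra(adj: &[Vec<(usize, f64)>], start: usize) -> Vec<f64> {
    let mut dist = vec![f64::INFINITY; adj.len()];
    dist[start] = 0.0;
    // Min-heap keyed by distance. f64 is not Ord, so we store the raw bits,
    // whose ordering matches the numeric ordering for non-negative weights.
    let mut heap = BinaryHeap::new();
    heap.push(Reverse((0.0f64.to_bits(), start)));
    while let Some(Reverse((d_bits, u))) = heap.pop() {
        let d = f64::from_bits(d_bits);
        if d > dist[u] {
            continue; // Stale heap entry; a shorter path was already found.
        }
        for &(v, w) in &adj[u] {
            let nd = d + w;
            if nd < dist[v] {
                dist[v] = nd;
                heap.push(Reverse((nd.to_bits(), v)));
            }
        }
    }
    dist
}

fn main() {
    // Tiny graph: 0 -> 1 (0.5), 0 -> 2 (1.0), 1 -> 2 (0.25)
    let adj = vec![vec![(1, 0.5), (2, 1.0)], vec![(2, 0.25)], vec![]];
    println!("{:?}", dijkstra(&adj, 0)); // [0.0, 0.5, 0.75]
}
```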
## Project Structure

```
fractalmind/
├── src/
│   ├── models/      # Data models (nodes, edges, embeddings)
│   ├── db/          # SurrealDB configuration and queries
│   ├── api/         # HTTP endpoints (Axum)
│   ├── services/    # Business logic (ingestion, REM, queries)
│   ├── graph/       # Graph algorithms (SSSP, RAPTOR)
│   ├── embeddings/  # Embedding generation (fastembed)
│   ├── cache/       # LRU cache for frequent nodes
│   └── utils/       # General utilities
├── docs/            # Technical documentation
│   ├── FRACTAL_MIND_CORE_SPEC.md
│   ├── GUIA_USO_MODELOS_FRACTALES.md
│   ├── MODELOS_FRACTALES_IMPLEMENTACION.md
│   └── OpenClaw_INTEGRATION.md
├── searxng/
│   └── searxng-config.yml   # SearXNG configuration (local web search)
├── docker-compose.yml       # Service orchestration (includes SearXNG)
└── Dockerfile               # Rust backend image
```
## Quick Start

### Prerequisites
- Rust 1.75+
- Docker & Docker Compose
- Minimum 4GB RAM (8GB recommended)
Free LLM Models: FractalMind uses Ollama Cloud's free-tier models:
- `nemotron-3-nano` for chat and summarization (30B parameters)
- `nomic-embed-text:v1.5` for embeddings (768D)
- Total cost: $0/month on the free plan
### Demo
FractalMind API running on port 9000 with SurrealDB and SearXNG
### Installation

- Clone and run:

```bash
git clone <repo-url>
cd fractalmind
./dev.sh run
```
Docker Services:
- SurrealDB (port 8000): Database with persistent SSD storage in `./data/surrealdb/`
- SearXNG (port 8080): Local web search for the REM phase
- FractalMind API (port 9000): Main API server (default: 9000, configurable via `SERVER_PORT`)

Free LLM Models (Ollama Cloud):
- `nemotron-3-nano` - 30B parameters, excellent for chat/summarization
- `nomic-embed-text:v1.5` - 274MB, high-quality embeddings
- Cost: $0/month (free tier)

Data Persistence:
- All database data is stored in `./data/surrealdb/fractalmind.db` on your host machine
- Survives container restarts and updates
- Backup: copy the `data/surrealdb/` directory

SearXNG Integration:
- Automatically included in docker-compose (profile: `searxng`)
- Provides web search for the REM phase
- Runs on http://localhost:8080
- If unavailable, REM-phase web search is disabled (the system still works with local knowledge)
- Build and run:

```bash
cargo build --release
cargo run
```
Or with full Docker Compose:

```bash
docker-compose --profile full up -d
```
## Development

```bash
# Fast build (development)
cargo build

# Tests
cargo test

# Integration tests (requires running SurrealDB)
cargo test -- --ignored

# Linting
cargo clippy

# Formatting
cargo fmt
```
## API Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/v1/ingest` | POST | Ingest files (PDF, images, text) |
| `/v1/remember` | POST | Episodic/conversational memory |
| `/v1/ask` | POST | Query with fractal navigation |
| `/v1/sync_rem` | POST | Trigger REM phase (consolidation + web) |
| `/v1/memory` | PATCH | Manual knowledge update |
## Data Model

### FractalNode
Fundamental graph node with:
- Type: Leaf, Parent, Root
- Status: Complete, Incomplete, Pending, Deprecated
- Embedding: Semantic vector (768D Nomic, 384D BGE, 512D CLIP)
- Namespace: global_knowledge or user_
- Metadata: Source, tags, language, access count
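The fields above can be sketched as a Rust data model. The names and exact types here are illustrative assumptions; the real definitions live in `src/models/`:

```rust
/// Illustrative sketch of a fractal graph node; not the project's actual definition.
#[derive(Debug, Clone, PartialEq)]
enum NodeType { Leaf, Parent, Root }

#[derive(Debug, Clone, PartialEq)]
enum NodeStatus { Complete, Incomplete, Pending, Deprecated }

#[derive(Debug, Clone)]
struct FractalNode {
    id: String,
    node_type: NodeType,
    status: NodeStatus,
    embedding: Vec<f32>,    // 768D Nomic, 384D BGE, or 512D CLIP
    namespace: String,      // "global_knowledge" or a per-user namespace
    source: Option<String>, // Metadata: provenance of the content
    tags: Vec<String>,
    language: Option<String>,
    access_count: u64,
}

fn main() {
    let node = FractalNode {
        id: "node:1".into(),
        node_type: NodeType::Leaf,
        status: NodeStatus::Incomplete,
        embedding: vec![0.0; 768],
        namespace: "global_knowledge".into(),
        source: Some("ingest".into()),
        tags: vec!["demo".into()],
        language: Some("en".into()),
        access_count: 0,
    };
    println!("{:?} node with {}D embedding", node.node_type, node.embedding.len());
}
```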
### FractalEdge
Edge with weight calculated as 1/similarity:
- Types: ParentChild, Semantic, Temporal, CrossNamespace
- Weight: For shortest path (lower = more relevant)
- Similarity: Cosine between embeddings (0.0-1.0)
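A minimal sketch of how an edge weight could be derived from two embeddings, assuming the straightforward cosine/reciprocal formula described above (the actual computation lives in the project's services):

```rust
/// Cosine similarity between two embedding vectors (assumed same dimension).
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
}

/// Edge weight = 1/similarity: highly similar nodes are "close" for shortest paths.
/// Non-positive similarity gets an infinite weight so the edge is never preferred.
fn edge_weight(similarity: f32) -> f32 {
    if similarity <= 0.0 { f32::INFINITY } else { 1.0 / similarity }
}

fn main() {
    let a = [1.0f32, 0.0];
    let b = [1.0f32, 1.0];
    let sim = cosine_similarity(&a, &b); // ~0.7071
    println!("similarity = {sim:.4}, weight = {:.4}", edge_weight(sim));
}
```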
## Learning Cycle (REM Phase)

1. Detect nodes with `status: incomplete`
2. Web search to gather multiple perspectives
3. Synthesize information into new nodes (global namespace)
4. Semantic clustering of leaves → generate parent summaries
5. Create links between personal ↔ global memory
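The semantic-clustering step above (leaves → parent summaries) can be illustrated with a greedy threshold sketch: each item joins the first cluster whose representative is similar enough, otherwise it starts a new cluster. This is a deliberate simplification of RAPTOR-style clustering, with a hypothetical `sim` closure standing in for embedding cosine similarity:

```rust
/// Greedy threshold clustering: a simplified stand-in for RAPTOR's semantic clustering.
/// `sim` compares two items (e.g. cosine similarity of their embeddings);
/// an item joins the first cluster whose representative is within `threshold`.
fn cluster_by_similarity<T, F>(items: &[T], threshold: f64, sim: F) -> Vec<Vec<usize>>
where
    F: Fn(&T, &T) -> f64,
{
    let mut clusters: Vec<Vec<usize>> = Vec::new();
    for (i, item) in items.iter().enumerate() {
        // The first member of each cluster acts as its representative.
        match clusters.iter_mut().find(|c| sim(&items[c[0]], item) >= threshold) {
            Some(c) => c.push(i),
            None => clusters.push(vec![i]), // Start a new cluster.
        }
    }
    clusters
}

fn main() {
    // 1-D stand-ins for embeddings; similarity decays with distance into (0, 1].
    let xs = [0.1, 0.15, 0.9, 0.95, 0.5];
    let sim = |a: &f64, b: &f64| 1.0 / (1.0 + (a - b).abs());
    println!("{:?}", cluster_by_similarity(&xs, 0.9, sim)); // [[0, 1], [2, 3], [4]]
}
```

Each resulting cluster would then be summarized by the LLM into a parent node, recursively, yielding the fractal hierarchy.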
## Configuration

Key variables in `.env`:

```env
# Database
SURREAL_URL=ws://localhost:8000
SURREAL_NS=fractalmind
SURREAL_DB=knowledge

# Embeddings (local - no external service needed)
# FastEmbed generates embeddings locally when you ingest content
# Embeddings are stored in SurrealDB with HNSW index
# Once ingested, no running embedding model is required!
EMBEDDING_MODEL=nomic-embed-text-v1.5
EMBEDDING_DIMENSION=768

# LRU Cache
CACHE_SIZE=1000

# REM Phase
REM_INTERVAL_MINUTES=60
WEB_SEARCH_PROVIDER=searxng
WEB_SEARCH_BASE_URL=http://localhost:8080
WEB_SEARCH_ENABLED=true

# LLM Providers (Ollama Cloud - Economic/Free Models)
# Use these free models for cost-effective operation:
LLM_PREFER_LOCAL=false  # Prioritize Ollama Cloud for more capacity

# Embedding Model (Free Tier)
EMBEDDING_PROVIDER=ollama-cloud
EMBEDDING_MODEL=nomic-embed-text:v1.5

# Chat Model (Free - 30B parameters)
CHAT_PROVIDER=ollama-cloud
CHAT_MODEL=nemotron-3-nano

# Summarizer Model (Free - 30B parameters)
SUMMARIZER_PROVIDER=ollama-cloud
SUMMARIZER_MODEL=nemotron-3-nano

# Ollama Cloud API Key
OLLAMA_API_KEY=your_ollama_cloud_api_key_here
OLLAMA_CLOUD_BASE_URL=https://api.ollama.com
```
## 🎯 Economic/Free Models (Ollama Cloud)
Free Tier Features:
- Unlimited public models on the free plan
- No credit card required for basic usage
- Pay only for heavy usage (Pro: $20/mo, Max: $100/mo)
- Data privacy: No storage of prompts or responses
Recommended Free Models:
| Model | Size | Best For | Cost |
|---|---|---|---|
| nomic-embed-text:v1.5 | 274MB | Embeddings | Free |
| nemotron-3-nano | 30B | Chat/Summarizer | Free |
| llama3:8b | 4.7GB | General chat | Free |
Switching to Local Models:
If you prefer local inference (sovereignty), set:

```env
LLM_PREFER_LOCAL=true
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_API_KEY=  # Empty for local
```

## Embeddings - How It Works

FractalMind uses local embeddings via FastEmbed - no external API needed!
- Ingestion Phase: When you add content via `/v1/ingest`, FastEmbed automatically generates embeddings locally
- Storage: Embeddings + content stored in SurrealDB with an HNSW vector index
- Query Time: Embeddings are loaded from the DB - no embedding model needed at runtime!
- REM Phase: Uses web search (SearXNG) to gather knowledge - no embeddings needed
Benefits:
| Feature | Description |
|---|---|
| ✅ No External API | FastEmbed generates embeddings locally |
| ✅ No Cost | No OpenAI/Anthropic costs for embeddings |
| ✅ No Model Running | Embeddings pre-computed at ingestion |
| ✅ Fast | FastEmbed is optimized for speed |
| ✅ Multiple Models | Nomic (768D), BGE (384D), CLIP (512D) |
Example Usage:

```bash
# Ingest content - FastEmbed handles embeddings automatically
curl -X POST http://localhost:9000/v1/ingest \
  -H "Content-Type: application/json" \
  -d '{"content": "Your text here", "namespace": "global"}'

# Query - embeddings retrieved from DB (no model needed)
curl -X POST http://localhost:9000/v1/ask \
  -d '{"query": "What is this about?", "namespace": "global"}'
```
## Roadmap

- Fractal data model
- SurrealDB configuration with HNSW
- Rust project structure
- LLM module (Brain) with Ollama/OpenAI providers
- Embeddings service (fastembed)
- Optimized LRU cache
- REST API (Axum)
- RAPTOR algorithm for clustering
- SSSP navigation with hopsets
- Asynchronous REM phase
- PDF/image ingestion with OCR
- Web UI (React/Tauri)
## License

MIT
Last updated: 2026-02-26 15:51:01 -03
