davidkimai - Overview

💭

optimizing my reward function

💭

optimizing my reward function

Pinned Loading

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 8.6k 961
Godel - Kubernetes for Agents. Built on Pi/OpenClaw

TypeScript 2
Backup for Bird - Fast X CLI by @steipete

Shell 7 1
Heron AI 90min Work Test | Project - Michael Chen METR Sabotage Threat Modeling. A small eval harness designed to measure Monitor Negligence. The script simulates a "Monitor" (the insider) reviewin…

Python 2
Ralph Zero - Your agents can now orchestrate Ralph using Skills! Ralph Zero is an orchestrator system wrapped in an Agent Skills package over Geoffrey Huntley's Ralph Loop that implements complex m…

Python 7 2
Agentic Reinforcement Learning 101. A pragmatic course for AI/ML Engineers based on "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey" https://arxiv.org/abs/2509.02547

Roff 18 2