davidkimai - Overview

💭

optimizing my reward function

Pinned

  1. "Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

    Python 8.6k 961

  2. Godel - Kubernetes for Agents. Built on Pi/OpenClaw

    TypeScript 2

  3. Backup for Bird - Fast X CLI by @steipete

    Shell 7 1

  4. Heron AI 90min Work Test | Project - Michael Chen METR Sabotage Threat Modeling. A small eval harness designed to measure Monitor Negligence. The script simulates a "Monitor" (the insider) reviewin…

    Python 2

  5. Ralph Zero - Your agents can now orchestrate Ralph using Skills! Ralph Zero is an orchestrator system wrapped in an Agent Skills package over Geoffrey Huntley's Ralph Loop that implements complex m…

    Python 7 2

  6. Agentic Reinforcement Learning 101. A pragmatic course for AI/ML Engineers based on "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey" https://arxiv.org/abs/2509.02547

    Roff 18 2