open-world-agents

Pinned Loading

  1. Pydantic media reference for images and video frames (with timestamp support) from data URIs, HTTP URLs, file URIs, and local paths. Features lazy loading and optimized batch video decoding.

    Python 11 2

  2. High-performance desktop recorder for Windows. Captures screen, audio, keyboard, mouse, and window events.

    Python 33 3

  3. Everything you need to build state-of-the-art foundation multimodal desktop agent, end-to-end.

    Python 34 10

  4. A conda-smithy repository for gstreamer-bundle.

    1

  5. Browser-based visualization tool for exploring OWAMcap datasets with synchronized playback of screen recordings and interaction events.

    JavaScript 3