lars20070 - Overview
Pinned Loading
-
Evaluation framework for Pydantic AI agents
Python 1
-
1
# Agent evaluations with `pytest-assay`3
* [`pytest-assay`](https://github.com/lars20070/pytest-assay) was written to address two problems:
4
1. Limited access to agent responses, making flexible evaluation difficult
5
2. Lack of scoring evaluations — most frameworks only support binary pass/fail evaluations
-
MCP server for WolframScript
Python
-
2
FROM mcr.microsoft.com/devcontainers/python:${PYTHON_VERSION}-trixie4
# Copy uv and uvx (latest version)5
COPY --from=ghcr.io/astral-sh/uv:latest /uv /uvx /bin/ -
Forked from 1st1/lat.md
Agent Lattice: a knowledge graph for your codebase, written in markdown.
TypeScript