New Test Run

2 runs currently active:

smoke-xsr ? · ?
smoke-ent ? · ?

You can start another run in parallel; use "Skip purge" to keep the brain intact for the active run.

Skip if another run is active.

DeepSeek

Kimi (Moonshot AI)

▸ Per-phase model overrides (optional)
Generate
No MCP: creates test data
Ingest
MCP: stores docs in brain
Query
MCP: retrieves from brain
Evaluate
No MCP: judges answer

Budget control per provider: select a model above to see which applies.