The CI reliability gate for action-oriented agents.
Updated Mar 5, 2026 · Python
Turn-based political sim where policy decisions ripple through 14 competing factions. Manage legitimacy, navigate crises, and survive your rival across 10 possible endings. Built with React + TypeScript via Claude Code, with deterministic testing and accessibility features.
Deterministic Rust testing utility for simulation and stochastic workflows.
The deterministic heap groomer for C/C++ memory debugging.
WordleBench: a deterministic AI Wordle benchmark. Compare 34+ LLMs (GPT-5, Claude 4.5, Gemini, Grok, Llama) head-to-head on accuracy, speed, and cost across 50 standardized words.
Deterministic reliability lab focused on failure modes, invariants, and recovery boundaries in infrastructure systems.