Uroboros is an autonomous software engineering system capable of recursive self-improvement. It implements the "Adversarial Co-Evolution" paradigm, where a Builder Agent (Actor) and a Tester Agent (Adversary) compete in an infinite loop to generate robust, verified code.
The name comes from the ouroboros, an ancient symbol of a serpent devouring its own tail, which mirrors the core behavior of this system: code that writes and critiques itself in a recursive loop.
The system operates on the Uroboros Loop:
- Actor (The Builder): Generates code solutions and tools. It uses Voyager-style Memory (Vector DB) to retrieve past skills and avoid repeating mistakes.
- Adversary (The Critic): Generates "Killer Tests" designed to break the Actor's code. It targets edge cases, boundary conditions, and logic flaws.
- Arbiter (The Judge): Runs the code and tests in a secure, isolated Firecracker MicroVM (via E2B). It provides the ground truth signal (Pass/Fail/Crash).
- Evolver (The Optimizer): Analyzes failure patterns and rewrites the system prompts to improve future performance.
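The four roles above can be sketched in miniature. All four functions here are hypothetical stand-ins for illustration, not the actual Uroboros implementation:

```python
# Minimal sketch of the Uroboros Loop with stubbed-out agents.
# Real implementations call LLMs and a sandbox; these stubs just
# illustrate the control flow: build -> attack -> judge -> evolve.

def actor(task, memory):
    # Builder: retrieve past skills, then generate a candidate solution.
    return f"# solution for: {task} (using {len(memory)} memories)"

def adversary(code):
    # Critic: generate a "killer test" targeting the candidate code.
    return f"assert solution_works()  # attack on: {code[:20]}"

def arbiter(code, test):
    # Judge: execute code + test in a sandbox; here, always "pass".
    return "pass"

def evolver(history):
    # Optimizer: inspect failures and adjust prompts (no-op in this sketch).
    return {"failures": sum(1 for v in history if v != "pass")}

def uroboros_loop(task, iterations=3):
    memory, history = [], []
    for _ in range(iterations):
        code = actor(task, memory)
        test = adversary(code)
        verdict = arbiter(code, test)
        history.append(verdict)
        if verdict == "pass":
            memory.append(code)  # store the verified skill
    return evolver(history), memory

stats, memory = uroboros_loop("reverse a string")
print(stats)        # {'failures': 0}
print(len(memory))  # 3
```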
- Python 3.11+
- Poetry (Dependency Manager)
- Docker (Optional, for containerized runs)
- API Keys: OpenAI (gpt-5-mini recommended) and E2B.
# Clone the repository
git clone git@github.com:renbytes/uroboros.git
cd uroboros
# Install dependencies via Poetry
# Note: This installs numpy<2.0.0 to ensure ChromaDB compatibility
poetry install
Copy the template and add your secrets.
cp .env.example .env
Required .env variables:
OPENAI_API_KEY=sk-...
E2B_API_KEY=e2b_...
ACTOR_MODEL=gpt-5-mini
ADVERSARY_MODEL=gpt-5-mini
DEBUG=true # Set to 'true' to save full debugging artifacts
Run this script to ensure your E2B sandbox connection is working:
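For illustration, a minimal parser for this KEY=VALUE format might look like the following. This is a sketch only; the real project presumably loads .env via a library such as python-dotenv:

```python
def parse_env(text):
    """Parse simple KEY=VALUE lines, ignoring blank lines, full-line
    comments, and anything after an inline ' #' comment."""
    env = {}
    for line in text.splitlines():
        line = line.split(" #", 1)[0].strip()  # drop inline comments
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip()
    return env

sample = """
OPENAI_API_KEY=sk-...
E2B_API_KEY=e2b_...
ACTOR_MODEL=gpt-5-mini
DEBUG=true # Set to 'true' to save full debugging artifacts
"""
config = parse_env(sample)
print(config["ACTOR_MODEL"])  # gpt-5-mini
print(config["DEBUG"])        # true
```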
poetry run python scripts/smoke_test.py
Expected Output: 🎉 Infrastructure is HEALTHY.
To assign a specific coding challenge to the agent:
poetry run python -m uroboros.main --task "Write a Flask API with a /users endpoint backed by SQLite."
To let the agent generate its own curriculum and evolve indefinitely:
poetry run python -m uroboros.main --loopIf DEBUG=true is set in your .env, the system saves detailed artifacts for every step of the loop in:
data/intermediate_debugging/<task_id>/
Files generated:
- _task_definition.txt: What the agent was asked to do.
- _actor_reasoning.md: The Builder's internal monologue.
- _actor_generated_code_X.py: The raw code patches.
- _adversary_attack_plan.md: The logic behind the attack.
- _adversary_test_code_X.py: The generated "Killer Tests".
- _attempt_X_failure_log.log: Combined stdout/stderr from the sandbox failure.
Cause: The test file tries to import the solution file, but Python's path isn't set correctly in the VM.
Fix: The Arbiter now runs tests using python -m pytest ., which adds the current directory to sys.path.
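The effect of `python -m pytest .` can also be reproduced manually. This illustrative snippet shows the equivalent sys.path adjustment a test file could apply itself:

```python
import os
import sys

# Running `python -m pytest .` prepends the current directory to
# sys.path, so `import solution` resolves to ./solution.py.
# A test file can apply the same fix explicitly:
sys.path.insert(0, os.getcwd())

# The solution module's directory is now importable.
print(os.getcwd() in sys.path)  # True
```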
Cause: The LLM included Markdown fences (```python) or conversational text inside the code file.
Fix: The clean_code_block utility now strips markdown, and prompts explicitly forbid conversational filler in the content field.
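A utility like `clean_code_block` could be sketched as follows (a hypothetical reimplementation for illustration, not the project's actual code):

```python
import re

def clean_code_block(text: str) -> str:
    """Strip Markdown code fences (``` or ```python) from LLM output,
    returning only the code inside."""
    # Prefer the contents of the first fenced block, if one exists.
    match = re.search(r"```[a-zA-Z0-9_+-]*\n(.*?)```", text, re.DOTALL)
    if match:
        return match.group(1).strip()
    # Otherwise, drop any stray fence lines.
    lines = [ln for ln in text.splitlines() if not ln.strip().startswith("```")]
    return "\n".join(lines).strip()

raw = "Here is the code:\n```python\nprint('hello')\n```\nHope this helps!"
print(clean_code_block(raw))  # print('hello')
```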
Cause: Incompatibility between ChromaDB (v0.4.x) and NumPy 2.0.
Fix: pyproject.toml pins numpy = "<2.0.0". Run poetry lock && poetry install if you see this.
Cause: Using an older model (like gpt-4-turbo or gpt-3.5) that doesn't support "Structured Outputs" (json_schema).
Fix: Set ACTOR_MODEL to a model that supports Structured Outputs (e.g., the recommended gpt-5-mini) in your .env.
Cause: Using the synchronous Sandbox class instead of AsyncSandbox or using incorrect SDK v1 syntax.
Fix: The codebase now strictly uses AsyncSandbox.create() and sandbox.commands.run().
To run the internal unit and integration tests for the agent framework itself:
# Run all tests
poetry run pytest
# Run model connectivity check
poetry run pytest tests/integration/test_llm_connectivity.py
MIT License.
