🍩 Ralph Wiggum Loop for Antigravity IDE

"I'm a unitard!" — Ralph Wiggum

An experimental implementation of the "Ralph Wiggum Loop" pattern for the Antigravity IDE.

This project demonstrates how to achieve autonomous, stateless agent execution by driving the IDE via the Chrome DevTools Protocol (CDP).

🧠 The Concept

The "Ralph Wiggum Loop" is a design pattern for autonomous agents:

Fresh Context: Examples, history, and "learnings" are cleared after every single task.
External Memory: State is persisted only in file artifacts (e.g. TASKS.json or task.md), not in the LLM's context window.
"The Gutter" Avoidance: By resetting the environment consistently, we prevent the degradation of reasoning that happens in long-running chat sessions.

Caution

SPEC = REQUIREMENT, NOT INSPIRATION
Every field defined in CONTEXT.json is a mandatory contract. The driver now injects model requirements with enforcement language. Agents that skip or rename fields without updating the spec have FAILED the task.

🏗️ Architecture (V3 - Modular)

graph TD
    User[User Goal] -->|Orchestrator| Plan[TASKS.json]
    Plan --> Driver[Wiggum Driver]
    
    subgraph "Ralph Wiggum Loop"
        Driver -->|Poll| Selector[Task Selector]
        Selector -->|Next Task| Builder[Prompt Builder]
        Builder -->|Inject| IDE[Antigravity IDE]
        IDE -->|Action| App[App Implementation]
        
        App -.->|Verify| QA[QA Verifier]
        QA -->|Unit Test| Layer1[Test Runner]
        QA -->|Visual| Layer2[Visual Checker]
        
        QA -->|Result| Driver
        IDE -->|Usage| Monitor[Progress Monitor]
        Monitor -->|Complete| Driver
        Driver -->|Rotate| Reset[Reset Handler]
    end
    
    subgraph "Outputs"
        Driver --> Log[errors.log]
        Driver --> Metrics[LLM Counter]
        QA --> Report[qa_report.json]
    end

The codebase is organized into composable packages:

├── wiggum_driver.py      # Main orchestrator (Driver)
├── orchestrator.py       # Plan generator (User Goal -> TASKS.json)
├── test_runner.sh        # Local test executor
├── loop/                 # Core loop components
│   ├── cdp_client.py     # Chrome DevTools Protocol communication
│   ├── task_selector.py  # Task parsing and dependency management
│   ├── task_parser.py    # JSON parsing and status tracking
│   ├── prompt_builder.py # Context injection
│   ├── progress_monitor.py # Completion polling & timestamping
│   └── reset_handler.py  # Context rotation (blast shield)
├── qa/                   # QA verification layers
│   ├── config.py         # QA Configuration
│   ├── schema_checker.py # Layer 1: Test existence
│   ├── test_runner.py    # Layer 2: Test execution (supports mocks)
│   └── visual_checker.py # Layer 3: Rule-based + Optional LLM check
├── qa_verification.py    # QA orchestrator (Result Aggregator)
├── tests/                # 56+ unit/integration tests
└── .agent/               # Project specs & logs
    ├── TASKS.json        # Executable task list
    ├── CONTEXT.json      # Model schemas and rules
    ├── task.md           # Progress scoreboard
    ├── errors.log        # Driver error log
    └── qa_report.json    # QA Verification Report

Key Components

Package	Purpose
`loop/`	Modular components for autonomous agent execution
`qa/`	3-layer verification (test existence → execution → visual)
`tests/`	56 pytest tests covering core logic

Agent Workflows

Command	Agent	Output
`/spec`	System Architect	`CONTEXT.json`, `TASKS.json`
`/ux`	UX Designer	`UI_SPEC.json`, `BRAND_BOOK.md`
`/ux_validation`	UX Validator	`UX_VALIDATION_REPORT.md`
`/ralph_mode`	Task Executor	Implements tasks from TASKS.json
`/ralph_mode`	Task Executor	Implements tasks from TASKS.json

Workflow Triggers

sequenceDiagram
    participant User
    participant Agent
    participant FileSys as .agent/

    User->>Agent: /spec "Build a Flight Tracker"
    Agent->>FileSys: Generates TASKS.json & CONTEXT.json
    
    User->>Agent: /ux "Design a Flight Tracker"
    Agent->>FileSys: Reads CONTEXT.json
    Agent->>FileSys: Generates UI_SPEC.json & BRAND_BOOK.md
    
    User->>Agent: /ralph_mode
    Agent->>FileSys: Reads all specs
    loop Execution Loop
        Agent->>Agent: Picks Next Task
        Agent->>Agent: Implements Code
        Agent->>Agent: Runs QA
        Agent->>FileSys: Updates task.md [x]
    end

🚀 Getting Started

Prerequisites

Python 3.8+
Antigravity IDE (or compatible fork)
pip install -r requirements.txt

Usage

Launch Antigravity with Remote Debugging
```
./launch_ide.sh
```
(This ensures the IDE listens on port 9000)
Define your Specs (The "Modern" Way) Use the /spec command to generate your project plan. Then copy the key files:
```
mkdir -p .agent
cp /path/to/spec/TASKS.json .agent/
cp /path/to/spec/CONTEXT.json .agent/
```
Alternatively (Legacy Mode), you can just edit .agent/task.md manually.

Run the Driver

python3 wiggum_driver.py

Alternative: Generate Plans

python3 orchestrator.py "Build a todo app with React"

Run Tests
```
./test_runner.sh all
```

How the Driver Works

The driver prioritizes sources in this order:

.agent/TASKS.json (Local Project Spec) - Preferred
.agent/task.md (Legacy/Fallback)

It autonomously:

Reads the next incomplete task from JSON.
Injects Context: Adds globally relevant rules (from CONTEXT.json) and specific verification steps into the prompt.
Sends the prompt to the IDE.
Waits for the agent to mark the task as [x] in task.md.
Rotates Context (reloads page) to keep the agent fresh.

⚠️ Safety Notes

Turbo Mode: For full autonomy, set your IDE Review Policy to "Always Proceed".
Sandboxing: Ensure your IDE has a "Terminal Command Deny List" enabled (e.g., blocking rm -rf) since the agent runs autonomously.

🛠️ Applying to Other Projects

To use this on a real project (e.g., your web app):

Copy Files: Copy wiggum_driver.py, launch_ide.sh, and requirements.txt to your project root.
Create Folders: Make sure .agent/ and .agent/workflows/ exist.
Copy Workflow: Copy ralph_mode.md into .agent/workflows/.
Add Specs: Place your TASKS.json and CONTEXT.json in .agent/.
Launch: Run ./launch_ide.sh and then python3 wiggum_driver.py.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.agent		.agent
.github/workflows		.github/workflows
loop		loop
qa		qa
templates		templates
tests		tests
.gitignore		.gitignore
QA_PROCESS.md		QA_PROCESS.md
README.md		README.md
launch_ide.sh		launch_ide.sh
orchestrator.py		orchestrator.py
qa_verification.py		qa_verification.py
ralphs_loop_gap_analysis.md		ralphs_loop_gap_analysis.md
requirements.txt		requirements.txt
system_architect.md		system_architect.md
system_brand_designer.md		system_brand_designer.md
test_runner.sh		test_runner.sh
wiggum_driver.py		wiggum_driver.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍩 Ralph Wiggum Loop for Antigravity IDE

🧠 The Concept

🏗️ Architecture (V3 - Modular)

Key Components

Agent Workflows

Workflow Triggers

🚀 Getting Started

Prerequisites

Usage

How the Driver Works

⚠️ Safety Notes

🛠️ Applying to Other Projects

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🍩 Ralph Wiggum Loop for Antigravity IDE

🧠 The Concept

🏗️ Architecture (V3 - Modular)

Key Components

Agent Workflows

Workflow Triggers

🚀 Getting Started

Prerequisites

Usage

How the Driver Works

⚠️ Safety Notes

🛠️ Applying to Other Projects

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages