Skip to content

Commit 9e532f3

Browse files
jahoomaclaude
andcommitted
evalbuff: add carve-based eval pipeline (delete & rebuild)
New approach to evals that carves features out of the current codebase and has agents rebuild them, instead of replaying git commits. Uses OpenAI SDK (gpt-5.4) to identify and surgically remove features, then runs agents in parallel to rebuild from a natural prompt, judges against original code, and iterates on docs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent d1bd453 commit 9e532f3

File tree

4 files changed

+1151
-0
lines changed

4 files changed

+1151
-0
lines changed

bun.lock

Lines changed: 3 additions & 0 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

evalbuff/package.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,7 @@
1818
"@codebuff/common": "workspace:*",
1919
"@codebuff/sdk": "workspace:*",
2020
"ai": "^5.0.0",
21+
"openai": "^6.33.0",
2122
"zod": "^4.2.1"
2223
}
2324
}

0 commit comments

Comments
 (0)