Skip to content

chore(multi-turn): remove checkTokenProgression from multi-turn LLM test and update README#123

Merged
constantinius merged 1 commit intomainfrom
constantinius/chore/remove-token-progression-check
Apr 2, 2026
Merged

chore(multi-turn): remove checkTokenProgression from multi-turn LLM test and update README#123
constantinius merged 1 commit intomainfrom
constantinius/chore/remove-token-progression-check

Conversation

@constantinius
Copy link
Copy Markdown
Collaborator

Closes #101

Copy link
Copy Markdown

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Fix All in Cursor

Bugbot Autofix prepared a fix for the issue found in the latest run.

  • ✅ Fixed: Unused imports left after removing checkTokenProgression
    • I removed the now-unused imports from multi-turn.ts and confirmed the project still builds successfully.

Create PR

Or push these changes by commenting:

@cursor push ea4e951dca
Preview (ea4e951dca)
diff --git a/src/test-cases/llm/multi-turn.ts b/src/test-cases/llm/multi-turn.ts
--- a/src/test-cases/llm/multi-turn.ts
+++ b/src/test-cases/llm/multi-turn.ts
@@ -5,7 +5,7 @@
  * Validates that Sentry captures multiple gen_ai spans correctly.
  */
 
-import { TestDefinition, Check, ErrorLocation } from "../../types.js";
+import { TestDefinition } from "../../types.js";
 import {
   checkAISpanCount,
   checkChatSpanAttributes,
@@ -15,10 +15,7 @@
   checkInputMessagesSchema,
   checkResponseModel,
 } from "../checks.js";
-import { extractGenAISpans, skipIf } from "../utils.js";
-import { CheckError } from "../../validator.js";
 
-
 export const multiTurnLLMTest: TestDefinition = {
   name: "Multi-Turn LLM Test",
   description: "Multi-turn conversation with back-and-forth exchanges",
@@ -61,15 +58,9 @@
     },
   ],
 
-  criticalChecks: [
-    checkAISpanCount(3),
-    checkChatSpanAttributes,
-  ],
+  criticalChecks: [checkAISpanCount(3), checkChatSpanAttributes],
 
-  checks: [
-    checkValidTokenUsage,
-    checkInputMessagesSchema,
-  ],
+  checks: [checkValidTokenUsage, checkInputMessagesSchema],
 
   warningChecks: [
     checkResponseModel,

This Bugbot Autofix run was free. To enable autofix for future PRs, go to the Cursor dashboard.

import { extractGenAISpans, skipIf } from "../utils.js";
import { CheckError } from "../../validator.js";

/**
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unused imports left after removing checkTokenProgression

Low Severity

After removing checkTokenProgression, the imports Check, ErrorLocation (from ../../types.js), extractGenAISpans, skipIf (from ../utils.js), and CheckError (from ../../validator.js) are no longer used anywhere in the file. These are leftover artifacts of the deleted function.

Additional Locations (2)
Fix in Cursor Fix in Web

@github-actions
Copy link
Copy Markdown

🔴 AI SDK Integration Test Results

Status: 2 regressions detected

Summary

Metric main PR Change
Total Tests 576 576
Passed 400 402 +2 ✅
Failed 148 146 -2 ✅

🔴 Regressions

These tests were passing on main but are now failing:

browser/openai :: Multi-Turn LLM Test (blocking)

Error: Browser test timed out (60s)

Browser test timed out (60s)
cloudflare/google-genai :: Vision LLM Test (blocking)

Error: 3 check(s) failed:

3 check(s) failed:
Should have at least one chat/completion span
Should have at least one chat or agent span
Should have at least one chat or agent span

✅ Fixed

These tests were failing on main but are now passing:

  • python/google-genai :: Multi-Turn LLM Test (sync, streaming)
  • python/google-genai :: Multi-Turn LLM Test (sync, blocking)
  • python/google-genai :: Multi-Turn LLM Test (async, streaming)
  • python/google-genai :: Multi-Turn LLM Test (async, blocking)

Test Matrix

Agent Tests

SDK Basic Agent Test Conversation ID Agent Test Long Input Agent Test Tool Call Agent Test Tool Error Agent Test Vision Agent Test
browser/langgraph blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain blk, combinedblk, compiledblk, custom-stateblk, graphblk, langchainstr, combinedstr, compiledstr, custom-statestr, graphstr, langchain
cloudflare/langgraph
cloudflare/vercel
nextjs/mastra
nextjs/vercel blkstr blkstr blkstr blkstr blkstr blkstr
node/langgraph
node/manual
node/mastra
node/vercel
php/laravel blkstr blkstr blkstr blkstr blkstr blkstr
python/langgraph as as as as as as
python/manual as as as as as as
python/openai-agents
python/pydantic-ai a, fallbacka, single a, fallbacka, single a, fallbacka, single a, fallbacka, single a, fallbacka, single a, fallbacka, single

Embedding Tests

SDK Basic Embeddings Test
browser/google-genai
browser/langchain
browser/openai
cloudflare/google-genai
cloudflare/langchain
cloudflare/openai
cloudflare/vercel
nextjs/google-genai
nextjs/langchain
nextjs/openai
nextjs/vercel
node/google-genai
node/langchain
node/openai
node/vercel
php/laravel
python/google-genai a, blks, blk
python/langchain a, blks, blk
python/litellm a, blks, blk
python/manual a, blks, blk
python/openai a, blks, blk

LLM Tests

SDK Basic Error LLM Test Basic LLM Test Conversation ID LLM Test Long Input LLM Test Multi-Turn LLM Test Vision LLM Test
browser/anthropic blkstr blkstr blkstr blkstr blkstr blkstr
browser/google-genai blkstr blkstr blkstr blkstr blkstr blkstr
browser/langchain blkstr blkstr blkstr blkstr blkstr blkstr
browser/openai blkstr blkstr blkstr blkstr ❌📉blkstr blkstr
cloudflare/anthropic blkstr blkstr blkstr blkstr blkstr blkstr
cloudflare/google-genai blkstr blkstr blkstr blkstr blkstr ❌📉blkstr
cloudflare/langchain blkstr blkstr blkstr blkstr blkstr blkstr
cloudflare/openai blkstr blkstr blkstr blkstr blkstr blkstr
nextjs/anthropic blkstr blkstr blkstr blkstr blkstr blkstr
nextjs/google-genai blkstr blkstr blkstr blkstr blkstr blkstr
nextjs/langchain blkstr blkstr blkstr blkstr blkstr blkstr
nextjs/openai blkstr blkstr blkstr blkstr blkstr blkstr
node/anthropic blkstr blkstr blkstr blkstr blkstr blkstr
node/google-genai blkstr blkstr blkstr blkstr blkstr blkstr
node/langchain blkstr blkstr blkstr blkstr blkstr blkstr
node/manual
node/openai blkstr blkstr blkstr blkstr blkstr blkstr
python/anthropic a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str
python/cohere s, blk, v1s, blk, v2s, str, v1s, str, v2 s, blk, v1s, blk, v2s, str, v1s, str, v2 s, blk, v1s, blk, v2s, str, v1s, str, v2 s, blk, v1s, blk, v2s, str, v1s, str, v2
python/google-genai a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str ✅🔧a, blk ✅🔧a, str ✅🔧s, blk ✅🔧s, str a, blka, strs, blks, str
python/langchain a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str
python/litellm a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str
python/manual a, blks, blk a, blks, blk a, blks, blk a, blks, blk a, blks, blk
python/openai a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str a, blka, strs, blks, str

MCP Tests

SDK Basic MCP Tool Call Test MCP Multiple Tool Calls Test MCP Prompt Get Test MCP Resource Read Test MCP Tool Error Test
node/mcp sseio sseio sseio sseio sseio
python/fastmcp a, blk, ssea, blk, io a, blk, ssea, blk, io a, blk, ssea, blk, io a, blk, ssea, blk, io a, blk, ssea, blk, io
python/mcp a, blk, sse, hia, blk, sse, loa, blk, io, hia, blk, io, lo a, blk, sse, hia, blk, sse, loa, blk, io, hia, blk, io, lo a, blk, sse, hia, blk, sse, loa, blk, io, hia, blk, io, lo a, blk, sse, hia, blk, sse, loa, blk, io, hia, blk, io, lo a, blk, sse, hia, blk, sse, loa, blk, io, hia, blk, io, lo

Legend: ✅ Pass | ❌ Fail | ✅🔧 Fixed | ❌📉 Regressed | ✅🆕 New (pass) | ❌🆕 New (fail) | 🗑️ Removed | str=streaming blk=blocking a=async s=sync io=stdio sse=sse hi=highlevel lo=lowlevel


Generated by AI SDK Integration Tests

@constantinius constantinius merged commit de7ed22 into main Apr 2, 2026
10 of 12 checks passed
@constantinius constantinius deleted the constantinius/chore/remove-token-progression-check branch April 2, 2026 13:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Turn 3 input tokens (14) should be greater than turn 2 (16)

1 participant