docs: fix SDK API references to match actual v0.2.8 code by vamsimanohar · Pull Request #146 · opensearch-project/observability-stack

vamsimanohar · 2026-03-30T18:25:41Z

Summary

Cross-checked all SDK documentation against the actual opensearch-genai-observability-sdk-py v0.2.8 source code and fixed multiple discrepancies:

Experiment -> Benchmark: The SDK class is Benchmark, not Experiment. Updated all references across 5 doc pages (python.mdx, evaluation.mdx, evaluation-integrations.mdx, getting-started.mdx, index.md)
Fixed result type names: ExperimentSummary -> BenchmarkSummary, CaseResult -> TestCaseResult
Added missing exports to API table: BenchmarkResult, BenchmarkSummary, TestCaseResult, ScoreSummary
Fixed auto-instrumentation extras table: Removed non-existent individual extras ([cohere], [mistral], [groq], [ollama]) that are only available via [otel-instrumentors] bundle. Added missing [google] extra.
Added missing env vars: OTEL_EXPORTER_OTLP_TRACES_PROTOCOL and OTEL_EXPORTER_OTLP_PROTOCOL
Improved Benchmark docs: Added constructor parameters table, manual (non-context-manager) usage example
Minor fixes: Show actual default endpoint URL, fix variable names in integration examples

Verification

Read all SDK source files in opensearch-genai-observability-sdk-py v0.2.8
Cross-checked every API signature, parameter, class name, and export against actual code
Verified pyproject.toml for available extras
Docs build passes with all internal links valid

Test plan

npm run build in docs/starlight-docs/ passes (validates all internal links)
Visual review of changed pages

codecov · 2026-03-30T18:27:26Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 18.51%. Comparing base (ea76e11) to head (a748b09).

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #146   +/-   ##
=======================================
  Coverage   18.51%   18.51%           
=======================================
  Files           3        3           
  Lines          54       54           
  Branches       19       18    -1     
=======================================
  Hits           10       10           
  Misses         44       44

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Cross-checked all SDK documentation against the actual opensearch-genai-observability-sdk-py v0.2.8 source code and fixed multiple discrepancies: - Rename `Experiment` to `Benchmark` throughout (the SDK class is `Benchmark`, not `Experiment`) - Fix `ExperimentSummary` -> `BenchmarkSummary`, `CaseResult` -> `TestCaseResult` in result type references - Add missing exports to API overview table: `BenchmarkResult`, `BenchmarkSummary`, `TestCaseResult`, `ScoreSummary` - Remove non-existent individual extras (`[cohere]`, `[mistral]`, `[groq]`, `[ollama]`) from auto-instrumentation table - these are only available via `[otel-instrumentors]` bundle - Add missing `[google]` extra to installation section - Add missing env vars `OTEL_EXPORTER_OTLP_TRACES_PROTOCOL` and `OTEL_EXPORTER_OTLP_PROTOCOL` to environment variables table - Show actual default endpoint URL instead of "Data Prepper default" - Add `Benchmark` constructor parameters documentation - Add manual (non-context-manager) usage example for `Benchmark` - Fix evaluation-integrations.mdx variable names for consistency Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>

evaluation-integrations.mdx: - Add version caveat Aside for third-party framework APIs - Add install commands (pip install) for each section - DeepEval: add tip about deepeval.evaluate() alternative - RAGAS: add version note about v0.2+ API changes, fix iteration to use df.iterrows() instead of enumerate(itertuples()) - MLflow: fix to use DataFrame input, add mlflow.start_run() context, add version note about model_type deprecation in 2.12+ - pytest: move register() to conftest.py session-scoped fixture to avoid re-initialization across test files, add install command, rename scorer to avoid shadowing builtin - Fix related links: restore "Experiments" in link text to match actual page title at /docs/agent-health/evaluations/experiments/ integrations.mdx: - CrewAI: suggest [otel-instrumentors] extra which includes CrewAI auto-instrumentation, add note about what it provides - OpenAI Agents SDK: add import comment clarifying package name Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>

docs/starlight-docs/src/content/docs/ai-observability/evaluation-integrations.mdx

Address review comment — keep related links consistent with evaluation.mdx. Signed-off-by: Vamshi Vijay Nakkirtha <vamsimanohar@gmail.com> Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>

vamsimanohar requested review from anirudha, goyamegh, kylehounslow and ps48 as code owners March 30, 2026 18:25

vamsimanohar force-pushed the docs/fix-sdk-api-references branch from 0395cd0 to 0da8a02 Compare March 30, 2026 19:02

kylehounslow approved these changes Mar 31, 2026

View reviewed changes

docs/starlight-docs/src/content/docs/ai-observability/evaluation-integrations.mdx Outdated Show resolved Hide resolved

docs: remove agent-health experiments link from evaluation-integrations

a748b09

Address review comment — keep related links consistent with evaluation.mdx. Signed-off-by: Vamshi Vijay Nakkirtha <vamsimanohar@gmail.com> Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: fix SDK API references to match actual v0.2.8 code#146

docs: fix SDK API references to match actual v0.2.8 code#146
vamsimanohar wants to merge 3 commits intoopensearch-project:mainfrom
vamsimanohar:docs/fix-sdk-api-references

vamsimanohar commented Mar 30, 2026

Uh oh!

codecov bot commented Mar 30, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vamsimanohar commented Mar 30, 2026

Summary

Verification

Test plan

Uh oh!

codecov bot commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Mar 30, 2026 •

edited

Loading