docs: fix SDK API references to match actual v0.2.8 code #146

Open
vamsimanohar wants to merge 3 commits into opensearch-project:main from vamsimanohar:docs/fix-sdk-api-references

@vamsimanohar
Member

Summary

Cross-checked all SDK documentation against the actual opensearch-genai-observability-sdk-py v0.2.8 source code and fixed multiple discrepancies:

  • Experiment -> Benchmark: The SDK class is Benchmark, not Experiment. Updated all references across 5 doc pages (python.mdx, evaluation.mdx, evaluation-integrations.mdx, getting-started.mdx, index.md)
  • Fixed result type names: ExperimentSummary -> BenchmarkSummary, CaseResult -> TestCaseResult
  • Added missing exports to API table: BenchmarkResult, BenchmarkSummary, TestCaseResult, ScoreSummary
  • Fixed auto-instrumentation extras table: Removed the individual extras ([cohere], [mistral], [groq], [ollama]), which do not exist; those instrumentors are only available through the [otel-instrumentors] bundle. Added the missing [google] extra.
  • Added missing env vars: OTEL_EXPORTER_OTLP_TRACES_PROTOCOL and OTEL_EXPORTER_OTLP_PROTOCOL
  • Improved Benchmark docs: Added a constructor parameters table and a manual (non-context-manager) usage example
  • Minor fixes: Showed the actual default endpoint URL and fixed variable names in the integration examples
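
For readers unfamiliar with the two protocol variables named above, here is a minimal sketch of the precedence rule from the OpenTelemetry exporter specification: the traces-specific variable wins over the generic one. The helper name `resolve_traces_protocol` and the `http/protobuf` fallback are assumptions for illustration; this PR does not state which default the SDK applies, and the SDK's own resolution logic may differ.

```python
import os

# Protocol values defined by the OpenTelemetry OTLP exporter specification.
VALID_PROTOCOLS = {"grpc", "http/protobuf", "http/json"}


def resolve_traces_protocol(env=None, default="http/protobuf"):
    """Pick the OTLP protocol for traces (hypothetical helper, not SDK code).

    The signal-specific variable is checked first, then the generic one.
    `default` is an assumption; the SDK's real default is not given in the PR.
    """
    env = os.environ if env is None else env
    for name in ("OTEL_EXPORTER_OTLP_TRACES_PROTOCOL", "OTEL_EXPORTER_OTLP_PROTOCOL"):
        value = env.get(name, "").strip().lower()
        if value:
            if value not in VALID_PROTOCOLS:
                raise ValueError(f"{name}={value!r} is not a known OTLP protocol")
            return value
    return default


# The traces-specific setting overrides the generic one.
chosen = resolve_traces_protocol(
    {
        "OTEL_EXPORTER_OTLP_PROTOCOL": "grpc",
        "OTEL_EXPORTER_OTLP_TRACES_PROTOCOL": "http/protobuf",
    }
)
print(chosen)
```

Passing a plain dict instead of reading `os.environ` directly keeps the sketch easy to exercise without touching the real process environment.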

Verification

  • Read all SDK source files in opensearch-genai-observability-sdk-py v0.2.8
  • Cross-checked every API signature, parameter, class name, and export against actual code
  • Verified pyproject.toml for available extras
  • Docs build passes with all internal links valid

Test plan

  • npm run build in docs/starlight-docs/ passes (validates all internal links)
  • Visual review of changed pages

@codecov

codecov bot commented Mar 30, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 18.51%. Comparing base (ea76e11) to head (a748b09).

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #146   +/-   ##
=======================================
  Coverage   18.51%   18.51%           
=======================================
  Files           3        3           
  Lines          54       54           
  Branches       19       18    -1     
=======================================
  Hits           10       10           
  Misses         44       44           


Cross-checked all SDK documentation against the actual
opensearch-genai-observability-sdk-py v0.2.8 source code and fixed
multiple discrepancies:

- Rename `Experiment` to `Benchmark` throughout (the SDK class is
  `Benchmark`, not `Experiment`)
- Fix `ExperimentSummary` -> `BenchmarkSummary`, `CaseResult` ->
  `TestCaseResult` in result type references
- Add missing exports to API overview table: `BenchmarkResult`,
  `BenchmarkSummary`, `TestCaseResult`, `ScoreSummary`
- Remove non-existent individual extras (`[cohere]`, `[mistral]`,
  `[groq]`, `[ollama]`) from auto-instrumentation table - these are
  only available via `[otel-instrumentors]` bundle
- Add missing `[google]` extra to installation section
- Add missing env vars `OTEL_EXPORTER_OTLP_TRACES_PROTOCOL` and
  `OTEL_EXPORTER_OTLP_PROTOCOL` to environment variables table
- Show actual default endpoint URL instead of "Data Prepper default"
- Add `Benchmark` constructor parameters documentation
- Add manual (non-context-manager) usage example for `Benchmark`
- Fix evaluation-integrations.mdx variable names for consistency

Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>
vamsimanohar force-pushed the docs/fix-sdk-api-references branch from 0395cd0 to 0da8a02 on March 30, 2026 19:02
evaluation-integrations.mdx:
- Add version caveat Aside for third-party framework APIs
- Add install commands (pip install) for each section
- DeepEval: add tip about deepeval.evaluate() alternative
- RAGAS: add version note about v0.2+ API changes, fix iteration
  to use df.iterrows() instead of enumerate(itertuples())
- MLflow: fix to use DataFrame input, add mlflow.start_run() context,
  add version note about model_type deprecation in 2.12+
- pytest: move register() to conftest.py session-scoped fixture to
  avoid re-initialization across test files, add install command,
  rename scorer to avoid shadowing builtin
- Fix related links: restore "Experiments" in link text to match
  actual page title at /docs/agent-health/evaluations/experiments/

integrations.mdx:
- CrewAI: suggest [otel-instrumentors] extra which includes CrewAI
  auto-instrumentation, add note about what it provides
- OpenAI Agents SDK: add import comment clarifying package name

Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>
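
The RAGAS item in the commit above switches iteration from `enumerate(itertuples())` to `df.iterrows()`. The commit does not say why, so the following is only an illustrative contrast of what each pandas call yields, using a made-up DataFrame. The column names and index labels are hypothetical and are not RAGAS output.

```python
import pandas as pd

# Hypothetical evaluation results; column names are illustrative only.
df = pd.DataFrame(
    {"question": ["q1", "q2"], "answer relevancy": [0.9, 0.4]},
    index=["case-a", "case-b"],
)

# iterrows() yields (index label, Series); columns are read by exact name.
rows = [
    (label, row["question"], float(row["answer relevancy"]))
    for label, row in df.iterrows()
]

# itertuples() yields namedtuples. A column name that is not a valid Python
# identifier is replaced by a positional field name.
first = next(df.itertuples())
fields = first._fields

# enumerate() adds a positional counter, which is not the DataFrame index.
positions = [i for i, _ in enumerate(df.itertuples())]
```

With `iterrows()`, each row keeps the original column names and the loop variable is the DataFrame's own index label, which is convenient when scores need to be keyed by metric name.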
Address review comment — keep related links consistent with evaluation.mdx.

Signed-off-by: Vamshi Vijay Nakkirtha <vamsimanohar@gmail.com>

Signed-off-by: Vamsi Manohar <reddyvam@amazon.com>