Fix NVBandwidth benchmark results parsing bug (#748) by NJX-njx · Pull Request #782 · microsoft/superbenchmark

NJX-njx · 2026-03-03T14:11:47Z

Summary

Fixes #748 - NVBandwidth benchmark results parsing bug causing invalid metric names like _sum_None\ and \device_to_device_latency_sm_sum_None.

Root Cause

When parsing SUM lines, \parse_status['test_name']\ or \parse_status['benchmark_type']\ could be empty/None (e.g., after waived tests or when header parsing fails), leading to invalid metric keys.

Changes

Add validation: only add SUM result when both \ est_name\ and \�enchmark_type\ are valid
Use test name from SUM line (\match.group(1)) as fallback when \parse_status\ is stale
Infer \�enchmark_type\ from test name when not set: 'latency' in name -> 'lat', else 'bw'

Testing

Logic verified against \ ests/data/nvbandwidth_results.log\ format
Unit tests require CUDA platform (CI will run on Linux)

…ency_sm_sum_None - Add validation before adding SUM results: only add when test_name and benchmark_type are valid - Use test name from SUM line as fallback when parse_status is stale (e.g., after waived tests) - Infer benchmark_type from test name when not set (latency tests -> lat, else bw) Fixes microsoft#748 Made-with: Cursor

Copilot

Pull request overview

Fixes NVBandwidth SUM-line parsing to prevent invalid metric keys (e.g., _sum_None) by making SUM parsing more resilient when parsing state is missing or stale.

Changes:

Use the SUM line’s captured test name as a fallback when parse_status['test_name'] is empty.
Infer benchmark_type from the test name when it wasn’t detected from the matrix header.
Guard SUM metric emission to only occur when both test_name and benchmark_type are valid.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-03T14:15:10Z

superbench/benchmarks/micro_benchmarks/nvbandwidth.py

        if self.re_summary_pattern.match(line):
-            value = self.re_summary_pattern.match(line).group(2)
-            test_name = parse_status['test_name']
+            match = self.re_summary_pattern.match(line)
+            value = match.group(2)


re_summary_pattern.match(line) is evaluated twice (once in the if and again to assign match). Consider assigning once (e.g., match = ...; if match:) to avoid duplicated work and keep the control flow simpler.

Copilot · 2026-03-03T14:15:10Z

superbench/benchmarks/micro_benchmarks/nvbandwidth.py

+            # Use test_name from parse_status, fallback to group(1) from SUM line
+            test_name = parse_status['test_name'] or match.group(1).lower()
            benchmark_type = parse_status['benchmark_type']
-            parse_status['results'][f'{test_name}_sum_{benchmark_type}'] = float(value)
+            # Infer benchmark_type from test_name if not set (e.g., after waived tests)
+            if benchmark_type is None:
+                benchmark_type = 'lat' if 'latency' in test_name else 'bw'
+            # Only add result when we have valid metric name (avoid _sum_None or sum_None)
+            if test_name and benchmark_type:
+                parse_status['results'][f'{test_name}_sum_{benchmark_type}'] = float(value)


The new fallback/inference paths for SUM parsing (using match.group(1) when parse_status['test_name'] is empty, and inferring benchmark_type when it's None) aren’t covered by existing nvbandwidth tests (current tests only exercise the full header+matrix path). Adding a unit test with a minimal raw output containing a SUM ... line but no prior matrix header (and/or no preceding Running ...) would help prevent regressions of the _sum_None metric-name bug.

Copilot AI review requested due to automatic review settings March 3, 2026 14:11

NJX-njx requested a review from a team as a code owner March 3, 2026 14:11

Copilot started reviewing on behalf of NJX-njx March 3, 2026 14:12 View session

Copilot AI reviewed Mar 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix NVBandwidth benchmark results parsing bug (#748)#782

Fix NVBandwidth benchmark results parsing bug (#748)#782
NJX-njx wants to merge 1 commit intomicrosoft:mainfrom
NJX-njx:fix/nvbandwidth-parsing-bug-748

NJX-njx commented Mar 3, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 3, 2026

Uh oh!

Copilot AI Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

NJX-njx commented Mar 3, 2026

Summary

Root Cause

Changes

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants