Release v0.9 #62

Open
esitaridi wants to merge 1 commit into main from v0.9

Collaborator

esitaridi commented Apr 6, 2026

New features:
- Add coefficient of variation to bandwidth output statistics
- Add huge page support for host memory (disabled on Windows)
- Add option to sample pairs in device-to-device tests
- Add troubleshooting guide
- Unify multinode and single-node execution paths

Improvements:
- Improve CUDA architecture detection without requiring GPU access
- Deprecate Volta (sm_70/sm_72) support for CUDA toolkit >=13.0

Bug fixes:
- Fix JSON output aggregation

Platform:
- Skip Boost static libs on Azure Linux
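
For readers unfamiliar with the statistic named in the new-features list, the coefficient of variation is the standard deviation divided by the mean, which gives a unitless measure of run-to-run spread. The following is a minimal sketch only, not the code from this PR: the function name, the sample values, and the choice of the sample (n-1) standard deviation are all assumptions, since the description does not say which estimator the tool uses.

```python
import math
import statistics


def coefficient_of_variation(samples):
    """Return the sample standard deviation divided by the mean.

    Assumption: the sample (n-1) standard deviation is used; the PR
    description does not state which estimator the tool applies.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    mean = statistics.fmean(samples)
    if mean == 0:
        raise ValueError("mean is zero; coefficient of variation is undefined")
    return statistics.stdev(samples) / mean


# Hypothetical bandwidth samples in GB/s, for illustration only.
bandwidth_gbps = [24.0, 26.0, 25.0, 25.0]
cv = coefficient_of_variation(bandwidth_gbps)
print(f"CV = {cv:.4f}")
```

A low value suggests the measured bandwidth is stable across iterations; a high value suggests noisy measurements that may be worth rerunning.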

…sampling

New features:
- Add coefficient of variation to bandwidth output statistics
- Add huge page support for host memory (disabled on Windows)
- Add option to sample pairs in device-to-device tests
- Unify multinode and single-node execution paths
- Add troubleshooting guide

Improvements:
- Improve CUDA architecture detection without requiring GPU access
- Override CMake's outdated architecture defaults
- Deprecate Volta (sm_70/sm_72) support
- Re-add UUID reporting
- Update latency test documentation

Bug fixes:
- Fix JSON output aggregation
- Fix hang in stream synchronization
- Fix huge page detection check
- Fix multinode device check

Platform:
- Fix Windows builds (unistd.h guard, huge page disable)
- Skip Boost static libs on Azure Linux
esitaridi self-assigned this Apr 6, 2026
Contributor

deepakcu commented Apr 6, 2026

LGTM
