This is the project page for the paper "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors".
SORRY-Bench/sorry-bench.github.io
Languages
- HTML 62.4%
- Jupyter Notebook 32.1%
- CSS 3.9%
- JavaScript 1.5%
- Python 0.1%