Commit a8e8d71

Add AI Safety section to documentation
Added a section on AI Safety and Trustworthiness, detailing key areas of study including bias, robustness, security, and best practices for generative AI.
1 parent 3e9f941 commit a8e8d71

1 file changed: +17 -0 lines changed

research/ai-safety.md

Lines changed: 17 additions & 0 deletions
@@ -1 +1,18 @@
+---
+layout: default
+title: AI Safety
+---
+
+## AI Safety & Trustworthiness
+
+We study when and how AI systems fail, and how to make their behavior more reliable in critical settings.
+
+This includes:
+
+- **Bias and fairness** – detecting and characterizing underdiagnosis and demographic leakage in imaging AI and language models.
+- **Robustness to clinical variation** – stress-testing models under real-world shifts in acquisition, scanners, and protocols.
+- **Security & adversarial bias** – understanding “hidden in plain sight” attacks and other subtle ways systems can be manipulated.
+- **Best practices for generative AI** – guidelines for the safe use of large language models in radiology and clinical workflows.
+
+The goal is to design evaluation frameworks and mitigation strategies that go beyond accuracy, placing safety and trust at the center of AI deployment.
