Add AI Safety section to documentation

vishwaparekh · web-flow · commit a8e8d715987a · 2025-11-19T09:44:12.000-06:00
Added a section on AI Safety and Trustworthiness, detailing key areas of study including bias, robustness, security, and best practices for generative AI.
diff --git a/research/ai-safety.md b/research/ai-safety.md
@@ -1 +1,18 @@
+---
+layout: default
+title: AI Safety
+---
+
+## AI Safety & Trustworthiness
+
+We study when and how AI systems fail, and how to make their behavior more reliable in critical settings.
+
+This includes:
+
+- **Bias and fairness** – detecting and characterizing underdiagnosis and demographic leakage in imaging AI and language models.
+- **Robustness to clinical variation** – stress-testing models under real-world shifts in acquisition, scanners, and protocols.
+- **Security & adversarial bias** – understanding “hidden in plain sight” attacks and other subtle ways systems can be manipulated.
+- **Best practices for generative AI** – guidelines for the safe use of large language models in radiology and clinical workflows.
+
+The goal is to design evaluation frameworks and mitigation strategies that go beyond accuracy, placing safety and trust at the center of AI deployment.