Skip to content

OpenScribe is an open-source AI scribe that records patient encounters and generates structured clinical notes automatically. You keep full control over data, workflows, and patient privacy with no vendor lock-in.

License

Notifications You must be signed in to change notification settings

sammargolis/OpenScribe

Repository files navigation

OpenScribe Banner

OpenScribe

Open-source AI medical scribe for recording encounters and generating structured clinical notes.

MIT License Discord Demo

OpenScribe

Project Overview

OpenScribe is a free MIT license open source AI Medical Scribe that helps clinicians record patient encounters, transcribe audio, and generate structured draft clinical notes using LLMs. The default web deployment path is mixed mode: local Whisper transcription + Anthropic Claude note generation. A fully local desktop path is also available forked from StenoAI.

This software is currently in early development (v0.x) and is NOT suitable for clinical practice yet. It is intended for evaluation, testing, and development purposes only.

  • HIPAA Compliant version is currently in the works. Join the Discord for more information when the version is launched.

Demo

Demo

Watch Demo

Quick Start (5 minutes)

1. Install Prerequisites

node --version  # Check you have Node.js 18+
# If not installed: brew install node if version < 18: brew upgrade node  (macOS) or download latest from nodejs.org
npm install -g pnpm

2. Clone and Install

git clone https://github.com/sammargolis/OpenScribe.git
cd OpenScribe
pnpm install

3. Configure Environment (Mixed Web Default)

Create env defaults:

pnpm run setup  # Auto-generates .env.local with secure storage key

Edit apps/web/.env.local and add:

TRANSCRIPTION_PROVIDER=whisper_local
WHISPER_LOCAL_MODEL=tiny.en
ANTHROPIC_API_KEY=sk-ant-YOUR_KEY_HERE
# NEXT_PUBLIC_SECURE_STORAGE_KEY is auto-generated, don't modify

OPENAI_API_KEY is optional unless you switch to TRANSCRIPTION_PROVIDER=whisper_openai.

4. Start the App

pnpm dev:local    # One command: Whisper local server + web app

Optional desktop app path:

pnpm dev:desktop

Runtime Modes

OpenScribe supports three workflows. Mixed web mode is the default path.

Mixed Web (default)

  • Transcription: local Whisper server (pnpm whisper:server) with default model tiny.en
  • Notes: larger model (default Claude in web path)
  • Start everything with one command: pnpm dev:local
  • Configure with TRANSCRIPTION_PROVIDER=whisper_local in apps/web/.env.local
  • Setup guide: docs/WHISPER-LOCAL-SETUP.md

Local-only Desktop (optional)

  • Transcription: local Whisper backend in local-only/openscribe-backend
  • Notes: local Ollama models (llama3.2:*, gemma3:4b)
  • No cloud inference in this path
  • Setup guide: /Users/sammargolis/OpenScribe/local-only/README.md

Cloud/OpenAI + Claude (fallback)

  • Transcription: OpenAI Whisper API
  • Notes: Anthropic Claude (or other hosted LLM)
  • Requires API keys in apps/web/.env.local

FYI Getting API Keys

OpenAI (transcription): platform.openai.com/api-keys - Sign up → API Keys → Create new secret key

Anthropic (note generation): console.anthropic.com/settings/keys - Sign up → API Keys → Create Key

Both services offer $5 free credits for new accounts

Staying Updated

git pull origin main  # Pull latest changes
pnpm install          # Update dependencies

# If you encounter issues after updating:
rm -rf node_modules pnpm-lock.yaml && pnpm install

Purpose and Philosophy

OpenScribe exists to provide a simple, open-source alternative to cloud dependent clinical documentation tools. The project is built on core principles:

  • Local-first storage: Encounter data is stored locally in the browser by default
  • Privacy-conscious: No analytics or telemetry in the web app; external model calls are explicit and configurable
  • Modular: Components can be swapped or extended (e.g., different LLM providers, transcription services)

Local MedGemma (Text-Only) Scribe

This repo now includes a fully local, text-only MedGemma scribe workflow in packages/pipeline/medgemma-scribe. It requires pre-transcribed text and does not perform speech-to-text. See packages/pipeline/medgemma-scribe/README.md for setup and usage.

Project Resources

Roadmap

Current Status (v0)

  • Core recording, transcription, and note generation
  • AES-GCM encrypted local storage
  • Browser-based audio capture

Near-term (v0.1-0.5)

  • Error handling improvements
  • Comprehensive test coverage
  • Basic audit logging

Physical Controls:

  • User responsibility (device security, physical access)

Future Goals (v2.0+)

  • Package app to be able to run 100% locally with transciption model and small 7b model for note generation
  • Multiple LLM providers (Anthropic, local models)
  • Custom note templates
  • Optional cloud sync (user-controlled)
  • Multi-language support
  • Mobile app
  • EHR integration
  • RCM integration

Architecture

See architecture.md for complete details.

┌─────────────────────────────────────────────────────────┐
│                    UI Layer (Next.js)                   │
│  ┌──────────────┐              ┌─────────────────────┐  │
│  │ Encounter    │              │  Workflow States    │  │
│  │ Sidebar      │◄────────────►│  - Idle             │  │
│  │              │              │  - Recording        │  │
│  │              │              │  - Processing       │  │
│  │              │              │  - Note Editor      │  │
│  └──────────────┘              └─────────────────────┘  │
└─────────────────────────────────────────────────────────┘
                          │
                          ▼
┌─────────────────────────────────────────────────────────┐
│              Processing Pipeline                         │
│  ┌──────────┐   ┌──────────┐   ┌──────────┐   ┌─────┐  │
│  │  Audio   │──►│Transcribe│──►│   LLM    │──►│Note │  │
│  │  Ingest  │   │ (Whisper)│   │          │   │Core │  │
│  └──────────┘   └──────────┘   └──────────┘   └─────┘  │
│       │                                           │     │
│       └───────────────┐         ┌─────────────────┘     │
└───────────────────────┼─────────┼───────────────────────┘
                        ▼         ▼
┌─────────────────────────────────────────────────────────┐
│                  Storage Layer                           │
│  ┌──────────────────────────────────────────────────┐   │
│  │  Encrypted LocalStorage (AES-GCM)                │   │
│  │  - Encounters (patient data, transcripts, notes) │   │
│  │  - Metadata (timestamps, status)                 │   │
│  │  - Audio (in-memory only, not persisted)        │   │
│  └──────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────┘

Key Components:

  • UI Layer: React components in apps/web/ using Next.js App Router
  • Audio Ingest: Browser MediaRecorder API → WebM/MP4 blob
  • Transcription (default web path): local Whisper server (whisper.cpp via pywhispercpp, model tiny.en)
  • LLM (default web path): Anthropic Claude via packages/llm
  • Fully local desktop path: Whisper + Ollama via local-only/openscribe-backend
  • Cloud fallback path: OpenAI Whisper API + hosted provider via packages/llm
  • Note Core: Structured clinical note generation and validation
  • Storage: AES-GCM encrypted browser localStorage

Monorepo Structure:

  • apps/web/ – Next.js frontend + Electron renderer
  • packages/pipeline/ – Audio ingest, transcription, assembly, evaluation
  • packages/ui/ – Shared React components
  • packages/storage/ – Encrypted storage + encounter management
  • packages/llm/ – Provider-agnostic LLM client
  • packages/shell/ – Electron main process
  • config/ – Shared configuration files
  • build/ – Build artifacts

Privacy & Data Handling

Storage: AES-GCM encrypted localStorage. Audio processed in-memory, not persisted.
Transmission: In the default mixed mode, audio stays local for transcription and transcript text is sent to Anthropic Claude over HTTPS/TLS for note generation. If TRANSCRIPTION_PROVIDER=whisper_openai, audio is sent to OpenAI Whisper over HTTPS/TLS. The application enforces HTTPS-only connections and displays a security warning if accessed over HTTP in production builds.
No Tracking: Zero analytics, telemetry, or cloud sync

Use Responsibility

  • All AI notes are drafts requiring review
  • Ensure regulatory compliance for your use case
  • For production deployments serving PHI, ensure the application is accessed via HTTPS or served from localhost only

Limitations & Disclaimers

HIPAA Compliance: OpenScribe includes foundational privacy/security features, but this alone does not make the application HIPAA-compliant. Below is what is already built, followed by a checklist a health system must complete to operate compliantly.

Built (foundational rails)

  • AES-GCM encrypted localStorage for PHI at rest in the browser
  • Audio processed in-memory and not persisted
  • TLS/HTTPS for external API calls (Whisper, Claude)
  • No analytics/telemetry or cloud sync by default (local-first)
  • HTTPS-only enforcement in production with HTTP warning

Health System Checklist (required to run compliantly)

  • Execute BAAs with all PHI-touching vendors (e.g., OpenAI, Anthropic, hosting providers)
  • Perform and document HIPAA Security Rule risk analysis and remediation plan
  • Implement access controls (SSO/MFA, least-privilege, session timeouts)
  • Establish audit logging, log review processes, and retention policies
  • Define data retention, backup, and secure deletion procedures for PHI
  • Configure device, endpoint, and physical safeguards (disk encryption, MDM, secure workstations)
  • Document policies and procedures (incident response, breach notification, sanctions)
  • Train workforce on HIPAA/privacy/security requirements
  • Establish key management and secret rotation procedures
  • Validate network/security posture (secure deployment, vulnerability management)

No EHR Integration: Standalone tool
Browser Storage Limits: ~5-10MB typical
No Warranty: Provided as-is under MIT License

Contributing

Contributions welcome! See CONTRIBUTING.md for detailed guidelines.

Quick Start:

  1. Fork the repository
  2. Create a feature branch
  3. Make changes with tests
  4. Submit a PR

Notices

Portions of this project include or were derived from code in:

StenoAIhttps://github.com/ruzin/stenoai
Copyright (c) 2025 Skrape Limited
Licensed under the MIT License.

All third-party code remains subject to its original license terms.

License

MIT

MIT License

Copyright (c) 2026 Sam Margolis

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Citation

OpenScribe
GitHub: https://github.com/sammargolis/OpenScribe
Maintainer: Sam Margolis (@sammargolis)

About

OpenScribe is an open-source AI scribe that records patient encounters and generates structured clinical notes automatically. You keep full control over data, workflows, and patient privacy with no vendor lock-in.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

No packages published

Contributors 5