
Doza Assist

Find the story inside your footage, not just the clips.

Local-first AI editor's assistant for documentary and spoken-word video. Free and open source.

Documentary films, corporate video, podcasts, news, legal depositions, customer testimonials, training content. If someone is talking on camera and you need to find the best moments, this tool does the heavy lifting.

Drop in footage, transcribe it, chat with the AI about what story you're looking for, build narrative sequences from your transcript, and export pre-cut timelines directly to Final Cut Pro. Everything runs on your machine. Nothing uploads. Nothing leaves.

Why I Built This

I'm a documentary filmmaker and I needed a way to find story beats and soundbites across hours of interview footage without uploading client material to cloud services. Existing tools were either too expensive, too slow, or required sending sensitive footage to third-party servers. So I built something that runs entirely on my Mac, uses AI locally, and exports directly to my Final Cut Pro timeline.


Features

Transcription

  • Drag and drop video/audio files (MP4, MOV, WAV, MP3, MXF, etc.)
  • Transcribes locally — no cloud uploads
  • Uses NVIDIA Parakeet TDT (via MLX) on Apple Silicon for fast English transcription, WhisperX large-v3 for 99+ languages
  • Word-level timestamps for precise sync
  • Click speaker names to assign who said what

Transcript Viewer

  • Clean paragraph layout grouped by speaker
  • Video player synced to transcript with word-level highlighting
  • Click any word to jump to that moment
  • Color highlighter — drag across words to create clips (like highlighting in a document)
  • 5 renamable color labels for organizing selects
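The highlighter's core job is mapping a dragged word range to in/out timecodes. A minimal sketch of the idea, assuming each transcribed word carries start/end times (a hypothetical schema for illustration, not the app's actual internals):

```python
# Sketch: turn a highlighted word range into a clip. Assumes each word
# dict carries "word", "start", and "end" keys (hypothetical schema).
def highlight_to_clip(words, first, last, color="yellow"):
    """Convert an inclusive word-index range into a clip dict."""
    span = words[first:last + 1]
    return {
        "start": span[0]["start"],        # in point = first word's start
        "end": span[-1]["end"],           # out point = last word's end
        "color": color,
        "excerpt": " ".join(w["word"] for w in span),
    }

words = [
    {"word": "Find",  "start": 0.00, "end": 0.31},
    {"word": "the",   "start": 0.31, "end": 0.42},
    {"word": "story", "start": 0.42, "end": 0.88},
]
clip = highlight_to_clip(words, 1, 2)
```

Because the clip stores real media timecodes rather than text offsets, it can drive playback, the Clip Library, and the FCPX export alike.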

Clip Library

  • All highlights collected in a visual grid
  • Each clip has play/pause, scrub bar, duration, and transcript excerpt
  • Checkbox select for batch export
  • Add clips from transcript, AI analysis, or AI chat

AI Analysis (powered by Ollama — free, local)

  • Story structure with beats (hook, context, rising action, climax, resolution)
  • Social media clip suggestions with platform recommendations
  • Strongest soundbites identified
  • Every item has play/scrub controls and one-click "Add to Clips"

Narrative Intelligence (AI Chat)

  • Conversational AI that knows your transcript — ask for clips, themes, story angles, soundbites
  • Build stories directly from chat: "build me a 3-minute story about her journey from athlete to coach"
  • AI suggests clips with timecodes — play them instantly, add to clips, or build as a story
  • Every timecode is clickable with a + button to add as a clip on the spot
  • Pull all suggested clips at once or build them into a story sequence
  • Follow-up questions maintain context

Story Builder

  • Describe the story you want to tell and the AI assembles it from your footage
  • Works from Chat or the dedicated Story Builder tab
  • The story agent reads the full transcript, selects the strongest soundbites, and arranges them into a narrative arc — hook, rising action, emotional peak, resolution
  • Returns an ordered sequence of clips with editorial notes explaining why each clip is in that position
  • Drag to reorder clips, remove what doesn't work, rebuild with a different prompt
  • Play All button plays the entire sequence back-to-back so you can hear the story before you cut it
  • One-click export to FCPX — the clips land on your timeline in story order, ready to refine
  • Stories sidebar: browse, rename, switch between, and delete story builds
  • Save multiple story builds per project to compare different angles or versions
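Conceptually, a story build is an ordered clip list with editorial notes, which is what makes reorder, rebuild, and Play All cheap operations. A rough sketch of that structure (hypothetical, not the app's code):

```python
# Hypothetical story-build structure: ordered clips plus editorial notes.
story = [
    {"start": 12.0,  "end": 19.5,  "note": "Hook: the injury that ended her season"},
    {"start": 88.2,  "end": 101.0, "note": "Turning point: first day coaching"},
    {"start": 140.0, "end": 152.5, "note": "Resolution: what the team means now"},
]

def total_duration(story):
    """Runtime of the whole sequence, as heard via Play All."""
    return sum(c["end"] - c["start"] for c in story)

def move_clip(story, i, j):
    """Drag-to-reorder: move the clip at index i to index j."""
    clip = story.pop(i)
    story.insert(j, clip)
    return story
```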

FCPX Export

  • Pre-cut timeline — each clip becomes an actual edit referencing your source media
  • Import the .fcpxml and your selects are ready to review in Final Cut Pro
  • Keyword ranges on the source clip for browser filtering
  • Also exports SRT subtitles, plain text, and JSON
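In FCPXML, a pre-cut timeline is a spine of `asset-clip` elements that all reference one source asset, each with its own in point and duration. A stripped-down sketch using Python's stdlib (the real export also handles frame-rate math, formats, and keyword ranges; element names follow the FCPXML spec, but this is illustrative, not the app's actual generator):

```python
import xml.etree.ElementTree as ET

def build_fcpxml(src_path, clips):
    """Build a minimal FCPXML tree: one asset, one asset-clip per select.
    Times are written as whole-second rationals ("12/1s") for simplicity;
    a real export must use frame-accurate rationals for the media's rate."""
    root = ET.Element("fcpxml", version="1.11")
    res = ET.SubElement(root, "resources")
    ET.SubElement(res, "asset", id="r1", src=src_path)
    lib = ET.SubElement(root, "library")
    event = ET.SubElement(lib, "event")
    project = ET.SubElement(event, "project", name="Selects")
    seq = ET.SubElement(project, "sequence")
    spine = ET.SubElement(seq, "spine")
    offset = 0
    for c in clips:
        dur = c["end"] - c["start"]
        ET.SubElement(spine, "asset-clip", ref="r1",
                      offset=f"{offset}/1s",      # position on the timeline
                      start=f"{c['start']}/1s",   # in point in the source
                      duration=f"{dur}/1s")
        offset += dur
    return ET.tostring(root, encoding="unicode")
```

Each clip lands back-to-back on the timeline (`offset` accumulates the durations), which is why the selects arrive in Final Cut already in story order.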

Client Sharing

  • One-click Cloudflare Tunnel generates a public URL
  • Clients see the full project: transcript, player, highlighting tools
  • No destructive controls exposed — clients can highlight and listen
  • No accounts or signups needed

Project Organization

  • Folder system for organizing by client
  • Rename, move, clear, delete projects
  • Multi-project workspace — combine interviews in one view

My Style — An AI That Tells Stories Like You

  • Teach Doza Assist your editorial voice by importing finished projects you've cut
  • The app transcribes your finished pieces, analyzes how you shape spoken stories (pacing, beat length, cut style, story structure), and builds a style profile
  • Once active, every AI suggestion — chat, story builder, clip selection — reflects your narrative instincts
  • Toggle My Style on or off from the chat input, story builder, or the dedicated My Style page
  • Your style profile stays on your machine and persists across sessions
  • Import more projects over time to refine the profile
  • Export your style as JSON or delete it anytime

Dark / Light Theme

  • Toggle between dark and light mode
  • Persists across sessions

Download

Download Doza Assist (macOS)

  1. Download the .dmg file from the link above
  2. Open it and drag Doza Assist to your Applications folder
  3. Double-click to launch

First launch: macOS may block the app. Go to System Settings > Privacy & Security, scroll down, and click "Open Anyway" next to the Doza Assist message. This only happens once.

On first launch, the app will automatically install everything it needs. You may be asked for your Mac password once during setup. The AI model download (~3-5 GB) takes a few minutes — the app shows progress the whole time.

That's it. No Terminal required.


Quick Start (Developer Install)

If you prefer to run from source:

Prerequisites

  • macOS (tested on Mac Studio M2)
  • Python 3.11+
  • ffmpeg (brew install ffmpeg)
  • Ollama for AI features (https://ollama.com)

Install

git clone https://github.com/DozaVisuals/doza-assist.git
cd doza-assist

# Create virtual environment
python3 -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Install ffmpeg if you don't have it
brew install ffmpeg

Run

./start.sh

Open http://localhost:5050

AI Setup (optional but recommended)

Local with Ollama (free, private — recommended):

# Install and start Ollama
brew install ollama
ollama serve

# Pull a model (gemma4 recommended)
ollama pull gemma4

Cloud with Claude API (higher quality, optional): If you want better AI analysis quality, you can use Anthropic's Claude API as an alternative backend. Transcription still runs locally — only the AI analysis and chat use the API.

export ANTHROPIC_API_KEY=sk-ant-your-key-here
# Add to ~/.zshrc to persist

The app automatically tries Ollama first and falls back to Claude if configured.
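The selection logic amounts to "probe Ollama, else check for a key." A minimal sketch of that behavior, using Ollama's `/api/tags` endpoint as a liveness check (illustrative only, not the app's actual code):

```python
import os
import urllib.request

def pick_backend(ollama_url="http://localhost:11434", timeout=2):
    """Prefer local Ollama; fall back to the Claude API when
    ANTHROPIC_API_KEY is set; otherwise report no AI backend."""
    try:
        # /api/tags lists installed models; success means Ollama is up.
        urllib.request.urlopen(f"{ollama_url}/api/tags", timeout=timeout)
        return "ollama"
    except OSError:
        pass
    return "claude" if os.environ.get("ANTHROPIC_API_KEY") else None
```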


How It Works

  1. Add a file — Paste a path, browse, or drag a video/audio file
  2. Transcribe — Click "Transcribe" to process locally (Parakeet on Apple Silicon, Whisper otherwise)
  3. Assign speakers — Click speaker names in the transcript to toggle between speakers
  4. Highlight clips — Select a color and drag across words to mark selects
  5. Discover with AI — Run AI Analysis or ask the Chat for clips and story structure
  6. Export to FCPX — Export pre-cut timeline with your clips as edits on the timeline
  7. Share with clients — Click Share to generate a public link for client review

Tech Stack

  • Backend: Python / Flask
  • Frontend: Vanilla JS, CSS custom properties
  • Transcription: Parakeet TDT via MLX (fast, Apple Silicon native) with OpenAI Whisper fallback
  • AI: Ollama with Gemma 4 (local, free) or Claude API (optional)
  • Audio: ffmpeg for extraction
  • Sharing: Cloudflare Tunnel (free, no account needed)
  • Export: FCPXML 1.11 with asset references
  • Storage: JSON files per project (no database)

Project Structure

doza-assist/
├── app.py               # Flask server + all routes
├── transcribe.py        # Whisper transcription engine
├── ai_analysis.py       # AI analysis + chat (Ollama/Claude)
├── fcpxml_export.py     # FCPXML generation with pre-cut timelines
├── editorial_dna/       # My Style — editorial voice profiling
│   ├── transcript_analyzer.py  # Narrative pattern extraction
│   ├── classifier.py    # AI-powered style classification
│   ├── summarizer.py    # Natural language summary generation
│   ├── storage.py       # Profile persistence (~/.doza-assist/)
│   └── injector.py      # Injects style into AI prompts
├── start.sh             # Launch script (developer mode)
├── install.sh           # Manual setup (developer mode)
├── setup_runner.sh      # Auto-setup phase 1 (Xcode CLT, Homebrew, Python)
├── setup_assistant.py   # Auto-setup phase 2 (browser UI for remaining deps)
├── dep_check.sh         # Quick dependency checker for app launches
├── build_launcher.sh    # Builds .app bundle + .dmg
├── requirements.txt     # Python dependencies
├── static/
│   └── style.css        # All styles (dark + light themes)
├── templates/
│   ├── dashboard.html   # Projects page with folders
│   ├── project.html     # Main project view (all tabs)
│   ├── my_style.html    # My Style page
│   └── ...
├── projects/            # User data (gitignored)
└── exports/             # FCPXML exports (gitignored)

Troubleshooting

Install failing? Run bash install.sh --clean to wipe the setup and start completely fresh.

Want to completely remove Doza Assist? Run bash uninstall.sh — it will walk you through what gets removed and ask before doing anything.

Getting a Python error or "command not found"? Make sure Xcode Command Line Tools are installed:

xcode-select --install

Wait for the installer to finish, then run bash install.sh --clean.

macOS blocking the app? Go to System Settings > Privacy & Security, scroll down, and click Open Anyway next to the Doza Assist message. This only happens once.

Want to report a bug? The installer saves a full log to install_log.txt in the project folder. Attach it when reporting issues — it shows exactly where things went wrong.


Privacy

  • All transcription runs locally on your machine
  • AI analysis uses Ollama (local) by default — nothing leaves your computer
  • Audio/video files are never uploaded anywhere
  • Client sharing uses a temporary tunnel URL that stops when you quit the app
  • All project data stored as local JSON files

License

MIT


Built by Doza Visuals
