Contributing to TableCapture

Thank you for your interest in contributing to TableCapture! This guide will help you get started with development.

Getting Started
Development Setup
Running Tests
Building a Release
macOS Security & Permissions
Architecture & Design Notes
Submitting Changes

Getting Started

Prerequisites

macOS 12.3+ (Monterey or later)
Xcode 14+ with Swift 5.5+
Apple Silicon Mac (arm64)

Clone the Repository

git clone https://github.com/psenger/TableCapture.git
cd TableCapture

Open in Xcode

open TableCapture.xcodeproj

Development Setup

Open TableCapture.xcodeproj in Xcode
Select the TableCapture scheme
Build and run (⌘R)
Grant Screen Recording permission when prompted (System Settings > Privacy & Security > Screen Recording)

Running Tests

In Xcode

Run all tests: ⌘U
Run a single test: Click the diamond icon next to the test function

From Command Line

Run all tests:

xcodebuild test -project TableCapture.xcodeproj -scheme TableCapture -destination 'platform=macOS,arch=arm64'

Run only the ComplexLayoutMultiColMultiRowTests:

xcodebuild test -project TableCapture.xcodeproj -scheme TableCapture -destination 'platform=macOS,arch=arm64' -only-testing:TableCaptureTests/ComplexLayoutMultiColMultiRowTests

Run only the debug test to see OCR output:

xcodebuild test -project TableCapture.xcodeproj -scheme TableCapture -destination 'platform=macOS,arch=arm64' -only-testing:TableCaptureTests/ComplexLayoutMultiColMultiRowTests/debugComplexLayout

Run a specific test (CSV or Markdown):

xcodebuild test -project TableCapture.xcodeproj -scheme TableCapture -destination 'platform=macOS,arch=arm64' -only-testing:TableCaptureTests/ComplexLayoutMultiColMultiRowTests/testComplexLayoutMarkdown

Test Structure

Type	Framework	Location	Purpose
Unit Tests	Swift Testing	`TableCaptureTests/`	Test functions, logic, data transformations
UI Tests	XCTest + XCUITest	`TableCaptureUITests/`	Test user interactions and full app behavior

For detailed testing documentation, see TESTING.md.

Building a Release

Creating a Release Candidate

Build for Release in Xcode
- Select Product → Archive from the menu
- Wait for the archive to complete
- In the Organizer window, click Distribute App
- Choose Custom → Copy App and save to a location (e.g., Desktop)
Locate the .app Bundle
- Find TableCapture.app in the exported location

Create a Professional DMG Installer (with drag-to-Applications)

# Navigate to the folder containing TableCapture.app
cd /path/to/exported/app

# Create Applications symlink next to your app
ln -s /Applications Applications

# Create the DMG containing both the app and Applications link
hdiutil create -volname "TableCapture" \
  -srcfolder . \
  -ov -format UDZO \
  ../TableCapture.dmg

# Clean up the symlink
rm Applications

Create a GitHub Release
- Go to your repository → Releases
- Click Draft a new release
- Create a new tag (e.g., v1.0.0)
- Add release notes
- Upload the TableCapture.dmg file
- Publish the release

macOS Security & Permissions

Why Does Rebuilding Break Permissions?

When you rebuild the app, macOS often treats it as a "different" application:

Code Signature Changes: Each build gets a new signature, and macOS ties permissions (like Screen Recording) to that signature
Cached Permissions: The old permission is still registered but for the "old" app signature
macOS Gets Confused: It sees your app as brand new and blocks it

Solutions

Quick Fix (During Development)

# 1. Kill the app completely
killall TableCapture

# 2. Reset Screen Recording permissions for your app
tccutil reset ScreenCapture com.philipasenger.TableCapture

# 3. Rebuild and run in Xcode
# You'll need to re-grant permission in System Settings → Privacy & Security → Screen Recording

Better Fix (Consistent Identity)

Set a stable code signing identity in Xcode:

Go to your project settings → Signing & Capabilities
Enable Automatically manage signing
Make sure you have a consistent Team selected
Ensure your Bundle Identifier never changes (e.g., com.yourname.TableCapture)

Nuclear Option (When All Else Fails)

# Reset ALL TCC (privacy) permissions for your app - use carefully!
tccutil reset All com.philipasenger.TableCapture

Note: You'll need to re-grant Screen Recording permission after each rebuild during development. This is annoying but normal for macOS security.

For Distribution

When ready to distribute:

Sign with a Developer ID certificate
Notarize the app with Apple

This makes the signature consistent and permissions stick between launches for users.

Architecture & Design Notes

OCR Implementation

TableCapture uses a dual OCR approach:

Primary: Apple Vision Framework (Native macOS)

No external dependencies
Fast and accurate for most tables
Uses VNRecognizeTextRequest for text recognition
Custom logic groups text by Y coordinate to detect rows

// Simplified flow:
1. Load image as CGImage
2. Create VNRecognizeTextRequest
3. Group VNRecognizedTextObservation by Y coordinate (rows)
4. Sort cells within each row by X coordinate (columns)
5. Convert to CSV/Markdown format

Fallback: Tesseract OCR

Used for challenging cases (e.g., single letters)
Integrated via Tesseract-macOS wrapper
See ACKNOWLEDGMENTS.md for library details

Alternative Approaches Considered

img2table (Python) - Not Used

While img2table is specifically designed for table extraction, it was not chosen due to:

External Python dependency
Harder to install and bundle
Would require shipping Python runtime

# Example of what img2table usage would look like:
from img2table.document import Image
from img2table.ocr import TesseractOCR

ocr = TesseractOCR(n_threads=1, lang="eng")
doc = Image(src=image_path)
tables = doc.extract_tables(ocr=ocr, implicit_rows=True, borderless_tables=True)

Submitting Changes

Pull Request Process

Fork the repository and create your branch from main
Write tests for any new functionality
Run all tests to ensure nothing is broken
Update documentation if you've changed APIs or added features
Submit a pull request with a clear description of changes

Code Style

Follow Swift best practices and existing code patterns
Use meaningful variable and function names
Add comments for complex logic
Keep functions focused and small

Commit Messages

Use clear, descriptive commit messages
Start with a verb (Add, Fix, Update, Remove, etc.)
Reference issues when applicable (e.g., "Fix #123: ...")

Questions?

If you have questions or need help:

Open an issue on GitHub
Check existing issues for similar questions

Thank you for contributing!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributing to TableCapture

Table of Contents

Getting Started

Prerequisites

Clone the Repository

Open in Xcode

Development Setup

Running Tests

In Xcode

From Command Line

Run all tests:

Run only the ComplexLayoutMultiColMultiRowTests:

Run only the debug test to see OCR output:

Run a specific test (CSV or Markdown):

Test Structure

Building a Release

Creating a Release Candidate

macOS Security & Permissions

Why Does Rebuilding Break Permissions?

Solutions

Quick Fix (During Development)

Better Fix (Consistent Identity)

Nuclear Option (When All Else Fails)

For Distribution

Architecture & Design Notes

OCR Implementation

Primary: Apple Vision Framework (Native macOS)

Fallback: Tesseract OCR

Alternative Approaches Considered

img2table (Python) - Not Used

Submitting Changes

Pull Request Process

Code Style

Commit Messages

Questions?

FilesExpand file tree

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to TableCapture

Table of Contents

Getting Started

Prerequisites

Clone the Repository

Open in Xcode

Development Setup

Running Tests

In Xcode

From Command Line

Run all tests:

Run only the ComplexLayoutMultiColMultiRowTests:

Run only the debug test to see OCR output:

Run a specific test (CSV or Markdown):

Test Structure

Building a Release

Creating a Release Candidate

macOS Security & Permissions

Why Does Rebuilding Break Permissions?

Solutions

Quick Fix (During Development)

Better Fix (Consistent Identity)

Nuclear Option (When All Else Fails)

For Distribution

Architecture & Design Notes

OCR Implementation

Primary: Apple Vision Framework (Native macOS)

Fallback: Tesseract OCR

Alternative Approaches Considered

img2table (Python) - Not Used

Submitting Changes

Pull Request Process

Code Style

Commit Messages

Questions?