BDH development by ganeshflexiana · Pull Request #9 · pathwaycom/bdh

ganeshflexiana · 2026-03-30T15:04:02Z

No description provided.

ganeshflexiana · 2026-04-07T05:35:47Z

Baby Dragon Hatchling (BDH) Training Pipeline

Adds an end-to-end pipeline for training, evaluating, and demonstrating memory on a small character-level LM.

What's new:

train.py – configurable training with 10%-interval checkpointing
compare_checkpoints.py – checkpoint evaluation report and generation log
memory.py – fast (context-based) and model (weight-based) memory demos
main.py – one-command runner for the full pipeline
Docker images for CPU and GPU (CUDA)
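
For reviewers who want a concrete picture of "10%-interval checkpointing", here is a minimal sketch of how such a schedule could be computed. It is an illustration only: the function name checkpoint_steps and the num_checkpoints parameter are hypothetical, and the actual logic in train.py may differ.

```python
def checkpoint_steps(max_iters: int, num_checkpoints: int = 10) -> list[int]:
    """Return the iterations at which a checkpoint would be written.

    Hypothetical helper: assumes checkpoints are evenly spaced, one per
    10% of training by default, with the last one at max_iters.
    """
    if max_iters <= 0:
        return []
    # Integer arithmetic avoids floating-point drift; the set removes
    # duplicates that appear when max_iters < num_checkpoints.
    steps = {max(1, max_iters * k // num_checkpoints)
             for k in range(1, num_checkpoints + 1)}
    return sorted(steps)


print(checkpoint_steps(1000))
# [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
```

With --max_iters 1000 this schedule would give ten checkpoints, which is the kind of set a report like the one from compare_checkpoints.py could then evaluate.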

To run locally:

pip install numpy torch requests psutil pandas matplotlib
python main.py --max_iters 1000

⚠️ Local Memory Note: Running locally requires more memory than the Docker setup. Check your system memory settings before running: close other applications and start with a smaller --max_iters (e.g. 100) to test memory usage first. Use Docker if your machine has limited RAM.

Outputs land in outputs/training/, outputs/evaluation/, and outputs/memory/. See README.md for full details on interpreting the loss curve and checkpoint samples.
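
To check quickly what a run produced, a small sketch like the following lists the files in each output folder. The three directory names come from the description above; the helper name summarize_outputs is hypothetical and is not part of this PR.

```python
from pathlib import Path

# Directory names taken from the PR description.
OUTPUT_SUBDIRS = ("training", "evaluation", "memory")


def summarize_outputs(root: str = "outputs") -> dict[str, list[str]]:
    """Map each expected output folder to a sorted list of its file names.

    Hypothetical helper: folders that do not exist yet map to an empty list.
    """
    base = Path(root)
    summary = {}
    for name in OUTPUT_SUBDIRS:
        folder = base / name
        if folder.is_dir():
            summary[name] = sorted(p.name for p in folder.iterdir() if p.is_file())
        else:
            summary[name] = []
    return summary


for name, files in summarize_outputs().items():
    print(f"outputs/{name}/: {len(files)} file(s)")
```

An empty list for a folder after a run suggests that the corresponding stage did not complete.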

BDH development

c74e92e

ganeshflexiana wants to merge 1 commit into pathwaycom:main from ganeshflexiana:bdh-dev
