sharpninja
diff --git a/‎.github/pull_request_template.md‎
Lines changed: 16 additions & 0 deletions b/‎.github/pull_request_template.md‎
Lines changed: 16 additions & 0 deletions
diff --git a/‎.github/workflows/benchmark-report.yml‎
Lines changed: 1 addition & 1 deletion b/‎.github/workflows/benchmark-report.yml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎.github/workflows/build.yml‎
Lines changed: 1 addition & 1 deletion b/‎.github/workflows/build.yml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎.gitignore‎
Lines changed: 3 additions & 0 deletions b/‎.gitignore‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 13 additions & 0 deletions b/‎README.md‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎docs/README.md‎
Lines changed: 1 addition & 0 deletions b/‎docs/README.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/SUMMARY.md‎
Lines changed: 1 addition & 0 deletions b/‎docs/SUMMARY.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/repo-alignment-guidelines.md‎
Lines changed: 50 additions & 0 deletions b/‎docs/repo-alignment-guidelines.md‎
Lines changed: 50 additions & 0 deletions
diff --git a/‎docs/todo.yaml‎
Lines changed: 45 additions & 0 deletions b/‎docs/todo.yaml‎
Lines changed: 45 additions & 0 deletions
diff --git a/‎mcp.db‎ b/‎mcp.db‎
@@ -0,0 +1,16 @@
+## Summary
+
+Describe the change and its user-visible impact.
+
+## Validation
+
+- [ ] `dotnet build BitNet-b1.58-Sharp.slnx`
+- [ ] `dotnet test BitNet-b1.58-Sharp.slnx`
+
+## Repository alignment checklist
+
+- [ ] The change preserves the paper-aligned BitNet b1.58 runtime and does not reintroduce retired toy or bigram workflows into the active application surface.
+- [ ] The change keeps the repository domain-agnostic at the core runtime, benchmark, and top-level documentation level.
+- [ ] New or updated docs use Windows-first wording, PowerShell-oriented commands, and Windows-style paths when concrete path examples are needed.
+- [ ] If I added, removed, or renamed pages under `docs\`, I updated both `docs\README.md` and `docs\SUMMARY.md`.
+- [ ] Any new prompts, diagnostics, or examples keep the repository's American English tone.
@@ -40,7 +40,7 @@ jobs:
         run: dotnet build BitNet-b1.58-Sharp.slnx --configuration Release --no-restore
 
       - name: Test
-        run: dotnet test BitNet-b1.58-Sharp.slnx --configuration Release --no-build --no-restore
+        run: dotnet test BitNet-b1.58-Sharp.slnx --configuration Release --no-build --no-restore --filter "Category=SlowLane"
 
       - name: Generate benchmark comparison report
         run: >
 
@@ -43,7 +43,7 @@ jobs:
         run: dotnet build BitNet-b1.58-Sharp.slnx --configuration Release --no-restore
 
       - name: Test
-        run: dotnet test BitNet-b1.58-Sharp.slnx --configuration Release --no-build --no-restore
+        run: dotnet test BitNet-b1.58-Sharp.slnx --configuration Release --no-build --no-restore --filter "Category!=SlowLane"
 
       - name: Pack BitNetSharp.Core
         run: dotnet pack "${{ github.workspace }}/src/BitNetSharp.Core/BitNetSharp.Core.csproj" --configuration Release --no-build --no-restore -p:PackageVersion=${{ steps.gitversion.outputs.semVer }} --output "${{ github.workspace }}/artifacts/packages/core"
 
@@ -417,3 +417,6 @@ FodyWeavers.xsd
 *.msix
 *.msm
 *.msp
+
+AGENTS-README-FIRST.yaml
+.mcpServer/
@@ -4,3 +4,16 @@ Project documentation now lives in GitBook format under `/docs`.
 
 - Start here: [`/docs/README.md`](docs/README.md)
 - Navigation: [`/docs/SUMMARY.md`](docs/SUMMARY.md)
+
+## Windows development focus
+
+This repository is optimized for Windows development with Visual Studio 2022/2025, .NET 9/10, and PowerShell.
+
+Use the `dotnet` CLI from the repository root for the standard validation flow:
+
+```powershell
+dotnet build BitNet-b1.58-Sharp.slnx
+dotnet test BitNet-b1.58-Sharp.slnx
+```
+
+When documentation needs concrete local paths, prefer Windows-style examples such as `C:\src\BitNet-b1.58-Sharp`.
@@ -33,5 +33,6 @@ dotnet test BitNet-b1.58-Sharp.slnx
 - [DataGen guide](datagen-guide.md)
 - [Implementation plan](implementation-plan-v3.md)
 - [Releases and packaging](releases-and-packaging.md)
+- [Repository alignment guidelines](repo-alignment-guidelines.md)
 - [Usage](usage.md)
 - [Training and visualization](training-and-visualization.md)
@@ -10,5 +10,6 @@
   - [Implementation plan v1 (archived)](implementation-plan-v1.md)
   - [Benchmarking and model comparison](benchmarking.md)
   - [Releases and packaging](releases-and-packaging.md)
+  - [Repository alignment guidelines](repo-alignment-guidelines.md)
   - [Usage](usage.md)
   - [Training and visualization](training-and-visualization.md)
@@ -0,0 +1,50 @@
+# Repository alignment guidelines
+
+## Purpose
+
+This repository should stay focused on the paper-aligned BitNet b1.58 runtime and the local tooling needed to build, inspect, benchmark, and document it. These guidelines keep contributions consistent, Windows-first, and domain-agnostic.
+
+## Core alignment rules
+
+### Preserve the paper-aligned runtime surface
+
+Changes should reinforce the active BitNet b1.58 transformer path in `src\BitNetSharp.Core` and the hosting or CLI entry points in `src\BitNetSharp.App`.
+
+Do not reintroduce retired toy, bigram, or unrelated experimental workflows into the active application surface.
+
+### Keep the repository domain-agnostic
+
+The core runtime, built-in training data, benchmark positioning, and top-level documentation should remain general-purpose rather than anchored to a single business vertical, product, or proprietary workflow.
+
+Examples can stay illustrative, but defaults should not hard-code product-specific assumptions into the repository's main experience.
+
+### Prefer Windows-first guidance
+
+When adding or updating documentation, favor PowerShell and `dotnet` CLI examples that work from a standard Windows clone.
+
+If a document needs a concrete path example, use Windows-style paths such as `C:\src\BitNet-b1.58-Sharp` or repository-relative paths such as `src\BitNetSharp.Core`.
+
+### Keep repository-local validation authoritative
+
+Use the repository solution for the standard validation flow:
+
+```powershell
+dotnet build BitNet-b1.58-Sharp.slnx
+dotnet test BitNet-b1.58-Sharp.slnx
+```
+
+If a change affects user-facing behavior, diagnostics, benchmarks, or fixtures, update the relevant tests or documentation alongside the code.
+
+### Keep GitBook navigation in sync
+
+When you add, remove, or rename pages under `docs\`, update both `docs\README.md` and `docs\SUMMARY.md` in the same change so the documentation map stays accurate.
+
+## Review checklist
+
+Before opening a pull request, confirm the following:
+
+- The change keeps the repository aligned to BitNet b1.58 and the current .NET application surface.
+- The change does not add domain-specific defaults to the core runtime or benchmark story.
+- New or updated documentation uses American English and Windows-first instructions when concrete shell examples are needed.
+- Documentation navigation files were updated if the contents of `docs\` changed.
+- The repository still builds and tests cleanly with the standard solution commands.
@@ -0,0 +1,45 @@
+planning:
+  high-priority:
+  - id: PLAN-ROADMAP-001
+    title: Execute post-alignment BitNet implementation roadmap
+    note: 'Recommended order: complete tasks 1 through 6 first to clear the Phase 3 bottleneck, then implement export and interoperability, then finish chain-bucket production work and CI/test-lane separation.'
+    done: true
+    completed: 2026-03-23
+    description:
+    - Carry the repository from architecture scaffolding into a production-capable training, evaluation, serialization, and speculative decoding workflow.
+    - 'The immediate bottleneck is Phase 3: a real training core and token-sequence data pipeline.'
+    - This item captures the concrete next work after the completed repo-alignment/docs phase and the recent teacher-forced continuation training fix.
+    done-summary: Completed all roadmap tasks. The final slice added a shared paper-model snapshot, repo-authored GGUF save/load, .gguf model loading, and a minimal export command, with targeted GGUF regressions plus a full dual-target test pass.
+    remaining: All roadmap tasks are complete. The latest completion added shared snapshot-backed JSON checkpoint and GGUF state capture, repo-authored .gguf import/export, app-level .gguf loading, and verified round-trip coverage.
+    technical-details:
+    - Phase 1 repo-alignment and documentation work is complete.
+    - BitLinear and the transformer skeleton are mostly present already.
+    - The training core now reaches beyond output-head-only updates by applying AdamW updates to the paper model's final RMSNorm scale alongside the output head.
+    - Repo-authored GGUF export/import now complements the repo-local JSON checkpoint through a shared paper-model snapshot layer and strict tensor/metadata validation.
+    - Default validation now stays fast while expensive training and benchmark checks run in a dedicated SlowLane category and benchmark-report CI lane.
+    technical-requirements:
+    - Preserve the teacher-forced rolling-context training fix already present in BitNetPaperModel.
+    - Avoid direct edits to docs/todo.yaml; keep the roadmap synced through the MCP todo API.
+    - Keep the first loader/token pipeline compatible with the existing BitNetTokenizer and vocabulary semantics.
+    - Prefer new Training/* files and narrow adapters over broad rewrites of dirty worktree files.
+    implementation-tasks:
+    - task: Create src/BitNetSharp.Core/Training/ with BitNetTrainingOptions, TrainingBatch, CrossEntropyLoss, AdamWOptimizer, and a trainer that owns the loop instead of BitNetPaperModel.
+      done: true
+    - task: Implement a real BitNetDataLoader for packed fixed-length token batches and held-out splits, and cover it with loader tests under tests/BitNetSharp.Tests/.
+      done: true
+    - task: Extend paper-model training beyond output-head-only updates to STE/AdamW-style updates across deeper transformer parameters.
+      done: true
+    - task: Add a small-corpus path first in scripts/ plus repo-local fixtures and tests, before attempting larger SlimPajama or RedPajama-scale ingestion.
+      done: true
+    - task: Extend TrainingReport to include validation metrics, checkpoint cadence, and evaluation summaries rather than loss history alone.
+      done: true
+    - task: Wire periodic evaluation through BitNetBenchmarkFixtures for WikiText2, C4, and RedPajama fixture slices during training.
+      done: true
+    - task: Add a dedicated training CLI surface in src/BitNetSharp.App/Program.cs so training configuration and execution are first-class.
+      done: true
+    - task: Implement model export and import beyond the repo-local JSON checkpoint, with GGUF or an intermediate bridge format as the next target.
+      done: true
+    - task: Finish chain-bucket productionization with chain-buckets.bin persistence, threshold-based acceptance metrics, and benchmark reporting for acceptance rate and tokens per second.
+      done: true
+    - task: Split expensive training and benchmark validations into a separate category or CI lane so the default suite stays fast.
+      done: true