Composable parsers and stream processing utilities for LLM responses.
- 🧠 Thinking extraction — Parse and separate `<think>` reasoning sections from visible output, chunk by chunk
- 🧼 XML stream filtering — Scrub context blocks and privacy tags from streaming output
- 🛠️ Tool-call extraction — Extract and validate structured XML tool invocations
- 🏛️ Structured output — JSON parsing with schema validation, depth/key limits, and auto-repair
- 🚰 Stream processor — Event-driven orchestrator that composes all parsers in a single pipeline
- 🔌 Normalizers — Adapters for OpenAI, Anthropic, Gemini, Mistral, Cohere, Ollama, AWS Bedrock, and HF TGI
- 👮‍♂️ Safety by default — Privacy tags are always scrubbed; JSON depth, key counts, and tool-call sizes are bounded
```sh
npm install @selfagency/llm-stream-parser
# or
pnpm add @selfagency/llm-stream-parser
# or
yarn add @selfagency/llm-stream-parser
```

Requirements: Node.js 18+, TypeScript 5.0+ (if using TypeScript).
```ts
import { LLMStreamProcessor } from '@selfagency/llm-stream-parser/processor';

const processor = new LLMStreamProcessor({
  parseThinkTags: true,
  knownTools: new Set(['search', 'edit_file']),
});

processor.on('thinking', delta => process.stdout.write(`[thinking] ${delta}`));
processor.on('text', delta => process.stdout.write(delta));
processor.on('tool_call', call => executeToolCall(call));

for await (const chunk of apiStream) {
  processor.process({ content: chunk.content, done: chunk.done });
}
```

Chunk-by-chunk extraction of `<think>` blocks. Returns `[thinkingContent, regularContent]` on every call.
```ts
import { ThinkingParser } from '@selfagency/llm-stream-parser/thinking';

const parser = new ThinkingParser();

for await (const chunk of llmStream) {
  const [thinking, content] = parser.addContent(chunk);
  if (thinking) showReasoning(thinking);
  if (content) showOutput(content);
}

const [finalThinking, finalContent] = parser.flush();
```

Automatic tag detection for common models:

```ts
const parser = ThinkingParser.forModel('deepseek'); // <think></think>
const parser = ThinkingParser.forModel('granite'); // <|thinking|></|thinking|>
```

Stream-safe scrubbing of XML context and privacy blocks.
```ts
import { createXmlStreamFilter } from '@selfagency/llm-stream-parser/xml-filter';

const filter = createXmlStreamFilter({ enforcePrivacyTags: true });

for await (const chunk of llmStream) {
  output.write(filter.write(chunk));
}
output.write(filter.end());
```

Privacy tags are enforced by default (`enforcePrivacyTags: true`); pass `enforcePrivacyTags: false` to opt out explicitly.
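The stream-safe guarantee means a privacy tag split across chunk boundaries is still scrubbed. A minimal self-contained sketch of that buffering idea, using a fixed `<private>` tag (the real `XmlStreamFilter` handles arbitrary tags; `PrivateTagScrubber` here is illustrative only):

```ts
// Illustrative: scrub <private>…</private> spans from a stream, holding back
// any tail that could be the start of a tag split across chunks.
class PrivateTagScrubber {
  private buf = '';
  private inPrivate = false;

  write(chunk: string): string {
    this.buf += chunk;
    let out = '';
    for (;;) {
      if (this.inPrivate) {
        const close = this.buf.indexOf('</private>');
        if (close === -1) {
          // Keep only the tail that could be a partial '</private>'.
          this.buf = this.buf.slice(-9);
          return out;
        }
        this.buf = this.buf.slice(close + 10);
        this.inPrivate = false;
      } else {
        const open = this.buf.indexOf('<private>');
        if (open === -1) {
          // Emit everything except a tail that could start '<private>'.
          const tail = this.buf.lastIndexOf('<');
          const holdFrom =
            tail !== -1 &&
            this.buf.length - tail < 9 &&
            '<private>'.startsWith(this.buf.slice(tail))
              ? tail
              : this.buf.length;
          out += this.buf.slice(0, holdFrom);
          this.buf = this.buf.slice(holdFrom);
          return out;
        }
        out += this.buf.slice(0, open);
        this.buf = this.buf.slice(open + 9);
        this.inPrivate = true;
      }
    }
  }

  end(): string {
    // Anything still buffered inside an unterminated tag is dropped.
    const rest = this.inPrivate ? '' : this.buf;
    this.buf = '';
    return rest;
  }
}
```

The key design point is the hold-back: a chunk ending in `<pri` emits nothing for those characters until the next chunk resolves whether they open a privacy tag.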
```ts
import {
  splitLeadingXmlContextBlocks,
  dedupeXmlContextBlocksByTag,
  stripXmlContextTags,
} from '@selfagency/llm-stream-parser/context';

const { contextBlocks, remaining } = splitLeadingXmlContextBlocks(response);
const unique = dedupeXmlContextBlocksByTag(contextBlocks);
const clean = stripXmlContextTags(remaining);
```

```ts
import { extractXmlToolCalls, buildXmlToolSystemPrompt } from '@selfagency/llm-stream-parser/tool-calls';

// Extract tool calls from a response
const calls = extractXmlToolCalls(response, new Set(['search', 'edit_file']));
for (const call of calls) {
  await executeTool(call.name, call.parameters);
}

// Build the system prompt that teaches the model to emit tool calls
const systemPrompt = buildXmlToolSystemPrompt([
  { name: 'search', description: 'Search the web', inputSchema: { properties: { query: { type: 'string' } }, required: ['query'] } },
  { name: 'edit_file', description: 'Edit a file' },
]);
```

`buildXmlToolSystemPrompt` throws on invalid tool names; `extractXmlToolCalls` never throws and silently drops malformed calls.
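A minimal self-contained sketch of that never-throws contract. The `<tool_call name="…">` wire shape and the `extractToolCallsSketch` helper are illustrative assumptions, not the library's actual format:

```ts
interface ToolCall {
  name: string;
  parameters: Record<string, string>;
}

// Illustrative: scan for <tool_call name="…">…</tool_call> blocks, keep only
// known tool names, and read each child element as a string parameter.
// Anything malformed simply fails to match and is dropped — nothing throws.
function extractToolCallsSketch(text: string, known: Set<string>): ToolCall[] {
  const calls: ToolCall[] = [];
  const block = /<tool_call name="([^"]+)">([\s\S]*?)<\/tool_call>/g;
  for (const m of text.matchAll(block)) {
    const [, name, body] = m;
    if (!known.has(name)) continue; // unknown tool: silently dropped
    const parameters: Record<string, string> = {};
    for (const p of body.matchAll(/<(\w+)>([\s\S]*?)<\/\1>/g)) {
      parameters[p[1]] = p[2];
    }
    calls.push({ name, parameters });
  }
  return calls;
}
```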
```ts
import { parseJson, validateJsonSchema } from '@selfagency/llm-stream-parser/structured';

// Tolerant parse — returns null on failure, never throws
const data = parseJson(responseText, { maxJsonDepth: 10, maxJsonKeys: 100 });

// Schema validation — returns a discriminated union
const result = validateJsonSchema(responseText, {
  type: 'object',
  properties: { name: { type: 'string' }, age: { type: 'integer' } },
  required: ['name'],
});

if (result.success) {
  console.log(result.data);
} else {
  console.error(result.errors);
}
```

Additional utilities: `buildFormatInstructions`, `buildRepairPrompt`, `streamJson`, `zodToJsonSchema`, `validateWithZod`, `repairWithLLM`, `pipe`.
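The depth and key bounds accepted by `parseJson` can be pictured as a tolerant parse followed by a budget check. This is a sketch of assumed semantics (option names and traversal here are illustrative, not the library's implementation):

```ts
// Illustrative: parse without throwing, then reject payloads that exceed a
// nesting-depth or total-key budget — a guard against adversarial JSON.
function parseJsonBounded(
  text: string,
  opts: { maxDepth: number; maxKeys: number },
): unknown {
  let value: unknown;
  try {
    value = JSON.parse(text);
  } catch {
    return null; // tolerant: malformed input yields null, never an exception
  }
  let keys = 0;
  const withinLimits = (v: unknown, depth: number): boolean => {
    if (depth > opts.maxDepth) return false;
    if (v !== null && typeof v === 'object') {
      for (const child of Object.values(v as object)) {
        keys += 1;
        if (keys > opts.maxKeys) return false;
        if (!withinLimits(child, depth + 1)) return false;
      }
    }
    return true;
  };
  return withinLimits(value, 1) ? value : null;
}
```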
Normalize streaming events from different providers into a common `StreamChunk` shape:

```ts
import { normalizeOpenAI } from '@selfagency/llm-stream-parser/normalizers';

for await (const event of openaiStream) {
  const { chunk } = normalizeOpenAI(event);
  if (chunk) processor.process(chunk);
}
```

Supported: `openai`, `openaiResponses`, `anthropic`, `gemini`, `mistral`, `cohere`, `ollama`, `bedrock`, `hfTgi`.
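For a provider not on this list, a hand-rolled normalizer only needs to produce the `{ content, done }` chunk shape consumed by `processor.process()` in the quick-start example. The provider event shape below is hypothetical:

```ts
interface StreamChunk {
  content: string;
  done: boolean;
}

// Hypothetical provider event — stand-in for whatever your SDK emits.
interface MyProviderEvent {
  delta?: { text?: string };
  finished?: boolean;
}

// Map a provider event to the common chunk shape, or null if there is
// nothing worth forwarding (no text and not a terminal event).
function normalizeMyProvider(event: MyProviderEvent): { chunk: StreamChunk | null } {
  const content = event.delta?.text ?? '';
  const done = event.finished === true;
  if (!content && !done) return { chunk: null };
  return { chunk: { content, done } };
}
```

Returning `{ chunk: null }` for empty keep-alive events mirrors the `if (chunk)` guard shown above.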
```ts
import { createGenericAdapter } from '@selfagency/llm-stream-parser/adapters';

const adapter = createGenericAdapter(
  {
    onContent: text => display(text),
    onThinking: text => displayReasoning(text),
    onToolCall: call => executeToolCall(call),
  },
  { parseThinkTags: true, scrubContextTags: true },
);

await adapter.write(chunk);
await adapter.end();
```

Formatting helpers:

```ts
import { sanitizeNonStreamingModelOutput, formatXmlLikeResponseForDisplay } from '@selfagency/llm-stream-parser/formatting';
```

Markdown helpers:

```ts
import { appendToBlockquote } from '@selfagency/llm-stream-parser/markdown';
```

| Category | Behaviour |
|---|---|
| Streaming / parsing (`parseJson`, `ThinkingParser`, `XmlStreamFilter`, `LLMStreamProcessor`) | Never throw. Return best-effort results; malformed input is silently skipped. |
| Configuration (`buildXmlToolSystemPrompt`) | Throw `Error` on invalid arguments (caught at setup time). |
| Validation (`validateJsonSchema`) | Return `{ success: true; data }` or `{ success: false; errors }` — never throw. |
```sh
pnpm install

task check-types   # TypeScript type check
task unit-tests    # Run Vitest suite
task lint          # oxlint
task format        # oxfmt
task compile       # tsup → dist/
task precommit     # check-types + lint-fix + format
```

- Fork and clone the repository
- Create a branch: `feat/your-feature`, `fix/your-fix`, etc.
- Make changes with colocated tests (`module.test.ts` next to source)
- Run `task precommit` before pushing
- Open a pull request
See docs/developers/contributing.md for full details.
MIT © 2026 The Self Agency, LLC