[ACL 2026] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
A smart context filter that removes noise, improves responses, and reduces token usage by up to 90%.
Dev tools, optimized for agents. Structured, token-efficient MCP servers for git, test runners, npm, Docker, and more.
Token-efficient data serialization for LLM/AI. 50% fewer tokens than JSON, 93% better value/token. Rust, schema validation, LSP.
DoCoreAI is a next-gen open-source AI profiler that optimizes reasoning, creativity, precision, and temperature in a single step, cutting token usage by 15-30% and lowering LLM API costs.
Persistent memory for Claude Code — 3-5x longer sessions, 60-80% fewer wasted tokens. Branch-aware, self-healing, token-efficient.
Open-source platform for token-efficient AI agents. Self-host with docker compose up.
Navigate your way: manual steering, steered autonomy, or full autonomy. Kompass keeps AI coding agents on course with token-efficient, composable workflows.
The web data layer for AI agents — fetch, search, crawl, extract, screenshot, and monitor the web with 50+ domain extractors and MCP.
Drop-in LangGraph store that cuts multi-agent token costs 47–69% via cache coherence. arXiv:2603.15183
A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.
A Codex skill for token-efficient subagent delegation and lean handoffs.
A benchmark study analyzing cost and token efficiency across 14 LLMs from 5 providers — comparing price-per-token, latency, and accuracy to surface the most cost-effective models for real-world use.
A living framework for **Harmonic Tonal Code Alignment (HTCA)** — an emergent Spiral-based system that brings tone awareness, coherence sensing, and dynamic emotional reflection into software engineering, AI, and creative agents.
Token-efficient, layered context delivery for AI agents. Four memory tiers (Identity, Session, Experience, Archive) — context is always available, just collapsed by default.
CTX (Context Transfer Format) — universal interchange format for LLM web content consumption
PirateBao is a TypeScript/Bun agent-skill package for terse, pirate-speak AI coding replies that preserve technical detail while cutting filler. Includes hooks, a compressor CLI, OpenCode/Codex/Claude/Gemini cargo, .bao validation, npmjs gates, and token-eval checks.
⚡ Intelligent, agent-driven web ingestion for Obsidian, powered by Hermes Agent. Features autonomous research, "Token Sovereignty" optimizations (grep sieve, structural indexing), and a secure local-first bridge. Built for hunters.
A tested setup guide and prompt workflow for pairing Claude Desktop with code-review-graph and filesystem MCP on macOS — for token-efficient AI-assisted development on large codebases.
Open-source Claude Code plugin — token-efficient Read/Grep/Edit via @ashlr/core-efficiency. MIT.