Pinned Loading
-
llama.cpp-1-bit-turbo
llama.cpp-1-bit-turbo PublicForked from ggml-org/llama.cpp
HIP/ROCm fork optimized for AMD RDNA2 (gfx1030) with PrismML Q1_0_G128 1-bit quant support, RotorQuant, TurboQuant, EAGLE3 and P-EAGLE speculative decoding, and full Wave32 kernel optimizations.
C++ 11
-
sglang-1-bit-turbo
sglang-1-bit-turbo PublicForked from sgl-project/sglang
AMD ROCm (gfx1030) inference fork with RotorQuant/TurboQuant KV compression, PHANTOM-X zero-copy draft speculation, EAGLE3 speculative decoding, 12 RDNA2 crash fixes, and PrismML Bonsai Q1_0_G128 1…
Python 5
-
vllm-1-bit-turbo
vllm-1-bit-turbo PublicForked from vllm-project/vllm
HIP/ROCm fork optimized for AMD RDNA2 (gfx1030) with EAGLE3 speculative decoding, TurboQuant KV compression, PrismML Bonsai Q1_0_G128 1-bit GGUF support, and gfx1031 compatibility enablement.
Python 1
-
litellm-turbo
litellm-turbo PublicForked from BerriAI/litellm
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Python
-
ATLAS
ATLAS PublicForked from itigges22/ATLAS
Adaptive Test-time Learning and Autonomous Specialization
Python 1
If the problem persists, check the GitHub status page or contact support.



