bitnet.cpp is based on the llama.cpp framework and is optimized specifically for text-only inference, with specialized kernels for 1-bit matrix operations.
The README includes build instructions at BitNet/README.md, lines 193 to 199 in 404980e.
Does that mean that we have to recompile BitNet for each model?
Is there no way to get a single binary that can run all 1.58-bit models?
Thanks!