forked from ggml-org/llama.cpp
Pull requests: PrismML-Eng/llama.cpp
#10 · (Performance) Optimized x86 and generic q1_0(_g128) dot · labels: ggml · opened Apr 3, 2026 by pl752
#7 · fix: Q1_0_g128 x86 CPU kernel — float truncation + AVX2 vectorization · labels: ggml · opened Apr 2, 2026 by wildcattrio · 4 tasks done
#6 · fix: Q1_0_g128 x86 CPU kernel - correct output + AVX2/AVX-512 VNNI · labels: ggml · opened Apr 2, 2026 by stfurkan
#5 · Fixes for CPU backend + instructions for targetting AMD GPUs · labels: ggml · opened Apr 2, 2026 by philtomson
#4 · fix: Q1_0_g128 CPU dot product int truncation · labels: ggml · opened Apr 2, 2026 by Marxist-Leninist
#3 · fix: Q1_0_g128 CPU kernel - correct output and AVX-512 SIMD · labels: ggml · opened Apr 1, 2026 by jordankzf
#2 · feat: port TQ3_0 KV cache from llama-turboquant · labels: examples, ggml, Nvidia GPU · opened Apr 1, 2026 by carlosfundora