ggml-cpu: extend RVV quantization vec dot to higher VLENs by taimur-10x · Pull Request #10 · riseproject-dev/llama.cpp

taimur-10x · 2026-02-13T15:52:41Z

Summary

This PR adds RVV implementations for quantized vector dot kernels (for VLENs 512-bit and 1024-bit).

Key Changes

Added the following RVV kernels:

TODO

Testing

Kernels were functionally tested through test-quantize-fns for VLENs 512-bit and above on QEMU.

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

…iq2_xxs

taimur-10x marked this pull request as draft February 13, 2026 15:52

github-actions bot added the ggml label Feb 13, 2026

taimur-10x and others added 8 commits March 4, 2026 15:57

ggml-cpu: add rvv quantize_row_q8_K kernel

1440762

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

ggml-cpu: add rvv vec_dot for iq4_nl, mxfp4, iq2_xxs

481cb2e

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

ggml-cpu: add rvv vec_dot for iq4_xs, refactor

8cc2254

ggml-cpu: remove ifunc for rvv vec dot

3d682db

ggml-cpu: add vec_dot for iq2_xs, iq3_xxs

6769950

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

ggml-cpu: refactor quants.c

08cfbe0

ggml-cpu: add 128-bit impls for i-quants, ternary quants

97d1218

ggml-cpu: add 128-bit impls for iq2_xs, iq3_s, iq3_xxs, tq2_0

2785c94

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

taimur-10x changed the base branch from master to 10x/riscv-quant-vec-dot-128b March 4, 2026 11:48

taimur-10x and others added 4 commits March 4, 2026 21:59

ggml-cpu: add rvv 512b,1024b impls for iq4_xs

adb8b4e

ggml-cpu: refactor; add rvv 512b, 1024b impls for q6_K, i-quants

7f2bc2c

added 512 and 1024 implementations of tq3_s, iq3_xxs, iq2_s, iq2_xs, …

3326b3e

…iq2_xxs

ggml-cpu: improve iq2_xs impl for rvv 256b

bd69a20

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-vlens branch from 86ffc7e to bd69a20 Compare March 4, 2026 17:10

taimur-10x marked this pull request as ready for review March 4, 2026 17:11

taimur-10x assigned taimur-10x and rehan-10xengineer Mar 10, 2026

rehan-10xengineer force-pushed the 10x/riscv-quant-vec-dot-128b branch 2 times, most recently from f83ddf7 to c7c6abc Compare March 16, 2026 10:55

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch 3 times, most recently from cf95828 to 05a5425 Compare March 18, 2026 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml-cpu: extend RVV quantization vec dot to higher VLENs#10

ggml-cpu: extend RVV quantization vec dot to higher VLENs#10
taimur-10x wants to merge 12 commits into10x/riscv-quant-vec-dot-128bfrom
10x/riscv-quant-vec-dot-vlens

taimur-10x commented Feb 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

taimur-10x commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

taimur-10x commented Feb 13, 2026 •

edited

Loading