ggml-cpu: add 128-bit RVV implementation for Quantization Vector Dot by taimur-10x · Pull Request #9 · riseproject-dev/llama.cpp

taimur-10x · 2026-02-13T15:47:57Z

Summary

This PR adds RVV 128-bit implementations for quantized vector dot kernels.

Key Changes

Added the following RVV kernels:

Kernel	VLEN
ggml_vec_dot_iq1_s_q8_K	128
ggml_vec_dot_iq1_m_q8_K	128
ggml_vec_dot_iq2_xs_q8_K	128
ggml_vec_dot_iq3_s_q8_K	128
ggml_vec_dot_iq3_xxs_q8_K	128
ggml_vec_dot_iq4_xs_q8_K	128
ggml_vec_dot_tq1_0_q8_K	128
ggml_vec_dot_tq2_0_q8_K	128
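
For orientation, each kernel above computes the dot product of a run of weights stored in the named quantized format against activations quantized to q8_K. The scalar sketch below shows the general shape of such a kernel for a ternary format. The names `toy_tq_block`, `toy_q8_block`, `TOY_QK` and `toy_vec_dot_tq_q8` are hypothetical, and the layout is deliberately simpler than ggml's real block structures. It only illustrates the unpack, integer-accumulate and scale steps that an RVV implementation would vectorize.

```c
#include <stdint.h>

#define TOY_QK 32  /* hypothetical block size, not ggml's QK_K */

/* Hypothetical ternary block: four 2-bit codes per byte, code c encodes c - 1. */
typedef struct {
    uint8_t qs[TOY_QK / 4];
    float   d;               /* per-block scale */
} toy_tq_block;

/* Hypothetical 8-bit activation block. */
typedef struct {
    int8_t qs[TOY_QK];
    float  d;                /* per-block scale */
} toy_q8_block;

/* Scalar reference: sum over blocks of d_x * d_y * sum_i (code_i - 1) * y_i. */
static float toy_vec_dot_tq_q8(int nblocks, const toy_tq_block *x, const toy_q8_block *y) {
    float sumf = 0.0f;
    for (int b = 0; b < nblocks; ++b) {
        int32_t sumi = 0;
        for (int i = 0; i < TOY_QK; ++i) {
            /* Element i lives in byte i / 4, bit pair i % 4. */
            const int code = (x[b].qs[i / 4] >> (2 * (i % 4))) & 3;
            sumi += (code - 1) * y[b].qs[i];
        }
        /* Apply both scales once per block, after the integer accumulation. */
        sumf += x[b].d * y[b].d * (float) sumi;
    }
    return sumf;
}
```

Keeping the inner accumulation in integers and applying the scales once per block is what makes this pattern a natural fit for widening vector multiply-accumulate instructions.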

Testing

The kernels were functionally tested with test-quantize-fns under QEMU at VLEN=128.
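
A functional test of this kind generally compares the optimized kernel's result with a scalar reference and accepts it when the error stays within a tolerance. A minimal sketch of such a check follows. The helper name `toy_dot_close`, the normalization by element count and the tolerance parameter are assumptions for illustration and are not taken from test-quantize-fns.

```c
#include <stdbool.h>

/* Returns true when the absolute difference between the optimized result
 * and the reference, averaged over n elements, is at most max_err.
 * Hypothetical helper; the real test may measure error differently. */
static bool toy_dot_close(float got, float ref, int n, float max_err) {
    float diff = got - ref;
    if (diff < 0.0f) {
        diff = -diff;
    }
    return diff / (float) n <= max_err;
}
```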

Future Work

Subsequent PRs will extend the existing RVV quantization kernels to higher VLENs (512-bit and 1024-bit).
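
Supporting several VLENs usually means choosing an implementation from the hardware vector length. The sketch below shows one way that selection could look, assuming a toolchain whose RVV intrinsics provide `__riscv_vlenb()`. The names `toy_rvv_vlen_bits`, `toy_impl` and `toy_select_impl` are hypothetical and do not describe how this PR structures its dispatch.

```c
#include <stddef.h>

#if defined(__riscv_v_intrinsic)
#include <riscv_vector.h>
#endif

/* Vector register width in bits, or 0 when built without RVV intrinsics. */
static size_t toy_rvv_vlen_bits(void) {
#if defined(__riscv_v_intrinsic)
    return (size_t) __riscv_vlenb() * 8;  /* vlenb is VLEN in bytes */
#else
    return 0;
#endif
}

typedef enum {
    TOY_IMPL_GENERIC,  /* portable scalar fallback */
    TOY_IMPL_RVV128    /* kernel written for VLEN = 128 */
} toy_impl;

/* Pick an implementation for the given VLEN. Kernels for higher VLENs
 * would add further cases here; anything unrecognised falls back. */
static toy_impl toy_select_impl(size_t vlen_bits) {
    switch (vlen_bits) {
        case 128: return TOY_IMPL_RVV128;
        default:  return TOY_IMPL_GENERIC;
    }
}
```

On a build without the V extension the query returns 0, so the selection falls through to the generic path.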

rehan-10xengineer · 2026-03-16T10:58:34Z

Opened the PR upstream here.

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

taimur-10x marked this pull request as draft February 13, 2026 15:48

github-actions bot added the ggml label Feb 13, 2026

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch from 51b400b to 8bd5cbe February 14, 2026 16:51

rehan-10xengineer force-pushed the 10x/riscv-quant-vec-dot-128b branch from 21b6845 to a22b149 February 24, 2026 12:15

taimur-10x changed the base branch from master to 10x/riscv-quant March 4, 2026 11:26

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch 2 times, most recently from f8f9384 to 2785c94 March 4, 2026 11:41

taimur-10x marked this pull request as ready for review March 4, 2026 11:47

taimur-10x requested a review from david-baker-808 March 10, 2026 00:26

taimur-10x assigned taimur-10x and rehan-10xengineer Mar 10, 2026

rehan-10xengineer force-pushed the 10x/riscv-quant branch 3 times, most recently from 9ca80fc to 68e3cee March 13, 2026 15:04

rehan-10xengineer force-pushed the 10x/riscv-quant-vec-dot-128b branch from 2785c94 to f83ddf7 March 16, 2026 10:50

github-actions bot added the documentation, testing, Nvidia GPU, Apple Metal, SYCL, Vulkan, examples, devops, python, script, server, model, and OpenCL labels Mar 16, 2026

rehan-10xengineer force-pushed the 10x/riscv-quant-vec-dot-128b branch from f83ddf7 to c7c6abc March 16, 2026 10:55

rehan-10xengineer changed the base branch from 10x/riscv-quant to master March 16, 2026 11:17

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch from c7c6abc to d618925 March 16, 2026 12:15

taimur-10x removed the documentation, testing, Nvidia GPU, Apple Metal, SYCL, Vulkan, examples, devops, python, script, server, model, and OpenCL labels Mar 16, 2026

taimur-10x and others added 2 commits March 18, 2026 16:59

ggml-cpu: add 128-bit impls for i-quants, ternary quants

2fe760f

ggml-cpu: add 128-bit impls for iq2_xs, iq3_s, iq3_xxs, tq2_0

4b12d40

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch from d618925 to cf95828 March 18, 2026 12:19

ggml-cpu: refactor; add rvv checks

05a5425

taimur-10x force-pushed the 10x/riscv-quant-vec-dot-128b branch from cf95828 to 05a5425 March 18, 2026 12:47

taimur-10x merged commit 92dc6b1 into master Mar 18, 2026
32 of 51 checks passed

taimur-10x merged 3 commits into master from 10x/riscv-quant-vec-dot-128b