b8008

Feb 12, 2026

Meta/llama.cppCLIvb8008

hexagon: further optimization and tuning of matmul and dot kernels (#19407)

ggml-hexagon: implement 2x2 matmul kernel
hexmm: implement vec_dot_rx2x2 for Q8_0 and MXFP4
hexagon: fix editor config failures
hexagon: refactor matmul ops to use context struct and remove wrappers

Also implement vec_dot_f16 2x2

hexagon: refactor dyn quantizers to use mmctx
hexagon: remove mm fastdiv from op_ctx
hexagon: refactor matmul entry point to reduce code duplication

Co-authored-by: Trivikram Reddy tamarnat@qti.qualcomm.com

macOS/iOS:

Linux:

Windows:

openEuler:

← Back to feed