b8459

Mar 21, 2026
Meta/llama.cpp | CLI | vb8459

ggml-cpu: add always_inline to tinyBLAS_PPC accumulator saves (#20791)

Explicitly mark save_acc and add_save_acc with always_inline in tinyBLAS_PPC. This ensures the compiler keeps MMA accumulator disassembly within the kernel's register context, preventing unnecessary stack spills.
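To illustrate the idea, here is a minimal, portable sketch of forcing small "save the accumulator" helpers to be inlined into their calling kernel. It is not the tinyBLAS_PPC code: TileAcc, save_tile, add_save_tile, kernel_4x4 and the SKETCH_ALWAYS_INLINE macro are hypothetical names, and a plain 4x4 float array stands in for a hardware MMA accumulator. Whether inlining actually avoids spills depends on the compiler and target.

```cpp
#include <cassert>
#include <cstddef>

// Portable spelling of "force inline". The GCC/Clang attribute is the form
// the commit message refers to; the macro name itself is an assumption.
#if defined(_MSC_VER)
#  define SKETCH_ALWAYS_INLINE __forceinline
#else
#  define SKETCH_ALWAYS_INLINE inline __attribute__((always_inline))
#endif

// Hypothetical stand-in for a hardware accumulator: a 4x4 float tile.
struct TileAcc {
    float v[4][4];
};

// Overwrite a 4x4 block of C (row-major, leading dimension ldc) with the tile.
// Being force-inlined, the tile never has to be passed across a call boundary.
SKETCH_ALWAYS_INLINE void save_tile(const TileAcc &acc, float *C, std::size_t ldc) {
    assert(ldc >= 4);
    for (std::size_t i = 0; i < 4; ++i)
        for (std::size_t j = 0; j < 4; ++j)
            C[i * ldc + j] = acc.v[i][j];
}

// Add the tile into the existing contents of the same 4x4 block of C.
SKETCH_ALWAYS_INLINE void add_save_tile(const TileAcc &acc, float *C, std::size_t ldc) {
    assert(ldc >= 4);
    for (std::size_t i = 0; i < 4; ++i)
        for (std::size_t j = 0; j < 4; ++j)
            C[i * ldc + j] += acc.v[i][j];
}

// Toy kernel: computes the 4x4 product of A (4 x k, row-major) and
// B (k x 4, row-major), then either stores it to C or accumulates it into C.
void kernel_4x4(const float *A, const float *B, float *C,
                std::size_t k, std::size_t ldc, bool accumulate) {
    TileAcc acc = {};  // zero-initialized local accumulator
    for (std::size_t p = 0; p < k; ++p)
        for (std::size_t i = 0; i < 4; ++i)
            for (std::size_t j = 0; j < 4; ++j)
                acc.v[i][j] += A[i * k + p] * B[p * 4 + j];

    if (accumulate) {
        add_save_tile(acc, C, ldc);
    } else {
        save_tile(acc, C, ldc);
    }
}
```

With the helpers inlined, the compiler sees the accumulator's whole lifetime inside kernel_4x4 and is free to keep it in registers until the final stores, which is the effect the commit aims for with the real MMA accumulators.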

Signed-off-by: Shalini Salomi Bodapati <Shalini.Salomi.Bodapati@ibm.com>
