b9006

May 2, 2026
Meta/llama.cpp · CLI · vb9006

opencl: Adreno optimization for MoE - MxFP4 (#22301)

  • Add MoE MXFP4 CLC kernel; router reorder on GPU

  • Pass test-backend-ops for MoE MXFP4 Adreno CLC

  • remove putenv in llama-model.cpp

  • fix indent style and whitespace

  • opencl: remove unnecessary headers

  • opencl: do not save cl_program objects

  • opencl: remove unnecessary assert

  • fix precision issue
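
For readers unfamiliar with the format named in these commits, here is a minimal C++ sketch of how one MXFP4 block can be decoded on the CPU. It follows the general microscaling layout (32 four-bit E2M1 values sharing one E8M0 power-of-two scale). The struct name `Mxfp4Block`, the function `dequantize`, and the nibble order (low nibble first) are assumptions made for illustration; they are not taken from this release, and the actual ggml layout and the Adreno kernel may differ.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstdint>

// Illustrative only: names and nibble order are hypothetical, not ggml's API.
constexpr int kBlockSize = 32;  // elements that share one scale

struct Mxfp4Block {
    uint8_t scale_e8m0;              // shared scale: 2^(scale_e8m0 - 127)
    uint8_t packed[kBlockSize / 2];  // two 4-bit E2M1 codes per byte
};

// All 16 E2M1 codes: bit 3 is the sign, bits 0..2 select the magnitude.
constexpr float kE2M1[16] = {
     0.0f,  0.5f,  1.0f,  1.5f,  2.0f,  3.0f,  4.0f,  6.0f,
    -0.0f, -0.5f, -1.0f, -1.5f, -2.0f, -3.0f, -4.0f, -6.0f,
};

// Expand one block to 32 floats.
std::array<float, kBlockSize> dequantize(const Mxfp4Block& b) {
    // 0xFF is the reserved NaN encoding for E8M0; this sketch does not handle it.
    assert(b.scale_e8m0 != 0xFF);
    const float scale = std::ldexp(1.0f, static_cast<int>(b.scale_e8m0) - 127);

    std::array<float, kBlockSize> out{};
    for (int i = 0; i < kBlockSize / 2; ++i) {
        // Assumed order: low nibble is element 2*i, high nibble is element 2*i + 1.
        out[2 * i]     = kE2M1[b.packed[i] & 0x0F] * scale;
        out[2 * i + 1] = kE2M1[b.packed[i] >> 4] * scale;
    }
    return out;
}
```

A GPU kernel would typically fuse this decoding into the matrix multiply rather than materialize the floats; the sketch only shows the arithmetic per block.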


Co-authored-by: Li He <lih@qti.qualcomm.com>