Back to feed

b8030

Feb 14, 2026
Meta/llama.cppCLIvb8030

CUDA: Do not mutate cgraph for fused ADDs (#19566)

  • Do not mutate cgraph for fused ADDs
  1. We should try to minimize in-place changes to the incoming ggml_cgraph where possible (those should happen in graph_optimize)
  2. Modifying in-place leads to an additional, unnecessary graph capture step as we store the properties before modifying the graph in-place in the cuda-backend
  • Assert ggml_tensor is trivially copyable

  • Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Aman Gupta amangupta052@gmail.com


Co-authored-by: Aman Gupta amangupta052@gmail.com

macOS/iOS:

Linux:

Windows:

openEuler: