Back to feed

b8267

Mar 11, 2026
Meta/llama.cppCLIvb8267

metal: handle command buffer failures gracefully in synchronize (#20306)

Replace GGML_ABORT("fatal error") in ggml_metal_synchronize() with error flag + return. This aligns synchronize error handling with graph_compute, which already returns GGML_STATUS_FAILED for the same condition.

When a command buffer fails (e.g., iOS GPU access revocation during backgrounding, macOS eGPU disconnect, OOM), the backend enters an error state instead of killing the host process. Subsequent graph_compute calls return GGML_STATUS_FAILED immediately. Recovery requires recreating the backend.

Failed extra command buffers are properly released on the error path to avoid Metal object leaks.

macOS/iOS:

Linux:

Windows:

openEuler: