b7760
ggml : extend ggml_pool_1d + metal (#16429)
chore: resolve conflicts
feat: ggml metal impl
fix: ggml_metal_kargs_pool_1d struct
fix: require contiguous input
chore: test pool_1d
chore: limit pool1d test cases to p0=0 and s0=k0 to conform with asserts
chore: add p0 and s0 to testing
fix: allow padding for cpu and metal
Update ggml/src/ggml-metal/ggml-metal.metal
fix: correct single-threaded loop
ggml : cleanup
tests : add ne[1] != 1 tests
fix: ne[1] handling in np
cont : fixes
Co-authored-by: Georgi Gerganov ggerganov@gmail.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: