b7592
metal : add count_equal op (#18314)
add count equal for metal
remove trailing whitespace
updated doc ops table
changed shmem to i32
added multi tg and templating
removed BLAS support from Metal docs
Apply suggestions from code review
Co-authored-by: Georgi Gerganov ggerganov@gmail.com
add memset to set dst to 0
metal : cleanup
Co-authored-by: Georgi Gerganov ggerganov@gmail.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: