b8068
ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k Kernel (#19132)
Updated repack.cpp
Updated repack.cpp
Updated repack.cpp
Added if condition to support only vector length 256.
Changed the format removed comments and duplicate variable
If SVE 256 not present then was using generic function to compute, hence slowing the performance.
So added code if SVE 256 is not present then use NEON code.
- Code format change suggestion
Co-authored-by: Vithule, Prashant Prashant.Vithule@fujitsu.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: