b7508
server: prevent data race from HTTP threads (#18263)
server: prevent data race from HTTP threads
fix params
fix default_generation_settings
nits: make handle_completions_impl looks less strange
stricter const
fix GGML_ASSERT(idx < states.size())
move index to be managed by server_response_reader
http: make sure req & res lifecycle are tied together
fix compile
fix index handling buggy
fix data race for lora endpoint
nits: fix shadow variable
nits: revert redundant changes
nits: correct naming for json_webui_settings
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: