b8477
mtmd: Add dynamic high-resolution image preprocessing for InternVL model (#20847)
added support for internvl's dynamic high-resolution (Qianfan-OCR needed)
add min/max dynamic patch to gguf meta
clean up
simplified handling min/max dynamic patch
reuse llava_uhd logic for slice images
provide default values for older models
flake8
prevent writing 0 value to gguf
remove duplicated resolution candidates with a better algorithm
fix indentation
format
add protection from divide by zero
change to 0 to be safe
Co-authored-by: Xuan Son Nguyen son@huggingface.co
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: