b9148

May 14, 2026
Meta/llama.cpp · CLI · vb9148

unicode,test: add Qwen3.5 non-backtracking tokenizer handler and regr… (#22110)

  • unicode,test: add Qwen3.5 non-backtracking tokenizer handler and regression tests

This mirrors the Qwen2 fix (commit 0d049d6) but adapts it to Qwen3.5's regex. It ensures robust Unicode tokenization and prevents std::regex stack overflows.

Closes #21919.

  • fix: enhance regex handling for Qwen3.5 tokenizer to include accent marks

  • cont : remove trailing whitespace


Co-authored-by: Kabir <kabir@example.com>
Co-authored-by: Alde Rojas <hello@alde.dev>
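
For readers unfamiliar with the idea behind a "non-backtracking handler", the following is a minimal sketch, not the code from this change. It assumes a deliberately simplified, ASCII-only rule (split text into maximal runs of letters, digits, whitespace, or other characters) rather than the real Qwen3.5 pattern, and the names `CharClass`, `classify`, and `split_runs` are hypothetical. The point it illustrates is that a hand-written forward scan uses constant stack depth, whereas a backtracking regex engine may recurse in proportion to the input length.

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical character classes for this sketch (ASCII only; the real
// tokenizer works on Unicode categories, which are not modelled here).
enum class CharClass { Letter, Digit, Space, Other };

static CharClass classify(unsigned char c) {
    if (std::isalpha(c)) return CharClass::Letter;
    if (std::isdigit(c)) return CharClass::Digit;
    if (std::isspace(c)) return CharClass::Space;
    return CharClass::Other;
}

// Split `text` into maximal runs of characters that share one class.
// The scan only moves forward and never recurses, so stack usage stays
// constant no matter how long the input is.
std::vector<std::string> split_runs(const std::string & text) {
    std::vector<std::string> pieces;
    std::size_t start = 0;
    while (start < text.size()) {
        const CharClass cls = classify(static_cast<unsigned char>(text[start]));
        std::size_t end = start + 1;
        // Extend the run while the next character has the same class.
        while (end < text.size() &&
               classify(static_cast<unsigned char>(text[end])) == cls) {
            ++end;
        }
        pieces.push_back(text.substr(start, end - start));
        start = end;
    }
    return pieces;
}
```

Under these assumptions, `split_runs("abc 123!!")` yields the four pieces `abc`, a single space, `123`, and `!!`, and a one-million-character run of the same letter comes back as a single piece without deep recursion.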
