b7432

Dec 16, 2025
Meta/llama.cpp | CLI | vb7432

[!WARNING] Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
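For deployment scripts that need to keep working across the format change, one option is to pick the extractor from the archive's file extension. The following is a minimal sketch using only the Python standard library; the function name `extract_release` and any paths passed to it are hypothetical and do not come from the release itself.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release(archive: Path, dest: Path) -> None:
    """Unpack a release archive that may be either .zip or .tar.gz.

    Hypothetical helper: the name and behaviour are illustrative only.
    """
    name = archive.name.lower()
    if name.endswith((".tar.gz", ".tgz")):
        dest.mkdir(parents=True, exist_ok=True)
        with tarfile.open(archive, "r:gz") as tar:
            # Use the safer "data" extraction filter where the running
            # Python version provides it; fall back to the default otherwise.
            if hasattr(tarfile, "data_filter"):
                tar.extractall(dest, filter="data")
            else:
                tar.extractall(dest)
    elif name.endswith(".zip"):
        dest.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    else:
        raise ValueError(f"unsupported archive format: {archive.name}")
```

Dispatching on the extension lets the same script handle older `.zip` downloads and newer `.tar.gz` ones. One difference to be aware of: `tarfile` restores Unix permission bits such as the executable flag, while `zipfile.extractall` does not.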

Optimization: Qwen3 Next autoregressive pass (#17996)

  • It's Qwen3 Next, the lean mean token generation machine!

  • Apply patches from thread

  • Remove recurrent version, only keep chunked and autoregressive

  • Remove unnecessary conts and asserts

  • Remove more extra conts and asserts

  • Cleanup masking

macOS/iOS:

Linux:

Windows:

openEuler: