
b7582

Dec 30, 2025
Meta / llama.cpp / CLI / vb7582

sampling: reuse token data buffer in llama_sampler_sample (#18365)

  • sampling: reuse token data buffer in llama_sampler_sample

  • move cur buffer before timing section, after samplers

  • minor : fix build
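
The change named in the title is a general allocation-saving pattern: keep the per-token candidate buffer alive between sampling calls instead of building a new one each time. The following is a minimal sketch of that pattern only, not the code from the PR. `TokenData` and `GreedySampler` are hypothetical names invented for illustration, and the greedy pick stands in for whatever sampler chain is actually applied.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a per-token candidate record (assumption, not the llama.cpp type).
struct TokenData {
    int32_t id;
    float   logit;
    float   p;
};

// Hypothetical sampler that owns its candidate buffer so the allocation survives across calls.
struct GreedySampler {
    std::vector<TokenData> cur; // reused on every call to sample()

    // Returns the id of the token with the highest logit.
    int32_t sample(const std::vector<float> & logits) {
        assert(!logits.empty());

        // resize() allocates only when the requested size exceeds the current capacity,
        // so with a fixed vocabulary size every call after the first reuses the same storage.
        cur.resize(logits.size());
        for (std::size_t i = 0; i < logits.size(); ++i) {
            cur[i] = TokenData{static_cast<int32_t>(i), logits[i], 0.0f};
        }

        std::size_t best = 0;
        for (std::size_t i = 1; i < cur.size(); ++i) {
            if (cur[i].logit > cur[best].logit) {
                best = i;
            }
        }
        return cur[best].id;
    }
};
```

Because the vocabulary size normally stays the same from one call to the next, the buffer is allocated once on the first call and overwritten in place afterwards, which removes one heap allocation per sampled token.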


Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
