Back to feed

b7932

Feb 4, 2026
Meta/llama.cppCLIvb7932

completion : simplify batch (embd) processing (#19286)

  • completion : simplify batch (embd) processing

This commit simplifies the processing of embd by removing the for loop that currently exists which uses params.n_batch as its increment. This commit also removes the clamping of n_eval as the size of embd is always at most the size of params.n_batch.

The motivation is to clarify the code as it is currently a little confusing when looking at this for loop in isolation and thinking that it can process multiple batches.

  • add an assert to verify n_eval is not greater than n_batch

macOS/iOS:

Linux:

Windows:

openEuler: