Llama models apparently adding extra input tokens. #292

snova-rodrigom · 2025-03-17T21:04:21Z

Hi, I've tested multiple Llama models 3.1-3.3 from SambaNova, Fireworks and TogetherAI and have noticed that additional tokens are apparently being added per API model call. I'm using the right model tokenizer to cut and count the number of desire tokens to process as prompt tokens, however APIs always return ~30 extra tokens more, which is not expected. Since this is happening in all of them, is this a model related issue? is there a way to get the exact amount of prompt tokens expected?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama models apparently adding extra input tokens. #292

Llama models apparently adding extra input tokens. #292

snova-rodrigom commented Mar 17, 2025

Llama models apparently adding extra input tokens. #292

Llama models apparently adding extra input tokens. #292

Comments

snova-rodrigom commented Mar 17, 2025