mirror of
https://github.com/mudler/LocalAI.git
synced 2025-04-28 06:49:54 +00:00
This happens when max_tokens is not set: by default, go-llama allocates more space for the slice than the output needs, so the result is padded.
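
The effect can be illustrated with a minimal Go sketch. This is not go-llama's actual code: `fill`, `defaultTokens`, and the value 8 are hypothetical names and numbers chosen for illustration. It assumes the buffer is sized from a default when no limit is given, so the unused tail keeps its zero values until the slice is trimmed to the number of elements actually written.

```go
package main

import "fmt"

// defaultTokens is a hypothetical fallback size used when no limit is set.
const defaultTokens = 8

// fill allocates a buffer of maxTokens elements (or defaultTokens when
// maxTokens is not set), copies the produced values into it, and returns
// the buffer together with the number of elements actually written.
func fill(produced []float32, maxTokens int) ([]float32, int) {
	if maxTokens <= 0 {
		maxTokens = defaultTokens
	}
	buf := make([]float32, maxTokens) // zero-initialized, so the unused tail is padding
	n := copy(buf, produced)          // copy returns the number of elements copied
	return buf, n
}

func main() {
	buf, n := fill([]float32{0.1, 0.2, 0.3}, 0)
	fmt.Println("padded length:", len(buf))
	fmt.Println("trimmed length:", len(buf[:n])) // re-slicing drops the padding
}
```

Re-slicing with `buf[:n]` does not copy the data; it only narrows the view to the elements that were written.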