Georgi Gerganov 62b5ff875c stream : add "max_tokens" parameter
Used to limit the number of tokens in a segment.
Useful for combating word repetition when using a partial encoder context.
2022-11-20 21:22:41 +02:00