From 81c185576c6aea21139c5ffeb69b46228fb80894 Mon Sep 17 00:00:00 2001 From: Georgi Gerganov Date: Thu, 20 Oct 2022 17:39:31 +0300 Subject: [PATCH] Update README.md --- README.md | 11 ++++++++++- 1 file changed, 10 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index d5940535..21fcc73c 100644 --- a/README.md +++ b/README.md @@ -252,9 +252,18 @@ the framwork utilizes the special-purpose AMX coprocessor available in modern Ap ## Limitations -- Very basic greedy sampling scheme - always pick up the top token. You can implement your own strategy - Inference only - No GPU support +- Very basic greedy sampling scheme - always pick up the token with highest probability. + This should be similar to the [GreedyDecoder](https://github.com/openai/whisper/blob/main/whisper/decoding.py#L249-L274) + from the original python implementation, so in order to make a fair comparison between the 2 implementations, make sure + to run the python code with the following parameters: + + ``` + whisper --best_of 1 --beam_size 1 ... + ``` + + In the future, `whisper.cpp` will support more sampling strategies. ## Memory usage