whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2024-12-30 01:08:52 +00:00

Author	SHA1	Message	Date
Georgi Gerganov	fcf515de60	bench.wasm : same as "bench" but runs in the browser (#89 )	2022-12-11 11:09:10 +02:00
Georgi Gerganov	3b1aacbe6d	talk : talk with AI in the terminal	2022-12-10 16:51:58 +02:00
Georgi Gerganov	3996ecc156	Update README.md	2022-12-07 05:15:46 +02:00
Georgi Gerganov	9fe7306f4b	models : add the new "large" model release by OpenAI The old "large" model is now renamed "large-v1". If you have been using it, make sure to rename it and download the new "large" model for best results.	2022-12-06 18:48:57 +02:00
Georgi Gerganov	6fd5358dd0	Update README.md	2022-11-27 11:30:32 +02:00
Georgi Gerganov	67e819baf4	minor : remove "examples/" prefix from the README	2022-11-26 13:07:54 +02:00
Georgi Gerganov	a425365b82	yt-wsp.sh : script to easily transcribe VODs Thanks to @DaniruKun ref: https://gist.github.com/DaniruKun/96f763ec1a037cc92fe1a059b643b818 Usage: cd whisper.cpp make ./examples/yt-wsp.sh <video-url>	2022-11-26 12:54:42 +02:00
Georgi Gerganov	e0e864d9ca	Update README.md	2022-11-26 11:56:55 +02:00
Georgi Gerganov	68ecadbbc9	command.wasm : add voice assistant example for the Web (#171 ) Same as the command-line tool "command", but runs in the browser Also, added helper script "extra/deploy-wasm.sh" and fixed some timing constants for the WASM examples.	2022-11-26 11:40:06 +02:00
Georgi Gerganov	1246dd023e	command : add demonstration video	2022-11-25 20:23:58 +02:00
Georgi Gerganov	bc88eb13c6	examples : add "command" tool (#171 )	2022-11-25 19:36:57 +02:00
Georgi Gerganov	b8ce25dec1	refactoring : more readable code	2022-11-25 19:28:04 +02:00
Georgi Gerganov	2c0501b38a	Update README.md	2022-11-24 20:06:51 +02:00
Georgi Gerganov	35cd29ce1f	ggml : fix cross-compile Linux -> Window with mingw (#168 )	2022-11-23 22:28:41 +02:00
Georgi Gerganov	a156a358ca	Revert "update README.md" This reverts commit `6a84147113`.	2022-11-23 22:16:50 +02:00
katsu560	6a84147113	update README.md	2022-11-23 22:16:33 +02:00
Georgi Gerganov	363a2dadec	Update README.md	2022-11-23 09:53:55 +02:00
Georgi Gerganov	623a486056	Update README.md	2022-11-23 09:52:36 +02:00
Georgi Gerganov	2e311a2917	Update README.md	2022-11-21 18:52:20 +02:00
Georgi Gerganov	864a78a8d0	models : change default hosting to Hugging Face My Linode is running out of monthly bandwidth due to the big interest in the project	2022-11-15 19:47:06 +02:00
Georgi Gerganov	8fdfb0ba92	Update README.md	2022-11-06 21:04:21 +02:00
Georgi Gerganov	a09e9123ca	Update README.md	2022-11-05 08:44:41 +02:00
Georgi Gerganov	0e689f83d8	Update README.md	2022-11-02 22:03:27 +02:00
Georgi Gerganov	d5afebd37c	whisper : token-level timestamp refactoring (#49 , #120 ) This turned out pretty good overall. The algorithm has been moved from main.cpp to whisper.cpp and can be reused for all subtitles types. This means that now you can specify the maximum length of the generated lines. Simply provide the "-ml" argument specifying the max length in number of characters	2022-11-02 21:45:54 +02:00
Georgi Gerganov	4b1c32e8ea	Update README.md	2022-11-02 18:33:29 +02:00
Georgi Gerganov	b5dde365e9	extra : compute SHA of all models files	2022-11-02 18:31:55 +02:00
Georgi Gerganov	e46bc56e71	Update README.md	2022-11-01 22:47:58 +02:00
Georgi Gerganov	b0f2aa0ea6	Update README.md	2022-10-30 17:10:46 +02:00
Georgi Gerganov	2c281d190b	Update README.md	2022-10-28 22:09:40 +03:00
Georgi Gerganov	9ccafa8792	Update README.md	2022-10-25 20:53:48 +03:00
Georgi Gerganov	89d8ee3ee5	Update README.md	2022-10-25 20:53:48 +03:00
Georgi Gerganov	c6710efde2	refactoring : move main + stream in examples + other stuff	2022-10-25 20:53:48 +03:00
Georgi Gerganov	728676927f	Update README.md	2022-10-24 18:26:21 +03:00
Georgi Gerganov	181b762de8	Update README.md	2022-10-23 12:47:51 +03:00
Georgi Gerganov	4196856c7b	Update README.md	2022-10-23 10:24:36 +03:00
Georgi Gerganov	705198f063	Update README.md	2022-10-23 10:12:10 +03:00
Georgi Gerganov	3e69a6071d	Update README.md	2022-10-23 08:04:33 +03:00
Georgi Gerganov	f3dae90c31	Update README.md	2022-10-22 21:17:21 +03:00
Georgi Gerganov	8c1d970088	Update README.md	2022-10-22 19:00:25 +03:00
Georgi Gerganov	6b45e37b2b	Update README.md and finalize the whisper.wasm example	2022-10-22 18:54:01 +03:00
Georgi Gerganov	5698b51718	Update README.md	2022-10-20 17:52:59 +03:00
Georgi Gerganov	3fe3898ebb	Update README.md	2022-10-20 17:43:56 +03:00
Georgi Gerganov	81c185576c	Update README.md	2022-10-20 17:39:31 +03:00
Georgi Gerganov	1969ee4bc7	Update README.md	2022-10-18 22:20:35 +03:00
Georgi Gerganov	72d967bce4	Use Accelerate framework on Apple silicon Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro) Also various extra optimizations: - Multi-threaded NORM operator - Faster GELU via F16 cast	2022-10-18 00:12:51 +03:00
Georgi Gerganov	36945162fa	Update README.md (ref #50 )	2022-10-15 09:40:08 +03:00
Georgi Gerganov	b2f1600aa3	Update README.md	2022-10-12 21:25:42 +03:00
Topping1	1348796a93	Update README.md (#43 ) * Update README.md Updated README.md to list new features, such as subtitle file support (VTT and SRT) * Update README.md Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2022-10-12 07:32:14 +03:00
Georgi Gerganov	8d94358251	Update README.md	2022-10-11 00:36:32 +03:00
Georgi Gerganov	ad6693fb64	Update README.md	2022-10-10 22:16:25 +03:00
Georgi Gerganov	63b6786767	Minor	2022-10-10 22:06:27 +03:00
Georgi Gerganov	f7ab81fe51	Update README.md	2022-10-10 22:05:37 +03:00
Georgi Gerganov	4c4ab71d4d	Update README.md	2022-10-08 11:46:34 +03:00
Georgi Gerganov	2d47693435	Update README.md	2022-10-08 11:43:42 +03:00
Georgi Gerganov	700898e6ed	ref #22 : add option to provide multiple input .wav files	2022-10-05 23:44:10 +03:00
Georgi Gerganov	6b1c3cc198	Update README.md	2022-10-05 23:13:15 +03:00
Georgi Gerganov	b8f713482e	Minor updates	2022-10-05 23:11:02 +03:00
Georgi Gerganov	e7a15876f8	Update README.md	2022-10-04 23:27:25 +03:00
Georgi Gerganov	d71e567656	Update README.md	2022-10-02 18:19:22 +03:00
Georgi Gerganov	62897e8ae6	Update README.md	2022-10-01 00:01:04 +03:00
Georgi Gerganov	3bcdbdfc32	Reduce memory usage even more + better sampling - The encode/decode memory buffers are now reused - If the 30-sec segment goes for too long without a timestamp token, we force one. Improves transcription for large model - Stereo support - Add "micro-machines.wav" sample	2022-09-30 19:35:27 +03:00
Georgi Gerganov	310f4883d1	Update README.md	2022-09-29 23:48:01 +03:00
Georgi Gerganov	fd3f3d748f	Update README.md	2022-09-29 23:37:59 +03:00
Georgi Gerganov	5877c3578e	ref #4 : added transcription timestamps Can be turned off with "-nt" argument. Performance has also improved.	2022-09-29 23:09:39 +03:00
Georgi Gerganov	4352a6018b	Update README.md	2022-09-28 21:13:32 +03:00
Georgi Gerganov	f888c2373d	Flash + language support (ref #2 ) - Achieved big performance improvement + memory usage reduction - Can now translate / transcribe different languages	2022-09-28 21:07:32 +03:00
Georgi Gerganov	476182e439	Update README.md and simplify usage	2022-09-26 09:36:51 +03:00
Georgi Gerganov	f2456f8d93	Create README.md	2022-09-25 22:59:04 +03:00

1 2 3 4

168 Commits