Commit Graph

70 Commits

Author SHA1 Message Date
8dac3c6e10 Fixed sched_yield 2022-10-30 21:38:18 +02:00
6417e59aad Implemenated sched_yield function for Windows 2022-10-30 21:38:18 +02:00
e5044f87d9 ggml : fix barrier 2022-10-29 19:37:19 +03:00
a272f10b2e ggml : fix thread-safety of ggml_init and ggml_free 2022-10-29 19:37:19 +03:00
fbd513b813 Add OpenBLAS support
Supported via CMake - just add:

cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON

On Ubuntu, you have to install the library like this:

apt install libopenblas-dev

Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2022-10-27 18:31:49 +03:00
34bb3ab0cf ggml : add system info functions 2022-10-25 20:53:48 +03:00
c6710efde2 refactoring : move main + stream in examples + other stuff 2022-10-25 20:53:48 +03:00
db460b78ff wip : WASM 128-bit SIMD support 2022-10-22 18:54:01 +03:00
e905c6f827 wip : initial WASM port
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2022-10-22 18:54:01 +03:00
19817711b4 Add reference to FP16 repo 2022-10-18 19:48:34 +03:00
e36aabe00d Correct implementation of FP16 GELU
Can toggle it via the GGML_GELU_FP16 macro
2022-10-18 18:42:08 +03:00
91632eb6ea Revert GELU change
Seems it does not work on x86 for some reason
2022-10-18 00:45:08 +03:00
72d967bce4 Use Accelerate framework on Apple silicon
Huge performance improvement in the Encode (almost x2 on MacBook M1 Pro)

Also various extra optimizations:

- Multi-threaded NORM operator
- Faster GELU via F16 cast
2022-10-18 00:12:51 +03:00
0e858f080d close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
2022-10-17 18:10:16 +03:00
0b45d25151 Building with MSVC 2022-10-11 21:40:46 +03:00
4bbb8a587b Add MinGW support 2022-10-09 22:26:37 +08:00
e29a5dacc6 ref #11, #18, #26 : fix CACHE_LINE_SIZE constant 2022-10-07 21:56:44 +03:00
167324584b wip : rpi4 support 2022-10-05 23:03:46 +03:00
f888c2373d Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2022-09-28 21:07:32 +03:00
b0a11594ae Initial release 2022-09-25 22:13:49 +03:00