Shupei Fan
|
330273901f
|
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)
* ggml-cpu: support IQ4_NL_4_4 by runtime repack
* ggml-cpu: add __ARM_FEATURE_DOTPROD guard
|
2024-12-08 20:14:35 +02:00 |
|
Diego Devesa
|
77e3e4a090
|
ggml : add support for dynamic loading of backends (llama/10469)
* ggml : add support for dynamic loading of backends
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-12-08 20:14:35 +02:00 |
|
Charles Xu
|
3298916e5e
|
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921)
* backend-cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com>
|
2024-11-20 21:00:08 +02:00 |
|
Diego Devesa
|
746bf2596f
|
ggml : build backends as libraries (llama/10256)
* ggml : build backends as libraries
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: R0CKSTAR <xiaodong.ye@mthreads.com>
|
2024-11-20 21:00:08 +02:00 |
|
Diego Devesa
|
9c817edb48
|
ggml : move CPU backend to a separate file (llama/10144)
|
2024-11-15 15:21:04 +02:00 |
|