Diego Devesa
746bf2596f
ggml : build backends as libraries (llama/10256)
...
* ggml : build backends as libraries
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: R0CKSTAR <xiaodong.ye@mthreads.com>
2024-11-20 21:00:08 +02:00
leo-pony
13db492f83
Adapt to dynamically loadable backends mechanism (llama/9970)
...
* [CANN] Adapt to dynamically loadable backends mechanism
* Fix the bug: inference output is garbled when running in debug mode for LM models whose type is Q4_0 class
* Handle the review comments of this pull request
2024-11-01 10:19:05 +02:00
Diego Devesa
cf977670e6
ggml-backend : add device and backend reg interfaces (llama/9707)
...
Also:
- metal : fix compute pass descriptor autorelease crash
- ggml-backend : add device description to CPU backend
- ggml: unify backend logging mechanism
2024-10-05 15:23:51 +03:00
Diego Devesa
1acfadb721
ggml-backend : add device and backend reg interfaces (llama/9707)
...
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2024-10-05 15:23:51 +03:00
Dou Xinpeng
c6cc8d16c3
cann: Add host buffer type for Ascend NPU (llama/9406)
...
* feat: Add host buffer type for Ascend NPU (CANN backend)
* fix some checking errors
* Add a few comments
2024-09-24 19:45:08 +03:00
hipudding
be88ee1d75
ggml : add CANN backend (llama/0)
...
ggml-ci
2024-08-09 09:58:16 +03:00