LocalAI/pkg/backend/llm
Ettore Di Giacinto 1120847f72
feat: bump llama.cpp, add gguf support (#943)
**Description**

This PR syncs up the `llama` backend to use `gguf`
(https://github.com/go-skynet/go-llama.cpp/pull/180). It also adds
`llama-stable` to the targets so we can still load ggml. It adapts the
current tests to use the `llama-backend` for ggml and uses a `gguf`
model to run tests on the new backend.

In order to consume the new version of go-llama.cpp, it also bump go to
1.21 (images, pipelines, etc)

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2023-08-24 01:18:58 +02:00
..
bert fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00
bloomz fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00
falcon fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00
gpt4all fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00
langchain fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00
llama feat: bump llama.cpp, add gguf support (#943) 2023-08-24 01:18:58 +02:00
llama-stable feat: add llama-stable backend (#932) 2023-08-20 16:35:42 +02:00
rwkv Feat: rwkv improvements: (#937) 2023-08-22 18:48:06 +02:00
transformers fix: drop racy code, refactor and group API schema (#931) 2023-08-20 14:04:45 +02:00