LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-05-19 08:53:07 +00:00

Ettore Di Giacinto 1120847f72

feat: bump llama.cpp, add gguf support (#943)

**Description**

This PR syncs up the `llama` backend to use `gguf`
(https://github.com/go-skynet/go-llama.cpp/pull/180). It also adds
`llama-stable` to the targets so we can still load ggml. It adapts the
current tests to use the `llama-backend` for ggml and uses a `gguf`
model to run tests on the new backend.

To consume the new version of go-llama.cpp, it also bumps Go to
1.21 (images, pipelines, etc.).

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2023-08-24 01:18:58 +02:00

bert

fix: drop racy code, refactor and group API schema (#931)

2023-08-20 14:04:45 +02:00

bloomz

fix: drop racy code, refactor and group API schema (#931)

2023-08-20 14:04:45 +02:00

falcon

fix: drop racy code, refactor and group API schema (#931)

2023-08-20 14:04:45 +02:00

gpt4all

fix: drop racy code, refactor and group API schema (#931)