LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-12-22 22:12:23 +00:00

Author	SHA1	Message	Date
Ettore Di Giacinto	b7821361c3	feat(petals): add backend (#1350 ) * feat(petals): add backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-28 09:01:46 +01:00
LocalAI [bot]	63e1f8fffd	⬆️ Update ggerganov/llama.cpp (#1345 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-27 09:02:19 +01:00
Ettore Di Giacinto	824612f1b4	feat: initial watchdog implementation (#1341 ) * feat: initial watchdog implementation Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * fiuxups * Add more output * wip: idletime checker * wire idle watchdog checks * enlarge watchdog time window * small fixes * Use stopmodel * Always delete process Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-26 18:36:23 +01:00
LocalAI [bot]	9482acfdfc	⬆️ Update ggerganov/llama.cpp (#1340 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-26 09:27:42 +01:00
Ettore Di Giacinto	c75bdd99e4	fix: rename transformers.py to avoid circular import (#1337 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-26 08:49:43 +01:00
Ettore Di Giacinto	6f34e8f044	fix: propagate CMAKE_ARGS when building grpc (#1334 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-25 13:53:51 +01:00
Ettore Di Giacinto	6d187af643	fix: handle grpc and llama-cpp with REBUILD=true (#1328 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-25 08:48:24 +01:00
LocalAI [bot]	97e9598c79	⬆️ Update ggerganov/llama.cpp (#1330 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-24 23:45:05 +01:00
B4ckslash	5a6a6de3d7	docs: Update Features->Embeddings page to reflect backend restructuring (#1325 ) * Update path to sentencetransformers backend for local execution Signed-off-by: Marcus Köhler <khler.marcus@gmail.com> * Rename huggingface-embeddings -> sentencetransformers in embeddings.md for consistency with the backend structure The Dockerfile still knows the "huggingface-embeddings" backend (I assume for compatibility reasons) but uses the sentencetransformers backend under the hood anyway. I figured it would be good to update the docs to use the new naming to make it less confusing moving forward. As the docker container knows both the "huggingface-embeddings" and the "sentencetransformers" backend, this should not break anything. Signed-off-by: Marcus Köhler <khler.marcus@gmail.com> --------- Signed-off-by: Marcus Köhler <khler.marcus@gmail.com>	2023-11-24 18:21:04 +01:00
LocalAI [bot]	b1a20effde	⬆️ Update ggerganov/llama.cpp (#1323 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-24 08:32:36 +01:00
Ettore Di Giacinto	ba5ab26f2e	docs: Add llava, update hot topics (#1322 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-23 18:54:55 +01:00
Dave	69f53211a1	Feat: OSX Local Codesigning (#1319 ) * stage makefile * OSX local code signing and entitlements file to fix incoming connections prompt	2023-11-23 15:22:54 +01:00
B4ckslash	9dddd1134d	fix: move python header comments below shebang in some backends (#1321 ) * Fix python header comments for some extra gRPC backends When a Python script is to be executed directly via exec(3), either the platform knows how to execute the file itself (i.e. special configuration is necessary) or the first line contains a shebang (#!) specifying the interpreter to run it (similar to shell scripts). The shebang MUST be on the first line for the script to work on all platforms, so any header comments need to be in the lines following it. Otherwise executing these scripts as extra backends will yield an "exec format error" message. Changes: * Move introductory comments below the shebang line * Change header comment in transformers.py to refer to the correct python module Signed-off-by: Marcus Köhler <khler.marcus@gmail.com> * Make header comment in ttsbark.py more specific Signed-off-by: Marcus Köhler <khler.marcus@gmail.com> --------- Signed-off-by: Marcus Köhler <khler.marcus@gmail.com>	2023-11-23 15:22:37 +01:00
Ettore Di Giacinto	c5c77d2b0d	docs: Initial import from localai-website (#1312 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-22 18:13:50 +01:00
LocalAI [bot]	763f94ca80	⬆️ Update ggerganov/llama.cpp (#1313 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-22 08:37:11 +01:00
ok2sh	20d637e7b7	fix: ExLlama Backend Context Size & Rope Scaling (#1311 ) * fix: context_size not propagated to exllama backend * fix: exllama rope scaling	2023-11-21 19:26:39 +01:00
LocalAI [bot]	480b14c8dc	⬆️ Update ggerganov/llama.cpp (#1310 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-21 00:20:37 +01:00
Ettore Di Giacinto	999db4301a	ci(core): add -core images without python deps (#1309 ) * ci(core): add -core images without python deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci(core): use public runners --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-20 23:01:31 +01:00
Ettore Di Giacinto	92cbc4d516	feat(transformers): add embeddings with Automodel (#1308 ) * Update huggingface.py Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2 Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> * feat(transformers): split in separate backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Lucas Hänke de Cansino <lhc@next-boss.eu>	2023-11-20 21:21:17 +01:00
LocalAI [bot]	ff9afdb0fe	⬆️ Update ggerganov/llama.cpp (#1306 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-20 08:16:00 +01:00
LocalAI [bot]	3e35b20a02	⬆️ Update mudler/go-piper (#1305 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-19 09:01:40 +01:00
LocalAI [bot]	9ea371d6cd	⬆️ Update ggerganov/llama.cpp (#1304 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-19 08:49:05 +01:00
Ettore Di Giacinto	7a0f9767da	docs: fix heading Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-18 15:04:00 +01:00
Ettore Di Giacinto	9d7363f2a7	docs: update configuration readme Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-18 15:03:15 +01:00
Ettore Di Giacinto	8ee5cf38fd	Delete examples/configurations/llava/README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-18 15:01:39 +01:00
Ettore Di Giacinto	a6b788d220	docs: update LLaVa instructions Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-18 15:01:16 +01:00
lunamidori5	ccd87cd9f0	llava.yaml (yaml format standardization) (#1303 ) Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>	2023-11-18 14:48:54 +01:00
LocalAI [bot]	b5af87fc6c	⬆️ Update ggerganov/llama.cpp (#1300 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-18 08:19:10 +01:00
Ettore Di Giacinto	3c9544b023	refactor: rename llama-stable to llama-ggml (#1287 ) * refactor: rename llama-stable to llama-ggml * Makefile: get sources in sources/ Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup sources Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups sd Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update SD * fixup * fixup: create piper libdir also when not built Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix make target on linux test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-18 08:18:43 +01:00
Mathias	2f65671070	fix(api/config): allow YAML config with .yml (#1299 ) This commit allow to use both `.yml` and `.yaml` extensions for YAML configuration files as it is usually expected.	2023-11-17 22:47:30 +01:00
LocalAI [bot]	8c5436cbed	⬆️ Update ggerganov/llama.cpp (#1297 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-17 08:45:22 +01:00
Ettore Di Giacinto	548959b50f	feat: queue up requests if not running parallel requests (#1296 ) Return a GRPC which handles a lock in case it is not meant to be parallel. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-16 22:20:16 +01:00
LocalAI [bot]	2addb9f99a	⬆️ Update ggerganov/llama.cpp (#1291 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-16 08:20:26 +01:00
Ettore Di Giacinto	fdd95d1d86	feat: allow to run parallel requests (#1290 ) * feat: allow to run parallel requests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-16 08:20:05 +01:00
Ettore Di Giacinto	66a558ff41	fix: respect OpenAI spec for response format (#1289 ) fix: properly respect OpenAI spec for response format Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-15 19:36:23 +01:00
LocalAI [bot]	733b612eb2	⬆️ Update ggerganov/llama.cpp (#1288 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-15 18:41:09 +01:00
LocalAI [bot]	991ecce004	⬆️ Update ggerganov/llama.cpp (#1285 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-14 18:23:09 +01:00
Ettore Di Giacinto	ad0e30bca5	refactor: move backends into the backends directory (#1279 ) * refactor: move backends into the backends directory Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor: move main close to implementation for every backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-13 22:40:16 +01:00
LocalAI [bot]	55461188a4	⬆️ Update ggerganov/llama.cpp (#1282 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-13 00:48:26 +00:00
LocalAI [bot]	5d2405fdef	⬆️ Update ggerganov/llama.cpp (#1280 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-11 23:26:54 +00:00
LocalAI [bot]	e9f1268225	⬆️ Update ggerganov/llama.cpp (#1272 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-11 20:00:28 +00:00
Ettore Di Giacinto	803a0ac02a	feat(llama.cpp): support lora with scale and yarn (#1277 ) * feat(llama.cpp): support lora with scale Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(llama.cpp): support yarn Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-11 18:40:48 +01:00
Gianluca Boiano	bde87d00b9	deps(go-piper): update to 2023.11.6-3 (#1257 ) Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2023-11-11 18:40:26 +01:00
Ettore Di Giacinto	0eae727366	🔥 add LaVA support and GPT vision API, Multiple requests for llama.cpp, return JSON types (#1254 ) * wip * wip * Make it functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip * Small fixups * do not inject space on role encoding, encode img at beginning of messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add examples/config defaults * Add include dir of current source dir * cleanup * fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups * Revert "fixups" This reverts commit `f1a4731cca`. * fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-11 13:14:59 +01:00
LocalAI [bot]	3b4c5d54d8	⬆️ Update ggerganov/llama.cpp (#1265 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-10 08:50:42 +01:00
LocalAI [bot]	4e16bc2f13	⬆️ Update ggerganov/llama.cpp (#1256 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-08 08:21:12 +01:00
LocalAI [bot]	562ac62f59	⬆️ Update ggerganov/llama.cpp (#1242 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-07 08:37:55 +01:00
Diego	e7fa2e06f8	Fixes the bug 1196 (#1232 ) * Current state of the branch. * Now gRPC is build only when the BUILD_GRPC_FOR_BACKEND_LLAMA variable is defined. * Now the local compilation of gRPC is executed on BUILD_GRPC_FOR_BACKEND_LLAMA. * Revised the Makefile. * Removed replace directives in go.mod. --------- Signed-off-by: Diego <38375572+diego-minguzzi@users.noreply.github.com> Co-authored-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-06 19:07:46 +01:00
Ettore Di Giacinto	8123f009d0	dockerfile: fixup duplicate This should have been "exllama" Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-05 14:09:31 +01:00
Ettore Di Giacinto	622aaa9f7d	dockerfile: avoid pushing a big layer Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-05 10:31:33 +01:00

... 8 9 10 11 12 ...

1409 Commits