LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-12-24 14:56:41 +00:00

Author	SHA1	Message	Date
Ettore Di Giacinto	182fef339d	Create dependabot_auto.yml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-04-11 12:13:06 +02:00
Ettore Di Giacinto	c74dec7e38	Add dependabot.yml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-04-11 11:47:54 +02:00
Ettore Di Giacinto	d692b2c32a	ci: push latest images for dockerhub (#1984 ) Fixes: #1983 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-04-10 10:31:59 +02:00
Ettore Di Giacinto	cc3d601836	ci: fixup latest image push Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-04-09 09:49:11 +02:00
Ettore Di Giacinto	2bbb221fb1	tests(petals): temp disable	2024-04-08 21:28:59 +00:00
Ettore Di Giacinto	ff77d3bc22	fix(seed): generate random seed per-request if -1 is set (#1952 ) * fix(seed): generate random seed per-request if -1 is set Also update ci with new workflows and allow the aio tests to run with an api key Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs(openvino): Add OpenVINO example Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-03 22:25:47 +02:00
Ettore Di Giacinto	93cfec3c32	ci: correctly tag latest and aio images	2024-04-03 11:30:23 +02:00
Ettore Di Giacinto	89560ef87f	fix(ci): manually tag latest images (#1948 ) fix(ci): manually tag images Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-02 19:25:46 +02:00
cryptk	93702e39d4	feat(build): adjust number of parallel make jobs (#1915 ) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile	2024-03-29 22:32:40 +01:00
cryptk	0c0efc871c	fix(build): better CI logging and correct some build failure modes in Makefile (#1899 ) * feat: group make output by target when running parallelized builds in CI * fix: quote GO_TAGS in makefile to fix handling of whitespace in value * fix: set CPATH to find opencv2 in it's commonly installed location * fix: add missing go mod dropreplace for go-llama.cpp * chore: remove opencv symlink from github workflows	2024-03-27 21:12:19 +01:00
Ettore Di Giacinto	49cec7fd61	ci(aio): add latest tag images (#1884 ) Tangentially also fixes #1868	2024-03-23 16:08:32 +01:00
Ettore Di Giacinto	d9456f2a23	ci(aio): publish hipblas and Intel GPU images (#1883 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-03-23 15:54:14 +01:00
Ettore Di Giacinto	8495750cb8	Update release.yml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-23 15:22:26 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
Ettore Di Giacinto	418ba02025	ci: fix typo Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-22 09:14:17 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate	2024-03-18 19:19:43 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
Ettore Di Giacinto	5d1018495f	feat(intel): add diffusers/transformers support (#1746 ) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu	2024-03-07 14:37:45 +01:00
Ettore Di Giacinto	c1966af2cf	ci: reduce stress on self-hosted runners (#1776 ) Split jobs by self-hosted and free public runner provided by Github Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-29 11:40:08 +01:00
Sertaç Özercan	7f72a61104	ci: add stablediffusion to release (#1757 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-02-25 23:06:18 +00:00
fenfir	fb0a4c5d9a	Build docker container for ROCm (#1595 ) * Dockerfile changes to build for ROCm * Adjust linker flags for ROCm * Update conda env for diffusers and transformers to use ROCm pytorch * Update transformers conda env for ROCm * ci: build hipblas images * fixup rebase * use self-hosted Signed-off-by: mudler <mudler@localai.io> * specify LD_LIBRARY_PATH only when BUILD_TYPE=hipblas --------- Signed-off-by: mudler <mudler@localai.io> Co-authored-by: mudler <mudler@localai.io>	2024-02-16 15:08:50 +01:00
Sertaç Özercan	2e61ff32ad	ci: add cuda builds to release (#1702 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-02-13 08:35:39 +00:00
Ettore Di Giacinto	ddd21f1644	feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689 ) * cleanup backends * switch image to ubuntu 22.04 * adapt commands for ubuntu * transformers cleanup * no contrib on ubuntu * Change test model to gguf * ci: disable bark tests (too cpu-intensive) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * refinements * use intel base image * Makefile: Add docker targets * Change test model --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-08 20:12:51 +01:00
Ettore Di Giacinto	37e6974afe	ci: fix extra(bark) tests Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-06 20:49:28 +01:00
Ettore Di Giacinto	e23e490455	Revert "fix(Dockerfile): sycl dependencies" (#1687 ) Revert "fix(Dockerfile): sycl dependencies (#1686)" This reverts commit `f76bb8954b`.	2024-02-06 20:48:29 +01:00
Ettore Di Giacinto	f76bb8954b	fix(Dockerfile): sycl dependencies (#1686 ) * fix(Dockerfile): sycl dependencies Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(ci): cleanup before running bark test --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-06 19:42:52 +01:00
Ettore Di Giacinto	d168c7c9dc	ci: cleanup worker before run (#1685 ) Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-06 19:42:27 +01:00
Ettore Di Giacinto	fd9d060c94	ci: fix sycl image suffix Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-06 15:52:21 +01:00
Ettore Di Giacinto	1c57f8d077	feat(sycl): Add support for Intel GPUs with sycl (#1647 ) (#1660 ) * feat(sycl): Add sycl support (#1647) * onekit: install without prompts * set cmake args only in grpc-server Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * fixup sycl source env * Cleanup docs * ci: runs on self-hosted * fix typo * bump llama.cpp * llama.cpp: update server * adapt to upstream changes * adapt to upstream changes * docs: add sycl --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-01 19:21:52 +01:00
Ettore Di Giacinto	6ca4d38a01	docs/examples: enhancements (#1572 ) * docs: re-order sections * fix references * Add mixtral-instruct, tinyllama-chat, dolphin-2.5-mixtral-8x7b * Fix link * Minor corrections * fix: models is a StringSlice, not a String Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP: switch docs theme * content * Fix GH link * enhancements * enhancements * Fixed how to link Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com> * fixups * logo fix * more fixups * final touches --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com> Co-authored-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>	2024-01-18 19:41:08 +01:00
Ettore Di Giacinto	09e5d9007b	feat: embedded model configurations, add popular model examples, refactoring (#1532 ) * move downloader out * separate startup functions for preloading configuration files * docs: add popular model examples Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * shorteners * Add llava * Add mistral-openorca * Better link to build section * docs: update * fixup * Drop code dups * Minor fixups * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * ci: try to cache gRPC build during tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: do not build all images for tests, just necessary * ci: cache gRPC also in release pipeline * fixes * Update model_preload_test.go Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-01-05 23:16:33 +01:00
Ettore Di Giacinto	bcf02449b3	ci(dockerhub): push images also to dockerhub (#1542 ) Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-01-04 08:32:29 +01:00
Ettore Di Giacinto	ae0c48e6bd	ci(apple): speedups (#1471 ) * ci(apple): install grpc from brew * ci(apple): use brew deps also on release * ci(linux): install grpc from package manager * ci: set concurrency * Revert "ci(linux): install grpc from package manager" This reverts commit `004e3e308e`.	2023-12-26 19:19:37 +01:00
Ettore Di Giacinto	95eb72bfd3	feat: add 🐸 coqui (#1489 ) * feat: add coqui * docs: update news	2023-12-24 19:38:54 +01:00
Ettore Di Giacinto	939187a129	env(conda): use transformers for vall-e-x (#1481 )	2023-12-23 14:31:34 -05:00
Ettore Di Giacinto	b4b21a446b	feat(conda): share envs with transformer-based backends (#1465 ) * feat(conda): share env between diffusers and bark * Detect if env already exists * share diffusers and petals * tests: add petals * Use smaller model for tests with petals * test only model load on petals * tests(petals): run only load model tests * Revert "test only model load on petals" This reverts commit `111cfa97f1`. * move transformers and sentencetransformers to common env * Share also transformers-musicgen	2023-12-21 08:35:15 +01:00
Ettore Di Giacinto	2eeed2287b	docs: automatically track latest versions (#1451 )	2023-12-17 19:02:13 +01:00
Ettore Di Giacinto	9aa2a7ca13	extras: add vllm,bark,vall-e-x tests, bump diffusers (#1422 ) * tests: add vllm Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * tests: Add vall-e-x tests * Add bark tests * bump diffusers --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-12 00:39:26 +01:00
Ettore Di Giacinto	718a5d4a9e	fix(transformers): add sentence-transformers and transformers-musicgen tests, fix musicgen wrapper (#1420 ) tests: add sentence-transformers and transformers-musicgen Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * fix: tranformers-musicgen conda env Initialize correctly the environment for the transformers-musicgen backend. * fix(tests): transformer-musicgen tests fixups --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-11 19:26:02 +01:00
Ettore Di Giacinto	48e5380e45	tests: add diffusers tests (#1419 )	2023-12-11 08:20:34 +01:00
Ettore Di Giacinto	887b3dff04	feat: cuda transformers (#1401 ) * Use cuda in transformers if available tensorflow probably needs a different check. Signed-off-by: Erich Schubert <kno10@users.noreply.github.com> * feat: expose CUDA at top level Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * tests: add to tests and create workflow for py extra backends * doc: update note on how to use core images --------- Signed-off-by: Erich Schubert <kno10@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Erich Schubert <kno10@users.noreply.github.com>	2023-12-08 15:45:04 +01:00
Ettore Di Giacinto	6011911746	fix(piper): pin petals, phonemize and espeak (#1393 ) * fix: pin phonemize and espeak Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: pin petals deps --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-12-07 22:58:41 +01:00
Ettore Di Giacinto	c3fb4b1d8e	ci: rename workflow Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-30 19:25:33 +01:00
Ettore Di Giacinto	e3ca1a7dbe	ci: split into reusable workflows (#1366 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-30 19:24:37 +01:00
Ettore Di Giacinto	9b98be160a	ci: limit concurrent jobs (#1364 ) * ci: limit concurrent image push * docs: mention core images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-30 17:45:20 +01:00
Ettore Di Giacinto	999db4301a	ci(core): add -core images without python deps (#1309 ) * ci(core): add -core images without python deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci(core): use public runners --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-20 23:01:31 +01:00
Ettore Di Giacinto	92cbc4d516	feat(transformers): add embeddings with Automodel (#1308 ) * Update huggingface.py Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2 Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> * feat(transformers): split in separate backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Lucas Hänke de Cansino <lhc@next-boss.eu>	2023-11-20 21:21:17 +01:00
Ettore Di Giacinto	3c9544b023	refactor: rename llama-stable to llama-ggml (#1287 ) * refactor: rename llama-stable to llama-ggml * Makefile: get sources in sources/ Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup sources Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups sd Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update SD * fixup * fixup: create piper libdir also when not built Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix make target on linux test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-18 08:18:43 +01:00
Ettore Di Giacinto	ad0e30bca5	refactor: move backends into the backends directory (#1279 ) * refactor: move backends into the backends directory Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor: move main close to implementation for every backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-13 22:40:16 +01:00

1 2 3

114 Commits