LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-12-18 20:27:57 +00:00

Author	SHA1	Message	Date
LocalAI [bot]	784657a652	⬆️ Update ggerganov/llama.cpp (#1934 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:38 +01:00
LocalAI [bot]	831efa8893	⬆️ Update ggerganov/whisper.cpp (#1933 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:16 +01:00
LocalAI [bot]	2bba62ca4d	⬆️ Update ggerganov/llama.cpp (#1928 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:52:01 +00:00
cryptk	93702e39d4	feat(build): adjust number of parallel make jobs (#1915 ) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile	2024-03-29 22:32:40 +01:00
LocalAI [bot]	a7fc89c207	⬆️ Update ggerganov/whisper.cpp (#1927 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:29:50 +01:00
Ettore Di Giacinto	123a5a2e16	feat(swagger): Add swagger API doc (#1926 ) * makefile(build): add minimal and api build target * feat(swagger): Add swagger	2024-03-29 22:29:33 +01:00
LocalAI [bot]	ab2f403dd0	⬆️ Update ggerganov/whisper.cpp (#1924 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:59 +01:00
LocalAI [bot]	b9c5e14e2c	⬆️ Update ggerganov/llama.cpp (#1923 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:38 +01:00
LocalAI [bot]	07c49ee4b8	⬆️ Update ggerganov/whisper.cpp (#1914 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 22:53:13 +00:00
LocalAI [bot]	07c4bdda7c	⬆️ Update ggerganov/llama.cpp (#1913 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 21:57:59 +00:00
cryptk	0c0efc871c	fix(build): better CI logging and correct some build failure modes in Makefile (#1899 ) * feat: group make output by target when running parallelized builds in CI * fix: quote GO_TAGS in makefile to fix handling of whitespace in value * fix: set CPATH to find opencv2 in its commonly installed location * fix: add missing go mod dropreplace for go-llama.cpp * chore: remove opencv symlink from github workflows	2024-03-27 21:12:19 +01:00
Gianluca Boiano	7ef5f3b473	⬆️ Update M0Rf30/go-tiny-dream (#1911 )	2024-03-27 21:12:04 +01:00
LocalAI [bot]	b500ceaf73	⬆️ Update ggerganov/llama.cpp (#1904 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 23:21:54 +00:00
LocalAI [bot]	1395e505cd	⬆️ Update ggerganov/llama.cpp (#1897 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:34:10 +01:00
LocalAI [bot]	42a4c86dca	⬆️ Update ggerganov/whisper.cpp (#1896 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:33:46 +01:00
LocalAI [bot]	3e293f1465	⬆️ Update ggerganov/llama.cpp (#1889 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 21:12:18 +00:00
LocalAI [bot]	0106c58181	⬆️ Update ggerganov/llama.cpp (#1885 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 14:54:01 +01:00
LocalAI [bot]	a922119c41	⬆️ Update ggerganov/llama.cpp (#1881 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-23 09:23:28 +01:00
Richard Palethorpe	643d85d2cc	feat(stores): Vector store backend (#1795 ) Add simple vector store backend Signed-off-by: Richard Palethorpe <io@richiejp.com>	2024-03-22 21:14:04 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
LocalAI [bot]	dd84c29a3d	⬆️ Update ggerganov/whisper.cpp (#1875 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:56 +01:00
LocalAI [bot]	07468c8786	⬆️ Update ggerganov/llama.cpp (#1874 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:42 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Ettore Di Giacinto	e533dcf506	feat(functions/aio): all-in-one images, function template enhancements (#1862 ) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text	2024-03-21 01:12:20 +01:00
LocalAI [bot]	eeaf8c7ccd	⬆️ Update ggerganov/whisper.cpp (#1867 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:26:29 +00:00
LocalAI [bot]	7e34dfdae7	⬆️ Update ggerganov/llama.cpp (#1866 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:13:29 +00:00
LocalAI [bot]	e4bf51d5bd	⬆️ Update ggerganov/llama.cpp (#1864 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 09:05:53 +01:00
LocalAI [bot]	ead61bf9d5	⬆️ Update ggerganov/llama.cpp (#1857 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:03:17 +00:00
LocalAI [bot]	621541a92f	⬆️ Update ggerganov/whisper.cpp (#1508 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:23 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate	2024-03-18 19:19:43 +01:00
Ettore Di Giacinto	b202bfaaa0	deps(whisper.cpp): update, fix cublas build (#1846 ) fix(whisper.cpp): Add stubs and -lcuda	2024-03-18 15:56:53 +01:00
LocalAI [bot]	0eb0ac7dd0	⬆️ Update ggerganov/llama.cpp (#1848 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-18 08:57:58 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
LocalAI [bot]	8967ed1601	⬆️ Update ggerganov/llama.cpp (#1840 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-16 11:25:41 +00:00
LocalAI [bot]	5826fb8e6d	⬆️ Update mudler/go-piper (#1844 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-15 23:51:03 +00:00
Dave	db199f61da	fix: osx build default.metallib (#1837 ) * port osx fix from refactor pr to slim pr * manually bump llama.cpp version to unstick CI?	2024-03-15 08:18:58 +00:00
LocalAI [bot]	44adbd2c75	⬆️ Update go-skynet/go-llama.cpp (#1835 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 23:06:42 +00:00
Dave	45d520f913	fix: OSX Build Files for llama.cpp (#1836 ) bot ate my changes, separate branch	2024-03-14 23:07:47 +01:00
LocalAI [bot]	f82065703d	⬆️ Update ggerganov/llama.cpp (#1827 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 08:39:39 +01:00
LocalAI [bot]	5c5f07c1e7	⬆️ Update ggerganov/llama.cpp (#1821 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-13 10:05:46 +01:00
LocalAI [bot]	8e57f4df31	⬆️ Update ggerganov/llama.cpp (#1818 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-11 00:02:37 +01:00
LocalAI [bot]	a08cc5adbb	⬆️ Update ggerganov/llama.cpp (#1816 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-10 09:32:09 +01:00
LocalAI [bot]	595a73fce4	⬆️ Update ggerganov/llama.cpp (#1813 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-09 09:27:06 +01:00
LocalAI [bot]	dc919e08e8	⬆️ Update ggerganov/llama.cpp (#1811 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-08 08:21:25 +01:00
Ettore Di Giacinto	5d1018495f	feat(intel): add diffusers/transformers support (#1746 ) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu	2024-03-07 14:37:45 +01:00
LocalAI [bot]	ad6fd7a991	⬆️ Update ggerganov/llama.cpp (#1805 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-06 23:28:31 +01:00
LocalAI [bot]	e022b5959e	⬆️ Update mudler/go-stable-diffusion (#1802 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 23:39:57 +00:00
LocalAI [bot]	db7f4955a1	⬆️ Update ggerganov/llama.cpp (#1801 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 21:50:27 +00:00
LocalAI [bot]	c8e29033c2	⬆️ Update ggerganov/llama.cpp (#1794 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 08:59:09 +01:00
LocalAI [bot]	d0bd961bde	⬆️ Update ggerganov/llama.cpp (#1791 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-04 09:44:21 +01:00
LocalAI [bot]	b60a3fc879	⬆️ Update ggerganov/llama.cpp (#1789 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-03 08:49:23 +01:00
LocalAI [bot]	daa0b8741c	⬆️ Update ggerganov/llama.cpp (#1785 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-01 22:38:24 +00:00
Dave	1c312685aa	refactor: move remaining api packages to core (#1731 ) * core 1 * api/openai/files fix * core 2 - core/config * move over core api.go and tests to the start of core/http * move over localai specific endpoints to core/http, begin the service/endpoint split there * refactor big chunk on the plane * refactor chunk 2 on plane, next step: port and modify changes to request.go * easy fixes for request.go, major changes not done yet * lintfix * json tag lintfix? * gitignore and .keep files * strange fix attempt: rename the config dir?	2024-03-01 16:19:53 +01:00
LocalAI [bot]	316de82f51	⬆️ Update ggerganov/llama.cpp (#1779 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-29 22:33:30 +00:00
LocalAI [bot]	c665898652	⬆️ Update donomii/go-rwkv.cpp (#1771 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:50:27 +00:00
LocalAI [bot]	f651a660aa	⬆️ Update ggerganov/llama.cpp (#1772 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:02:30 +01:00
LocalAI [bot]	c7e08813a5	⬆️ Update ggerganov/llama.cpp (#1767 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 23:12:51 +01:00
LocalAI [bot]	d21a6b33ab	⬆️ Update ggerganov/llama.cpp (#1756 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 18:07:51 +00:00
Ettore Di Giacinto	d6cf82aba3	fix(tests): re-enable tests after code move (#1764 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-27 15:04:19 +01:00
Ettore Di Giacinto	bc5f5aa538	deps(llama.cpp): update (#1759 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-26 13:18:44 +01:00
Sertaç Özercan	7f72a61104	ci: add stablediffusion to release (#1757 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-02-25 23:06:18 +00:00
LocalAI [bot]	8e45d47740	⬆️ Update ggerganov/llama.cpp (#1753 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-25 10:03:19 +01:00
LocalAI [bot]	ff88c390bb	⬆️ Update ggerganov/llama.cpp (#1750 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-24 00:06:46 +01:00
LocalAI [bot]	d825821a22	⬆️ Update ggerganov/llama.cpp (#1740 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-23 00:07:15 +01:00
LocalAI [bot]	6fc122fa1a	⬆️ Update ggerganov/llama.cpp (#1705 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-22 09:33:23 +00:00
Ettore Di Giacinto	8292781045	deps(llama.cpp): update, support Gemma models (#1734 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 17:23:38 +01:00
Ettore Di Giacinto	54ec6348fa	deps(llama.cpp): update (#1714 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 11:35:44 +01:00
fenfir	fb0a4c5d9a	Build docker container for ROCm (#1595 ) * Dockerfile changes to build for ROCm * Adjust linker flags for ROCm * Update conda env for diffusers and transformers to use ROCm pytorch * Update transformers conda env for ROCm * ci: build hipblas images * fixup rebase * use self-hosted Signed-off-by: mudler <mudler@localai.io> * specify LD_LIBRARY_PATH only when BUILD_TYPE=hipblas --------- Signed-off-by: mudler <mudler@localai.io> Co-authored-by: mudler <mudler@localai.io>	2024-02-16 15:08:50 +01:00
Ettore Di Giacinto	5e155fb081	fix(python): pin exllama2 (#1711 ) fix(python): pin python deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-14 21:44:12 +01:00
Ettore Di Giacinto	39a6b562cf	fix(llama.cpp): downgrade to a known working version (#1706 ) sycl support is broken otherwise. See upstream issue: https://github.com/ggerganov/llama.cpp/issues/5469 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-14 10:28:06 +01:00
LocalAI [bot]	02f6e18adc	⬆️ Update ggerganov/llama.cpp (#1700 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-12 21:43:33 +00:00
LocalAI [bot]	4436e62cf1	⬆️ Update ggerganov/llama.cpp (#1698 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-12 09:56:04 +01:00
LocalAI [bot]	58cdf97361	⬆️ Update ggerganov/llama.cpp (#1694 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-11 10:01:11 +01:00
LocalAI [bot]	ef1306f703	⬆️ Update mudler/go-stable-diffusion (#1674 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 21:59:15 +00:00
LocalAI [bot]	3196967995	⬆️ Update ggerganov/llama.cpp (#1691 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 21:50:34 +00:00
LocalAI [bot]	fc8423392f	⬆️ Update ggerganov/llama.cpp (#1688 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 00:02:23 +01:00
Ettore Di Giacinto	ddd21f1644	feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689 ) * cleanup backends * switch image to ubuntu 22.04 * adapt commands for ubuntu * transformers cleanup * no contrib on ubuntu * Change test model to gguf * ci: disable bark tests (too cpu-intensive) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * refinements * use intel base image * Makefile: Add docker targets * Change test model --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-08 20:12:51 +01:00
Ettore Di Giacinto	e0632f2ce2	fix(llama.cpp): downgrade to fix sycl build Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-07 00:16:52 +01:00
LocalAI [bot]	d8b17795d7	⬆️ Update ggerganov/llama.cpp (#1683 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-06 09:26:01 +01:00
LocalAI [bot]	8ace0a9ba7	⬆️ Update ggerganov/llama.cpp (#1681 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-04 21:59:14 +00:00
Ettore Di Giacinto	98ad93d53e	Drop ggml-based gpt2 and starcoder (supported by llama.cpp) (#1679 ) * Drop ggml-based gpt2 and starcoder (supported by llama.cpp) * Update compatibility table	2024-02-04 13:15:51 +01:00
LocalAI [bot]	38e4ec0b2a	⬆️ Update ggerganov/llama.cpp (#1678 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-04 00:55:12 +01:00
Ettore Di Giacinto	df13ba655c	Drop old falcon backend (deprecated) (#1675 ) Drop old falcon backend	2024-02-03 13:01:13 +01:00
LocalAI [bot]	7678b25755	⬆️ Update ggerganov/llama.cpp (#1673 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-02 21:46:26 +00:00
LocalAI [bot]	c87ca4f320	⬆️ Update ggerganov/llama.cpp (#1669 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-02 19:14:03 +01:00
Ettore Di Giacinto	1c57f8d077	feat(sycl): Add support for Intel GPUs with sycl (#1647 ) (#1660 ) * feat(sycl): Add sycl support (#1647) * onekit: install without prompts * set cmake args only in grpc-server Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * fixup sycl source env * Cleanup docs * ci: runs on self-hosted * fix typo * bump llama.cpp * llama.cpp: update server * adapt to upstream changes * adapt to upstream changes * docs: add sycl --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-01 19:21:52 +01:00
LocalAI [bot]	16cebf0390	⬆️ Update ggerganov/llama.cpp (#1665 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-30 23:38:05 +00:00
LocalAI [bot]	c1bae1ee81	⬆️ Update ggerganov/llama.cpp (#1656 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-30 00:43:36 +01:00
LocalAI [bot]	abd678e147	⬆️ Update ggerganov/llama.cpp (#1655 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-28 09:24:44 +01:00
LocalAI [bot]	f928899338	⬆️ Update ggerganov/llama.cpp (#1652 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-27 00:13:38 +01:00
LocalAI [bot]	ac19998e5e	⬆️ Update ggerganov/llama.cpp (#1644 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-26 00:13:39 +01:00
LocalAI [bot]	3733250b3c	⬆️ Update ggerganov/llama.cpp (#1642 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-24 22:51:59 +01:00
LocalAI [bot]	7690caf020	⬆️ Update ggerganov/llama.cpp (#1632 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-23 23:07:51 +01:00
LocalAI [bot]	efe2883c5d	⬆️ Update ggerganov/llama.cpp (#1626 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-22 23:22:01 +01:00
LocalAI [bot]	47237c7c3c	⬆️ Update ggerganov/llama.cpp (#1623 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-22 08:54:06 +01:00
LocalAI [bot]	6a88b030ea	⬆️ Update ggerganov/llama.cpp (#1620 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-20 23:34:46 +01:00
LocalAI [bot]	b2dc5fbd7e	⬆️ Update ggerganov/llama.cpp (#1612 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-20 00:38:14 +01:00
Ettore Di Giacinto	9e653d6abe	feat: 🐍 add mamba support (#1589 ) feat(mamba): Initial import This is a first iteration of the mamba backend, loosely based on mamba-chat (https://github.com/havenhq/mamba-chat).	2024-01-19 23:42:50 +01:00
Ettore Di Giacinto	3a253c6cd7	Makefile: allow to build without GRPC_BACKENDS (#1607 )	2024-01-19 15:38:43 +01:00
LocalAI [bot]	23d64ac53a	⬆️ Update ggerganov/llama.cpp (#1604 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-18 21:20:50 +00:00
LocalAI [bot]	b5c93f176a	⬆️ Update ggerganov/llama.cpp (#1599 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-18 14:39:30 +01:00
LocalAI [bot]	1aaf88098d	⬆️ Update ggerganov/llama.cpp (#1597 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-17 09:27:02 +01:00
LocalAI [bot]	dfb7c3b1aa	⬆️ Update ggerganov/llama.cpp (#1594 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-16 14:47:57 +01:00
Dionysius	b41eb5e1f3	prepend built binaries in PATH for BUILD_GRPC_FOR_BACKEND_LLAMA (#1593 ) prepend built binaries in PATH	2024-01-16 14:47:47 +01:00
LocalAI [bot]	9c2d264979	⬆️ Update ggerganov/llama.cpp (#1590 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-15 09:01:07 +01:00
LocalAI [bot]	b996c3198c	⬆️ Update ggerganov/llama.cpp (#1587 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-14 09:46:47 +00:00
Dionysius	441e2965ff	move BUILD_GRPC_FOR_BACKEND_LLAMA logic to makefile: errors in this section now immediately fail the build (#1576 ) * move BUILD_GRPC_FOR_BACKEND_LLAMA option to makefile * review: oversight, fixup cmake_args Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Dionysius <1341084+dionysius@users.noreply.github.com> --------- Signed-off-by: Dionysius <1341084+dionysius@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-01-13 10:08:26 +01:00
LocalAI [bot]	cbe9a03e3c	⬆️ Update ggerganov/llama.cpp (#1583 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-12 23:04:04 +01:00
LocalAI [bot]	4ee7e73d00	⬆️ Update ggerganov/llama.cpp (#1578 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-12 16:04:33 +01:00
LocalAI [bot]	faf7c1c325	⬆️ Update ggerganov/llama.cpp (#1573 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-11 08:41:32 +01:00
LocalAI [bot]	58288494d6	⬆️ Update ggerganov/llama.cpp (#1568 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-10 10:18:57 +01:00
Dionysius	72283dc744	minor: replace shell pwd in Makefile with CURDIR for better windows compatibility (#1571 ) replace shell pwd in Makefile with CURDIR	2024-01-10 08:39:50 +00:00
LocalAI [bot]	2e890b3838	⬆️ Update ggerganov/llama.cpp (#1563 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-09 08:48:40 +01:00
LocalAI [bot]	574fa67bdc	⬆️ Update ggerganov/llama.cpp (#1558 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-08 00:38:03 +01:00
LocalAI [bot]	0a06c80801	⬆️ Update ggerganov/llama.cpp (#1547 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-05 23:27:51 +01:00
LocalAI [bot]	d48faf35ab	⬆️ Update ggerganov/llama.cpp (#1544 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-04 00:08:03 +01:00
LocalAI [bot]	7e1d8c489b	⬆️ Update ggerganov/llama.cpp (#1533 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-03 08:43:35 +01:00
LocalAI [bot]	de28867374	⬆️ Update ggerganov/llama.cpp (#1531 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-02 00:28:22 +00:00
Ettore Di Giacinto	fd48cb6506	deps(llama.cpp): update and sync grpc server (#1527 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-01-01 14:39:31 +01:00
LocalAI [bot]	27686ff20b	⬆️ Update ggerganov/llama.cpp (#1518 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-31 00:19:08 +00:00
LocalAI [bot]	5b0dc20e4c	⬆️ Update ggerganov/llama.cpp (#1509 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-30 09:19:07 +00:00
LocalAI [bot]	6428003c3b	⬆️ Update ggerganov/llama.cpp (#1503 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-28 22:44:50 +01:00
LocalAI [bot]	2eac4f93bb	⬆️ Update ggerganov/llama.cpp (#1501 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-28 00:51:29 +00:00
LocalAI [bot]	c45f581c47	⬆️ Update ggerganov/llama.cpp (#1496 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-26 19:15:58 -05:00
LocalAI [bot]	4ca649154d	⬆️ Update ggerganov/llama.cpp (#1495 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-26 17:53:59 +00:00
LocalAI [bot]	9789f5a96a	⬆️ Update ggerganov/llama.cpp (#1492 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-25 02:43:35 -05:00
Gianluca Boiano	cae7b197ec	feat: add tiny dream stable diffusion support (#1283 ) Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2023-12-24 19:27:24 +00:00
Ettore Di Giacinto	95eb72bfd3	feat: add 🐸 coqui (#1489 ) * feat: add coqui * docs: update news	2023-12-24 19:38:54 +01:00
LocalAI [bot]	eaa899df63	⬆️ Update ggerganov/whisper.cpp (#1483 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-24 02:53:29 -05:00
LocalAI [bot]	16ed0bd0c5	⬆️ Update ggerganov/llama.cpp (#1482 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-24 02:53:12 -05:00
LocalAI [bot]	51215d480a	⬆️ Update ggerganov/whisper.cpp (#1480 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-23 09:11:40 +00:00
LocalAI [bot]	987f0041d3	⬆️ Update ggerganov/llama.cpp (#1469 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-23 00:05:56 +00:00
LocalAI [bot]	a29de9bf50	⬆️ Update donomii/go-rwkv.cpp (#1478 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-22 15:02:32 +01:00
LocalAI [bot]	9bd5831fda	⬆️ Update ggerganov/whisper.cpp (#1479 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-22 08:26:39 +01:00
Ettore Di Giacinto	9ae47d37e9	pin go-rwkv Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-21 08:42:40 +01:00
Ettore Di Giacinto	2b3ad7f41c	Revert "⬆️ Update donomii/go-rwkv.cpp" (#1474 ) Revert "⬆️ Update donomii/go-rwkv.cpp (#1470)" This reverts commit `51db10b18f`.	2023-12-21 08:38:50 +01:00
LocalAI [bot]	51db10b18f	⬆️ Update donomii/go-rwkv.cpp (#1470 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-21 08:35:31 +01:00
LocalAI [bot]	23eced1644	⬆️ Update ggerganov/llama.cpp (#1461 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-20 18:02:52 +01:00
LocalAI [bot]	7741a6e75d	⬆️ Update ggerganov/whisper.cpp (#1462 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-20 00:21:49 +00:00
LocalAI [bot]	d4210db0c9	⬆️ Update ggerganov/llama.cpp (#1457 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-19 00:42:19 +01:00
LocalAI [bot]	64a8471dd5	⬆️ Update ggerganov/llama.cpp (#1455 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-18 08:55:29 +01:00
LocalAI [bot]	86a8df1c8b	⬆️ Update ggerganov/llama.cpp (#1450 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-17 19:02:28 +01:00
LocalAI [bot]	2f7beb6744	⬆️ Update ggerganov/whisper.cpp (#1434 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-16 09:22:28 +01:00
LocalAI [bot]	ab0370a0b9	⬆️ Update ggerganov/llama.cpp (#1429 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-16 09:22:13 +01:00
LocalAI [bot]	3f9a41684a	⬆️ Update mudler/go-piper (#1441 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-16 09:21:56 +01:00
Ettore Di Giacinto	fb6a5bc620	update(llama.cpp): update server, correctly propagate LLAMA_VERSION (#1440 ) * fix(Makefile): correctly propagate LLAMA_VERSION Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * update grpc-server.cpp --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-15 08:26:48 +01:00
Ettore Di Giacinto	7641f92cde	feat(diffusers): update, add autopipeline, controlnet (#1432 ) * feat(diffusers): update, add autopipeline, controlenet * tests with AutoPipeline * simplify logic	2023-12-13 19:20:22 +01:00
LocalAI [bot]	72325fd0a3	⬆️ Update ggerganov/whisper.cpp (#1430 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-13 08:37:02 +01:00
LocalAI [bot]	86fac272d8	⬆️ Update ggerganov/llama.cpp (#1391 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-12 18:22:48 +01:00
LocalAI [bot]	4a965e1b0e	⬆️ Update ggerganov/whisper.cpp (#1418 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-11 08:24:48 +01:00
Ettore Di Giacinto	48e5380e45	tests: add diffusers tests (#1419 )	2023-12-11 08:20:34 +01:00
LocalAI [bot]	831418612b	⬆️ Update mudler/go-piper (#1400 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-10 08:50:26 +01:00
LocalAI [bot]	89ff12309d	⬆️ Update ggerganov/whisper.cpp (#1390 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-09 09:23:40 +01:00
Ettore Di Giacinto	887b3dff04	feat: cuda transformers (#1401 ) * Use cuda in transformers if available tensorflow probably needs a different check. Signed-off-by: Erich Schubert <kno10@users.noreply.github.com> * feat: expose CUDA at top level Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * tests: add to tests and create workflow for py extra backends * doc: update note on how to use core images --------- Signed-off-by: Erich Schubert <kno10@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Erich Schubert <kno10@users.noreply.github.com>	2023-12-08 15:45:04 +01:00
Dave	8b6e601405	Feat: new backend: transformers-musicgen (#1387 ) Transformers-MusicGen --------- Signed-off-by: Dave <dave@gray101.com>	2023-12-08 10:01:02 +01:00
Ettore Di Giacinto	6011911746	fix(piper): pin petals, phonemize and espeak (#1393 ) * fix: pin phonemize and espeak Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: pin petals deps --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-12-07 22:58:41 +01:00
LocalAI [bot]	997119c27a	⬆️ Update ggerganov/llama.cpp (#1385 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-05 15:44:24 +01:00
Ettore Di Giacinto	2b2d6673ff	exllama(v2): fix exllamav1, add exllamav2 (#1384 ) * fix(exllama): fix exllama deps with anaconda Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(exllamav2): add exllamav2 backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-12-05 08:15:37 +01:00
LocalAI [bot]	67966b623c	⬆️ Update ggerganov/llama.cpp (#1379 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-04 18:36:34 +01:00
LocalAI [bot]	9fc3fd04be	⬆️ Update ggerganov/whisper.cpp (#1378 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-04 18:36:22 +01:00
LocalAI [bot]	3d71bc9b64	⬆️ Update ggerganov/whisper.cpp (#1227 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-03 01:16:07 +01:00
Felix Erkinger	3923024d84	update whisper_cpp with CUBLAS, HIPBLAS, METAL, OPENBLAS, CLBLAST support (#1302 ) update whisper_cpp to 1.5.1 with OPENBLAS, METAL, HIPBLAS, CUBLAS, CLBLAST support	2023-12-02 10:10:18 +00:00
LocalAI [bot]	42a80d1b8b	⬆️ Update ggerganov/llama.cpp (#1375 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-12-02 00:09:48 +00:00
Dave	e94a34be8c	fix: OSX Build Fix Part 1: Metal (#1365 ) * Make Metal the default on OSX, simplify osx-specific code, and fix the file copy error. * fix endif / comment	2023-11-30 19:50:50 +01:00
LocalAI [bot]	9f708ff318	⬆️ Update ggerganov/llama.cpp (#1363 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-30 00:06:28 +01:00
LocalAI [bot]	519285bf38	⬆️ Update ggerganov/llama.cpp (#1351 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-29 08:29:03 +01:00
Gianluca Boiano	687730a7f5	fix: go-piper add libucd at linking time (#1357 ) Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2023-11-28 19:55:09 +00:00
Ettore Di Giacinto	b7821361c3	feat(petals): add backend (#1350 ) * feat(petals): add backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-28 09:01:46 +01:00
LocalAI [bot]	63e1f8fffd	⬆️ Update ggerganov/llama.cpp (#1345 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-27 09:02:19 +01:00
LocalAI [bot]	9482acfdfc	⬆️ Update ggerganov/llama.cpp (#1340 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-26 09:27:42 +01:00
Ettore Di Giacinto	6f34e8f044	fix: propagate CMAKE_ARGS when building grpc (#1334 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-25 13:53:51 +01:00
Ettore Di Giacinto	6d187af643	fix: handle grpc and llama-cpp with REBUILD=true (#1328 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-25 08:48:24 +01:00
LocalAI [bot]	97e9598c79	⬆️ Update ggerganov/llama.cpp (#1330 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-24 23:45:05 +01:00
LocalAI [bot]	b1a20effde	⬆️ Update ggerganov/llama.cpp (#1323 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-24 08:32:36 +01:00
Dave	69f53211a1	Feat: OSX Local Codesigning (#1319 ) * stage makefile * OSX local code signing and entitlements file to fix incoming connections prompt	2023-11-23 15:22:54 +01:00
LocalAI [bot]	763f94ca80	⬆️ Update ggerganov/llama.cpp (#1313 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-22 08:37:11 +01:00
LocalAI [bot]	480b14c8dc	⬆️ Update ggerganov/llama.cpp (#1310 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-21 00:20:37 +01:00
Ettore Di Giacinto	92cbc4d516	feat(transformers): add embeddings with Automodel (#1308 ) * Update huggingface.py Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2 Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> * feat(transformers): split in separate backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Lucas Hänke de Cansino <lhc@next-boss.eu>	2023-11-20 21:21:17 +01:00
LocalAI [bot]	ff9afdb0fe	⬆️ Update ggerganov/llama.cpp (#1306 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-20 08:16:00 +01:00
LocalAI [bot]	3e35b20a02	⬆️ Update mudler/go-piper (#1305 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-19 09:01:40 +01:00
LocalAI [bot]	9ea371d6cd	⬆️ Update ggerganov/llama.cpp (#1304 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-19 08:49:05 +01:00
LocalAI [bot]	b5af87fc6c	⬆️ Update ggerganov/llama.cpp (#1300 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-18 08:19:10 +01:00
Ettore Di Giacinto	3c9544b023	refactor: rename llama-stable to llama-ggml (#1287 ) * refactor: rename llama-stable to llama-ggml * Makefile: get sources in sources/ Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup sources Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups sd Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update SD * fixup * fixup: create piper libdir also when not built Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix make target on linux test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-18 08:18:43 +01:00
LocalAI [bot]	8c5436cbed	⬆️ Update ggerganov/llama.cpp (#1297 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-17 08:45:22 +01:00
LocalAI [bot]	2addb9f99a	⬆️ Update ggerganov/llama.cpp (#1291 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-16 08:20:26 +01:00
LocalAI [bot]	733b612eb2	⬆️ Update ggerganov/llama.cpp (#1288 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-15 18:41:09 +01:00
LocalAI [bot]	991ecce004	⬆️ Update ggerganov/llama.cpp (#1285 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-14 18:23:09 +01:00
Ettore Di Giacinto	ad0e30bca5	refactor: move backends into the backends directory (#1279 ) * refactor: move backends into the backends directory Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor: move main close to implementation for every backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-13 22:40:16 +01:00
LocalAI [bot]	55461188a4	⬆️ Update ggerganov/llama.cpp (#1282 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-13 00:48:26 +00:00
LocalAI [bot]	5d2405fdef	⬆️ Update ggerganov/llama.cpp (#1280 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-11 23:26:54 +00:00
LocalAI [bot]	e9f1268225	⬆️ Update ggerganov/llama.cpp (#1272 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-11 20:00:28 +00:00
Gianluca Boiano	bde87d00b9	deps(go-piper): update to 2023.11.6-3 (#1257 ) Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2023-11-11 18:40:26 +01:00
LocalAI [bot]	3b4c5d54d8	⬆️ Update ggerganov/llama.cpp (#1265 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-10 08:50:42 +01:00
LocalAI [bot]	4e16bc2f13	⬆️ Update ggerganov/llama.cpp (#1256 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-08 08:21:12 +01:00
LocalAI [bot]	562ac62f59	⬆️ Update ggerganov/llama.cpp (#1242 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-07 08:37:55 +01:00
Diego	e7fa2e06f8	Fixes the bug 1196 (#1232 ) * Current state of the branch. * Now gRPC is build only when the BUILD_GRPC_FOR_BACKEND_LLAMA variable is defined. * Now the local compilation of gRPC is executed on BUILD_GRPC_FOR_BACKEND_LLAMA. * Revised the Makefile. * Removed replace directives in go.mod. --------- Signed-off-by: Diego <38375572+diego-minguzzi@users.noreply.github.com> Co-authored-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-11-06 19:07:46 +01:00
Ettore Di Giacinto	622aaa9f7d	dockerfile: avoid pushing a big layer Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-05 10:31:33 +01:00
Ettore Di Giacinto	7b1ee203ce	tests: re-add flake-attempts Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-05 09:01:03 +01:00
Ettore Di Giacinto	f347e51927	feat(conda): conda environments (#1144 ) * feat(autogptq): add a separate conda environment for autogptq (#1137) Description This PR related to #1117 Notes for Reviewers Here we lock down the version of the dependencies. Make sure it can be used all the time without failed if the version of dependencies were upgraded. I change the order of importing packages according to the pylint, and no change the logic of code. It should be ok. I will do more investigate on writing some test cases for every backend. I can run the service in my environment, but there is not exist a way to test it. So, I am not confident on it. Add a README.md in the `grpc` root. This is the common commands for creating `conda` environment. And it can be used to the reference file for creating extral gRPC backend document. Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * [Extra backend] Add seperate environment for ttsbark (#1141) Description This PR relates to #1117 Notes for Reviewers Same to the latest PR: * The code is also changed, but only the order of the import package parts. And some code comments are also added. * Add a configuration of the `conda` environment * Add a simple test case for testing if the service can be startup in current `conda` environment. It is succeed in VSCode, but the it is not out of box on terminal. So, it is hard to say the test case really useful. [Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin) - [x] Yes, I signed my commits. <!-- Thank you for contributing to LocalAI! Contributing Conventions ------------------------- The draft above helps to give a quick overview of your PR. Remember to remove this comment and to at least: 1. Include descriptive PR titles with [<component-name>] prepended. We use [conventional commits](https://www.conventionalcommits.org/en/v1.0.0/). 2. Build and test your changes before submitting a PR (`make build`). 3. Sign your commits 4. Tag maintainer: for a quicker response, tag the relevant maintainer (see below). 5. X/Twitter handle: we announce bigger features on X/Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! By following the community's contribution conventions upfront, the review process will be accelerated and your PR merged more quickly. If no one reviews your PR within a few days, please @-mention @mudler. --> Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda): add make target and entrypoints for the dockerfile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda): Add seperate conda env for diffusers (#1145) Description This PR relates to #1117 Notes for Reviewers * Add `conda` env `diffusers.yml` * Add Makefile to create it automatically * Add `run.sh` to support running as a extra backend * Also adding it to the main Dockerfile * Add make command in the root Makefile * Testing the server, it can start up under the env Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda):Add seperate env for vllm (#1148) Description This PR is related to #1117 Notes for Reviewers * The gRPC server can be started as normal * The test case can be triggered in VSCode * Same to other this kind of PRs, add `vllm.yml` Makefile and add `run.sh` to the main Dockerfile, and command to the main Makefile [Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin) - [x] Yes, I signed my commits. Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda):Add seperate env for huggingface (#1146) Description This PR is related to #1117 Notes for Reviewers * Add conda env `huggingface.yml` * Change the import order, and also remove the no-used packages * Add `run.sh` and `make command` to the main Dockerfile and Makefile * Add test cases for it. It can be triggered and succeed under VSCode Python extension but it is hang by using `python -m unites test_huggingface.py` in the terminal ``` Running tests (unittest): /workspaces/LocalAI/extra/grpc/huggingface Running tests: /workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_embedding /workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_load_model /workspaces/LocalAI/extra/grpc/huggingface/test_huggingface.py::TestBackendServicer::test_server_startup ./test_huggingface.py::TestBackendServicer::test_embedding Passed ./test_huggingface.py::TestBackendServicer::test_load_model Passed ./test_huggingface.py::TestBackendServicer::test_server_startup Passed Total number of tests expected to run: 3 Total number of tests run: 3 Total number of tests passed: 3 Total number of tests failed: 0 Total number of tests failed with errors: 0 Total number of tests skipped: 0 Finished running tests! ``` [Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin) - [x] Yes, I signed my commits. Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda): Add the seperate conda env for VALL-E X (#1147) Description This PR is related to #1117 Notes for Reviewers * The gRPC server cannot start up ``` (ttsvalle) @Aisuko ➜ /workspaces/LocalAI (feat/vall-e-x) $ /opt/conda/envs/ttsvalle/bin/python /workspaces/LocalAI/extra/grpc/vall-e-x/ttsvalle.py Traceback (most recent call last): File "/workspaces/LocalAI/extra/grpc/vall-e-x/ttsvalle.py", line 14, in <module> from utils.generation import SAMPLE_RATE, generate_audio, preload_models ModuleNotFoundError: No module named 'utils' ``` The installation steps follow https://github.com/Plachtaa/VALL-E-X#-installation below: * Under the `ttsvalle` conda env ``` git clone https://github.com/Plachtaa/VALL-E-X.git cd VALL-E-X pip install -r requirements.txt ``` [Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin) - [x] Yes, I signed my commits. Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: set image type Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(conda):Add seperate conda env for exllama (#1149) Add seperate env for exllama Signed-off-by: Aisuko <urakiny@gmail.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Setup conda Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Set image_type arg Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: prepare only conda env in tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Dockerfile: comment manual pip calls Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * conda: add conda to PATH Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixes * add shebang * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * file perms Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * debug * Install new conda in the worker * Disable GPU tests for now until the worker is back * Rename workflows * debug * Fixup conda install * fixup(wrapper): pass args Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Aisuko <urakiny@gmail.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Aisuko <urakiny@gmail.com>	2023-11-04 15:30:32 +01:00
LocalAI [bot]	9b17af18b3	⬆️ Update ggerganov/llama.cpp (#1236 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2023-11-03 19:23:53 +01:00

599 Commits