LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-06-16 14:08:09 +00:00

Author	SHA1	Message	Date
LocalAI [bot]	1395e505cd	⬆️ Update ggerganov/llama.cpp (#1897 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com> v2.11.0	2024-03-26 00:34:10 +01:00
LocalAI [bot]	42a4c86dca	⬆️ Update ggerganov/whisper.cpp (#1896 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:33:46 +01:00
Ettore Di Giacinto	c9adc5680c	fix(aio): make image-gen for GPU functional, update docs (#1895 ) * readme: update quickstart * aio(gpu): fix dreamshaper * tests(aio): allow to run tests also against an endpoint * docs: split content * tests: less verbosity --------- Co-authored-by: Dave <dave@gray101.com>	2024-03-25 21:04:32 +00:00
Enrico Ros	08c7b17298	Fix NVIDIA VRAM detection on WSL2 environments (#1894 ) * NVIDIA VRAM detection on WSL2 environments More robust single NVIDIA GPU memory detection, following the improved NVIDIA WSL2 detection patch yesterday #1891. Tested and working on WSL2, Linux. Signed-off-by: Enrico Ros <enrico.ros@gmail.com> * Update aio/entrypoint.sh Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Enrico Ros <enrico.ros@gmail.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-25 18:36:18 +01:00
Enrico Ros	5e12382524	NVIDIA GPU detection support for WSL2 environments (#1891 ) This change makes the assumption that "Microsoft Corporation Device 008e" is an NVIDIA CUDA device. If this is not the case, please update the hardware detection script here. Signed-off-by: Enrico Ros <enrico.ros@gmail.com> Co-authored-by: Dave <dave@gray101.com>	2024-03-25 08:32:40 +01:00
Ettore Di Giacinto	6cf99527f8	docs(aio): Add All-in-One images docs (#1887 ) * docs(aio): Add AIO images docs * add image generation link to quickstart * while reviewing I noticed this one link was missing, so quickly adding it. Signed-off-by: Dave <dave@gray101.com> Co-authored-by: Dave <dave@gray101.com>	2024-03-25 02:01:30 +00:00
LocalAI [bot]	3e293f1465	⬆️ Update ggerganov/llama.cpp (#1889 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 21:12:18 +00:00
LocalAI [bot]	0106c58181	⬆️ Update ggerganov/llama.cpp (#1885 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 14:54:01 +01:00
Ettore Di Giacinto	bd25d8049c	fix(watchdog): use ShutdownModel instead of StopModel (#1882 ) Fixes #1760	2024-03-23 16:19:57 +01:00
Ettore Di Giacinto	49cec7fd61	ci(aio): add latest tag images (#1884 ) Tangentially also fixes #1868	2024-03-23 16:08:32 +01:00
Ettore Di Giacinto	d9456f2a23	ci(aio): publish hipblas and Intel GPU images (#1883 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-03-23 15:54:14 +01:00
Ettore Di Giacinto	8495750cb8	Update release.yml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-23 15:22:26 +01:00
Ettore Di Giacinto	1f501cc1ef	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-23 10:42:14 +01:00
LocalAI [bot]	a922119c41	⬆️ Update ggerganov/llama.cpp (#1881 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-23 09:23:28 +01:00
Richard Palethorpe	643d85d2cc	feat(stores): Vector store backend (#1795 ) Add simple vector store backend Signed-off-by: Richard Palethorpe <io@richiejp.com>	2024-03-22 21:14:04 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
Ettore Di Giacinto	3bec467a91	feat(models): add phi-2-chat, llava-1.6, bakllava, cerbero (#1879 )	2024-03-22 21:12:48 +01:00
Ettore Di Giacinto	600152df23	fix(config): pass by config options, respect defaults (#1878 ) This bug had the unpleasant effect that it ignored defaults passed by the CLI. For instance threads could be changed only via model config file.	2024-03-22 20:55:11 +01:00
LocalAI [bot]	dd84c29a3d	⬆️ Update ggerganov/whisper.cpp (#1875 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:56 +01:00
LocalAI [bot]	07468c8786	⬆️ Update ggerganov/llama.cpp (#1874 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:42 +01:00
Ettore Di Giacinto	418ba02025	ci: fix typo Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-22 09:14:17 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Sebastian	743095b7d8	docs(mac): improve documentation for mac build (#1873 ) * docs(mac): Improve documentation for mac build - added documentation to build from current master - added troubleshooting information Signed-off-by: Sebastian <tauven@gmail.com> * docs(max): fix typo Signed-off-by: Sebastian <tauven@gmail.com> --------- Signed-off-by: Sebastian <tauven@gmail.com>	2024-03-21 22:08:33 +01:00
Ettore Di Giacinto	3cf64d1e7e	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-21 08:57:41 +01:00
Ettore Di Giacinto	e533dcf506	feat(functions/aio): all-in-one images, function template enhancements (#1862 ) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text	2024-03-21 01:12:20 +01:00
LocalAI [bot]	eeaf8c7ccd	⬆️ Update ggerganov/whisper.cpp (#1867 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:26:29 +00:00
LocalAI [bot]	7e34dfdae7	⬆️ Update ggerganov/llama.cpp (#1866 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:13:29 +00:00
LocalAI [bot]	e4bf51d5bd	⬆️ Update ggerganov/llama.cpp (#1864 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 09:05:53 +01:00
LocalAI [bot]	ead61bf9d5	⬆️ Update ggerganov/llama.cpp (#1857 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:03:17 +00:00
LocalAI [bot]	b12a205320	⬆️ Update docs version mudler/LocalAI (#1856 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:45 +01:00
LocalAI [bot]	621541a92f	⬆️ Update ggerganov/whisper.cpp (#1508 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:23 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate v2.10.1	2024-03-18 19:19:43 +01:00
Ettore Di Giacinto	a046dcac5e	fix(config-watcher): start only if config-directory exists (#1854 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-03-18 19:14:48 +01:00
Ettore Di Giacinto	843f93e1ab	fix(config): default to debug=false if not set (#1853 )	2024-03-18 18:59:39 +01:00
Ettore Di Giacinto	fa9e330fc6	fix(llama.cpp): fix eos without cache (#1852 )	2024-03-18 18:59:24 +01:00
Ettore Di Giacinto	b202bfaaa0	deps(whisper.cpp): update, fix cublas build (#1846 ) fix(whisper.cpp): Add stubs and -lcuda	2024-03-18 15:56:53 +01:00
LocalAI [bot]	0eb0ac7dd0	⬆️ Update ggerganov/llama.cpp (#1848 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-18 08:57:58 +01:00
LocalAI [bot]	d2b83d8357	⬆️ Update docs version mudler/LocalAI (#1847 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-17 23:08:32 +01:00
Ettore Di Giacinto	88b65f63d0	fix(go-llama): use llama-cpp as default (#1849 ) * fix(go-llama): use llama-cpp as default Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * fix(backends): drop obsoleted lines --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-03-17 23:08:22 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
Chakib Benziane	801b481beb	fixes #1051 : handle openai presence and request penalty parameters (#1817 ) * fix request debugging, disable marshalling of context fields Signed-off-by: blob42 <contact@blob42.xyz> * merge frequency_penalty request parm with config Signed-off-by: blob42 <contact@blob42.xyz> * openai: add presence_penalty parameter Signed-off-by: blob42 <contact@blob42.xyz> --------- Signed-off-by: blob42 <contact@blob42.xyz>	2024-03-17 09:43:20 +01:00
LocalAI [bot]	8967ed1601	⬆️ Update ggerganov/llama.cpp (#1840 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com> v2.10.0	2024-03-16 11:25:41 +00:00
LocalAI [bot]	5826fb8e6d	⬆️ Update mudler/go-piper (#1844 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-15 23:51:03 +00:00
Ettore Di Giacinto	89351f1a7d	feat(embeddings): do not require to be configured (#1842 ) Certain engines require knowing during model loading whether the embedding feature has to be enabled; however, it is impractical to have to set it for ALL the backends that support embeddings. There are transformers and sentencetransformers that seamlessly handle both cases, without this setting being explicitly enabled. The case exists only for ggml-based models that need to enable feature sets during model loading (and thus setting `embedding` is required); however, most of the other engines do not require this. This change disables the check done at code side, making it easier to use embeddings by not having to explicitly specify `embeddings: true`. Part of: https://github.com/mudler/LocalAI/issues/1373	2024-03-15 18:14:23 +01:00
Ettore Di Giacinto	ae2e4fc2fe	docs(transformers): add docs section about transformers (#1841 )	2024-03-15 18:13:30 +01:00
Dave	db199f61da	fix: osx build default.metallib (#1837 ) fix: osx build default.metallib (#1837) * port osx fix from refactor pr to slim pr * manually bump llama.cpp version to unstick CI?	2024-03-15 08:18:58 +00:00
LocalAI [bot]	44adbd2c75	⬆️ Update go-skynet/go-llama.cpp (#1835 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 23:06:42 +00:00
Ettore Di Giacinto	20136ca8b7	feat(tts): add Elevenlabs and OpenAI TTS compatibility layer (#1834 ) * feat(elevenlabs): map elevenlabs API support to TTS This allows elevenlabs Clients to work automatically with LocalAI by supporting the elevenlabs API. The elevenlabs server endpoint is implemented such as it is wired to the TTS endpoints. Fixes: https://github.com/mudler/LocalAI/issues/1809 * feat(openai/tts): compat layer with openai tts Fixes: #1276 * fix: adapt tts CLI	2024-03-14 23:08:34 +01:00
Dave	45d520f913	fix: OSX Build Files for llama.cpp (#1836 ) bot ate my changes, separate branch	2024-03-14 23:07:47 +01:00
fakezeta	3882130911	feat: Add Bitsandbytes quantization for transformer backend enhancement #1775 and fix: Transformer backend error on CUDA #1774 (#1823 ) * fixes #1775 and #1774 Add BitsAndBytes Quantization and fixes embedding on CUDA devices * Manage 4bit and 8 bit quantization Manage different BitsAndBytes options with the quantization: parameter in yaml * fix compilation errors on non CUDA environment	2024-03-14 23:06:30 +01:00

1905 Commits