LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-06-12 03:58:19 +00:00

Author	SHA1	Message	Date
Ludovic Leroux	939411300a	Bump vLLM version + more options when loading models in vLLM (#1782 ) * Bump vLLM version to 0.3.2 * Add vLLM model loading options * Remove transformers-exllama * Fix install exllama	2024-03-01 22:48:53 +01:00
Dave	1c312685aa	refactor: move remaining api packages to core (#1731 ) * core 1 * api/openai/files fix * core 2 - core/config * move over core api.go and tests to the start of core/http * move over localai specific endpoints to core/http, begin the service/endpoint split there * refactor big chunk on the plane * refactor chunk 2 on plane, next step: port and modify changes to request.go * easy fixes for request.go, major changes not done yet * lintfix * json tag lintfix? * gitignore and .keep files * strange fix attempt: rename the config dir?	2024-03-01 16:19:53 +01:00
LocalAI [bot]	316de82f51	⬆️ Update ggerganov/llama.cpp (#1779 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-29 22:33:30 +00:00
Ettore Di Giacinto	9068bc5271	Create SECURITY.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-29 19:53:04 +01:00
Oussama	31a4c9c9d3	Fix Command Injection Vulnerability (#1778 ) * Added fix for command injection * changed function name from sh to runCommand	2024-02-29 18:32:29 +00:00
Ettore Di Giacinto	c1966af2cf	ci: reduce stress on self-hosted runners (#1776 ) Split jobs by self-hosted and free public runner provided by Github Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-29 11:40:08 +01:00
LocalAI [bot]	c665898652	⬆️ Update donomii/go-rwkv.cpp (#1771 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:50:27 +00:00
LocalAI [bot]	f651a660aa	⬆️ Update ggerganov/llama.cpp (#1772 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:02:30 +01:00
Ettore Di Giacinto	ba672b51da	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-28 16:03:38 +01:00
Ettore Di Giacinto	be498c5dd9	Update openai-functions.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-28 15:58:31 +01:00
Ettore Di Giacinto	6e95beccb9	Update overview.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-28 15:24:08 +01:00
Ettore Di Giacinto	c8be839481	Update openai-functions.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-27 23:24:46 +01:00
LocalAI [bot]	c7e08813a5	⬆️ Update ggerganov/llama.cpp (#1767 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 23:12:51 +01:00
LocalAI [bot]	d21a6b33ab	⬆️ Update ggerganov/llama.cpp (#1756 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 18:07:51 +00:00
Joshua Waring	9112cf153e	Update integrations.md (#1765 ) Added Jetbrains compatible plugin for LocalAI Signed-off-by: Joshua Waring <Joshhua5@users.noreply.github.com>	2024-02-27 17:35:59 +01:00
Ettore Di Giacinto	3868ac8402	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-27 15:44:15 +01:00
Ettore Di Giacinto	3f09010227	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-27 15:43:15 +01:00
Ettore Di Giacinto	d6cf82aba3	fix(tests): re-enable tests after code move (#1764 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-27 15:04:19 +01:00
Ettore Di Giacinto	dfe54639b1	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-27 10:37:56 +01:00
Ettore Di Giacinto	bc5f5aa538	deps(llama.cpp): update (#1759 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-26 13:18:44 +01:00
Ettore Di Giacinto	05818e0425	fix(functions): handle correctly when there are no results (#1758 )	2024-02-26 08:38:23 +01:00
Sertaç Özercan	7f72a61104	ci: add stablediffusion to release (#1757 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-02-25 23:06:18 +00:00
LocalAI [bot]	8e45d47740	⬆️ Update ggerganov/llama.cpp (#1753 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-25 10:03:19 +01:00
LocalAI [bot]	71771d1e9b	⬆️ Update docs version mudler/LocalAI (#1752 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-25 10:02:52 +01:00
Ettore Di Giacinto	aa098e4d0b	fix(sse): do not omit empty finish_reason (#1745 ) Fixes https://github.com/mudler/LocalAI/issues/1744	2024-02-24 11:51:59 +01:00
Ludovic Leroux	0135e1e3b9	fix: vllm - use AsyncLLMEngine to allow true streaming mode (#1749 ) * fix: use vllm AsyncLLMEngine to bring true stream Current vLLM implementation uses the LLMEngine, which was designed for offline batch inference, which results in the streaming mode outputing all blobs at once at the end of the inference. This PR reworks the gRPC server to use asyncio and gRPC.aio, in combination with vLLM's AsyncLLMEngine to bring true stream mode. This PR also passes more parameters to vLLM during inference (presence_penalty, frequency_penalty, stop, ignore_eos, seed, ...). * Remove unused import	2024-02-24 11:48:45 +01:00
LocalAI [bot]	ff88c390bb	⬆️ Update ggerganov/llama.cpp (#1750 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com> v2.9.0	2024-02-24 00:06:46 +01:00
LocalAI [bot]	d825821a22	⬆️ Update ggerganov/llama.cpp (#1740 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-23 00:07:15 +01:00
Luna Midori	cbed6ab1bb	Update README.md (#1739 ) * Update README.md Signed-off-by: Luna Midori <118759930+lunamidori5@users.noreply.github.com> * Update README.md Signed-off-by: Luna Midori <118759930+lunamidori5@users.noreply.github.com> --------- Signed-off-by: Luna Midori <118759930+lunamidori5@users.noreply.github.com>	2024-02-22 16:35:06 +01:00
LocalAI [bot]	6fc122fa1a	⬆️ Update ggerganov/llama.cpp (#1705 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-22 09:33:23 +00:00
Ettore Di Giacinto	feba38be36	examples(mistral-openorca): add stopword Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-22 00:15:08 +01:00
Ettore Di Giacinto	ba85d0bcad	feat(upload-api): do not display error if uploadedFiles.json is not present Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-22 00:15:08 +01:00
Ettore Di Giacinto	ad3623dd8d	examples(phi-2): strip newline at the end of the prompt template Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-21 23:17:51 +01:00
Ettore Di Giacinto	8292781045	deps(llama.cpp): update, support Gemma models (#1734 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 17:23:38 +01:00
Ettore Di Giacinto	54ec6348fa	deps(llama.cpp): update (#1714 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 11:35:44 +01:00
Dave	255748bcba	MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 ) This PR specifically introduces a `core` folder and moves the following packages over, without any other changes: - `api/backend` - `api/config` - `api/options` - `api/schema` Once this is merged and we confirm there's no regressions, I can migrate over the remaining changes piece by piece to split up application startup, backend services, http, and mqtt as was the goal of the earlier PRs!	2024-02-21 01:21:19 +00:00
Chakib Benziane	594eb468df	Add TTS dependency for cuda based builds fixes #1727 (#1730 ) Signed-off-by: Chakib Benziane <contact@blob42.xyz>	2024-02-20 21:59:43 +01:00
Ettore Di Giacinto	960d314e4f	feat(tools): Parallel function calling (#1726 ) feat(tools): support returning multiple tools choices Fixes: https://github.com/mudler/LocalAI/issues/1275	2024-02-20 21:58:45 +01:00
Ettore Di Giacinto	ed3b50622b	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-20 19:55:36 +01:00
Ettore Di Giacinto	9f2235c208	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-19 19:49:00 +01:00
Ettore Di Giacinto	4ec50bfc41	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-19 19:03:09 +01:00
Ettore Di Giacinto	51b67a247a	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-18 13:37:16 +01:00
Steven Christou	01205fd4c0	Initial implementation of upload files api. (#1703 ) * Initial implementation of upload files api. * Move sanitize method to utils. * Save uploaded data to uploads folder. * Avoid loop if we do not have a purpose. * Minor cleanup of api and fix bug where deleting duplicate filename cause error. * Revert defer of saving config * Moved creation of directory to startup. * Make file names unique when storing on disk. * Add test for files api. * Update dependencies.	2024-02-18 10:12:02 +00:00
Ettore Di Giacinto	c72808f18b	feat(tools): support Tool calls in the API (#1715 ) * feat(tools): support Tools in the API Co-authored-by: =?UTF-8?q?Stephan=20A=C3=9Fmus?= <stephan.assmus@sap.com> * feat(tools): support function streaming * Adhere to new return types when using tools instead of functions * Keep backward compatibility with function calling * Evaluate function names in chat templates * Disable recovery with --debug * Correctly stream out the entire result * Detect when llm chooses to reply and to not perform any action in SSE * Feedback from code review --------- Co-authored-by: =?UTF-8?q?Stephan=20A=C3=9Fmus?= <stephan.assmus@sap.com>	2024-02-17 10:00:34 +01:00
Ettore Di Giacinto	6b539a2972	Update README.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-16 15:22:35 +01:00
LocalAI [bot]	2151d21862	⬆️ Update docs version mudler/LocalAI (#1718 ) * ⬆️ Update docs version mudler/LocalAI Signed-off-by: GitHub <noreply@github.com> * Update docs/data/version.json Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-16 15:11:53 +01:00
fenfir	fb0a4c5d9a	Build docker container for ROCm (#1595 ) * Dockerfile changes to build for ROCm * Adjust linker flags for ROCm * Update conda env for diffusers and transformers to use ROCm pytorch * Update transformers conda env for ROCm * ci: build hipblas images * fixup rebase * use self-hosted Signed-off-by: mudler <mudler@localai.io> * specify LD_LIBRARY_PATH only when BUILD_TYPE=hipblas --------- Signed-off-by: mudler <mudler@localai.io> Co-authored-by: mudler <mudler@localai.io>	2024-02-16 15:08:50 +01:00
Ettore Di Giacinto	e690bf387a	fix(tts): fix regression when supplying backend from requests (#1713 ) fixes #1707 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> v2.8.2	2024-02-15 17:33:06 +01:00
Ettore Di Giacinto	5e155fb081	fix(python): pin exllama2 (#1711 ) fix(python): pin python deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> v2.8.1	2024-02-14 21:44:12 +01:00
Ettore Di Giacinto	39a6b562cf	fix(llama.cpp): downgrade to a known working version (#1706 ) sycl support is broken otherwise. See upstream issue: https://github.com/ggerganov/llama.cpp/issues/5469 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-14 10:28:06 +01:00

1 2 3 4 5 ...

1228 Commits