LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-01-13 08:19:57 +00:00

Author	SHA1	Message	Date
Ettore Di Giacinto	0893d3cbbe	fix(health): do not require auth for /healthz and /readyz (#3656 ) * fix(health): do not require auth for /healthz and /readyz Fixes: #3655 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Comment so I don’t forget Adding a reminder here... --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-09-24 18:25:59 +00:00
Dave	90cacb9692	test: preliminary tests and merge fix for authv2 (#3584 ) * add api key to existing app tests, add preliminary auth test Signed-off-by: Dave Lee <dave@gray101.com> * small fix, run test Signed-off-by: Dave Lee <dave@gray101.com> * status on non-opaque Signed-off-by: Dave Lee <dave@gray101.com> * tweak auth error Signed-off-by: Dave Lee <dave@gray101.com> * exp Signed-off-by: Dave Lee <dave@gray101.com> * quick fix on real laptop Signed-off-by: Dave Lee <dave@gray101.com> * add downloader version that allows providing an auth header Signed-off-by: Dave Lee <dave@gray101.com> * stash some devcontainer fixes during testing Signed-off-by: Dave Lee <dave@gray101.com> * s2 Signed-off-by: Dave Lee <dave@gray101.com> * s Signed-off-by: Dave Lee <dave@gray101.com> * done with experiment Signed-off-by: Dave Lee <dave@gray101.com> * done with experiment Signed-off-by: Dave Lee <dave@gray101.com> * after merge fix Signed-off-by: Dave Lee <dave@gray101.com> * rename and fix Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-09-24 09:32:48 +02:00
Ettore Di Giacinto	191bc2e50a	feat(api): allow to pass audios to backends (#3603 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-09-19 12:26:53 +02:00
Ettore Di Giacinto	fbb9facda4	feat(api): allow to pass videos to backends (#3601 ) This prepares the API to receive videos as well for video understanding. It works similarly to images, where the request should be in the form: { "type": "video_url", "video_url": { "url": "url or base64 data" } } Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-09-19 11:21:59 +02:00
Ettore Di Giacinto	a53392f919	chore(refactor): drop duplicated shutdown logics (#3589 ) * chore(refactor): drop duplicated shutdown logics - Handle locking in Shutdown and CheckModelIsLoaded in a more go-idiomatic way - Drop duplicated code and re-organize shutdown code Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: drop leftover Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: improve logging and add missing locks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-09-17 16:51:40 +02:00
Dave	db1159b651	feat: auth v2 - supersedes #2894 (#3476 ) feat: auth v2 - supercedes #2894, metrics to follow later Signed-off-by: Dave Lee <dave@gray101.com>	2024-09-16 23:29:07 -04:00
Ettore Di Giacinto	cf747bcdec	feat: extract output with regexes from LLMs (#3491 ) * feat: extract output with regexes from LLMs This changset adds `extract_regex` to the LLM config. It is a list of regexes that can match output and will be used to re extract text from the LLM output. This is particularly useful for LLMs which outputs final results into tags. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add tests, enhance output in case of configuration error Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-09-13 13:27:36 +02:00
Ettore Di Giacinto	791c3ace72	feat: add endpoint to list system informations (#3449 ) * feat: add endpoint to list system informations For now, it lists the available backends, but can be expanded later on to include more system informations (such as GPU devices detected, RAM, threads configured, and so on so forth). Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show also external backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-09-05 20:44:30 +02:00
Dave	81ae92f017	feat: elevenlabs `sound-generation` api (#3355 ) * initial version of elevenlabs compatible soundgeneration api and cli command Signed-off-by: Dave Lee <dave@gray101.com> * minor cleanup Signed-off-by: Dave Lee <dave@gray101.com> * restore TTS, add test Signed-off-by: Dave Lee <dave@gray101.com> * remove stray s Signed-off-by: Dave Lee <dave@gray101.com> * fix Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-08-24 00:20:28 +00:00
Ettore Di Giacinto	fbaae8528d	fix(chat): re-generated uuid, created, and text on each request (#3359 ) This was noticed by models returning content besides function calls. Sadly we can't test that easily in the CI so it got unnoticed. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-22 10:56:05 +02:00
Ettore Di Giacinto	b510352393	chore(anime.js): drop unused (#3351 ) * fix(anime.js): correctly set the static path Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop anime.js (unused) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-21 13:10:09 +02:00
Ettore Di Giacinto	0c84c7b1cc	chore(ux): allow to create and drag dots in the animation (#3287 ) Make the animation more interactive! Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-19 20:40:55 +02:00
Ettore Di Giacinto	73c9b3598d	chore(p2p): make commands easier to copy-paste (#3273 ) chore(p2p): make box easier to copy-paste Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-19 19:58:17 +02:00
Ettore Di Giacinto	13cb7960bd	chore(ux): add animated header with anime.js in p2p sections (#3271 ) feat(p2p): add animated header with anime.js Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-19 18:05:02 +02:00
Ettore Di Giacinto	1dbb3b8abc	fix(gallery): be consistent and disable UI routes as well (#3262 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-18 09:26:29 +02:00
Ettore Di Giacinto	7278bf3de8	chore: allow to disable gallery endpoints, improve p2p connection handling (#3256 ) * Add more debug messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: allow to disable gallery endpoints Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * improve p2p messaging Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * improve error handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make sure to close the listening socket when context is exhausted Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-17 08:28:52 +02:00
Ettore Di Giacinto	3457acc48b	chore(explorer): add join instructions (#3255 ) * feat(explorer): give CLI instructions to join federated clusters Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * debug message Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-16 19:34:36 +02:00
Ettore Di Giacinto	c50e0edcb8	feat(gallery): lazy load images (#3246 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-14 12:53:42 +02:00
Ettore Di Giacinto	d6c4e751f2	feat(explorer): visual improvements (#3247 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-14 12:53:29 +02:00
Ettore Di Giacinto	9729d2ae37	feat(explorer): make possible to run sync in a separate process (#3224 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-12 19:25:44 +02:00
Ettore Di Giacinto	9e3e892ac7	feat(p2p): add network explorer and community pools (#3125 ) * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Wire up a simple explorer DB Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor: group services id so can be identified easily in the ledger table Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(discovery): discovery service now gather worker informations correctly Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(explorer): display network token Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(explorer): display form to add new networks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(explorer): stop from overwriting networks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(explorer): display only networks with active workers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(explorer): list only clusters in a network if it has online workers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * remove invalid and inactive networks if networks have no workers delete them from the database, similarly, if invalid. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: add workflow to deploy new explorer versions automatically Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build-api: build with p2p tag Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to specify a connection timeout Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * logging Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Better p2p defaults Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Set loglevel Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix dht enable Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Default to info for loglevel Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add navbar Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Slightly improve rendering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to copy the token easily Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-09 20:12:01 +02:00
Ettore Di Giacinto	8814b31805	chore: drop gpt4all.cpp (#3106 ) chore: drop gpt4all gpt4all is already supported in llama.cpp - the backend was kept for keeping compatibility with old gpt4all models (prior to gguf format). It is good time now to clean up and remove it to slim the compilation process. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-07 23:35:55 +02:00
Ettore Di Giacinto	36e185ba63	feat(p2p): allow to run multiple clusters in the same p2p network (#3128 ) feat(p2p): allow to run multiple clusters in the same network Allow to specify a network ID via CLI which allows to run multiple clusters, logically separated within the same network (by using the same shared token). Note: This segregation is not "secure" by any means, anyone having the network token can see the services available in all the network, however, this provides a way to separate the inference endpoints. This allows for instance to have a node which is both federated and having attached a set of llama.cpp workers. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-07 23:35:44 +02:00
Ettore Di Giacinto	e198347886	feat(openai): add `json_schema` format type and strict mode (#3193 ) * feat(openai): add json_schema and strict mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * handle err vs _ security scanners prefer if we put these branches in, and I tend to agree. Signed-off-by: Dave <dave@gray101.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Dave <dave@gray101.com> Co-authored-by: Dave <dave@gray101.com>	2024-08-07 15:27:02 -04:00
Ettore Di Giacinto	a36b721ca6	fix: be consistent in downloading files, check for scanner errors (#3108 ) * fix(downloader): be consistent in downloading files This PR puts some order in the downloader such as functions are re-used across several places. This fixes an issue with having uri's inside the model YAML file, it would resolve to MD5 rather then using the filename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(scanner): do raise error only if unsafeFiles are found Fixes: https://github.com/mudler/LocalAI/issues/3114 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-02 20:06:25 +02:00
Ettore Di Giacinto	d792cf115b	fix(ui): do not show duplicate entries if not installed by gallery (#3107 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-01 17:27:40 +02:00
Ettore Di Giacinto	d6a7a77f6b	fix(gallery): do clear out errors once displayed (#3033 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-28 10:28:10 +02:00
Ettore Di Giacinto	2169c3497d	feat(grammar): add llama3.1 schema (#3015 ) * wip Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * get rid of panics Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose it properly from the config Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * forgot to commit Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Remove focus on test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-26 20:11:29 +02:00
Ettore Di Giacinto	5eda7f578d	refactor: break down json grammar parser in different files (#3004 ) * refactor: break down json grammar parser in different files Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: patch to `refactor_grammars` - propagate errors (#3006) propagate errors around Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Dave Lee <dave@gray101.com> Co-authored-by: Dave <dave@gray101.com>	2024-07-25 08:41:00 +02:00
Ettore Di Giacinto	5e5037f10d	feat(p2p): warn the user to start with --p2p (#2993 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-23 10:42:51 +02:00
Ettore Di Giacinto	bf9dd1de7f	feat(functions): parse broken JSON when we parse the raw results, use dynamic rules for grammar keys (#2912 ) * feat(functions): enhance parsing with broken JSON when we parse the raw results Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * breaking: make function name by default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(grammar): dynamically generate grammars with mutating keys Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor: simplify condition Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-18 17:52:22 +02:00
Ettore Di Giacinto	607900a4bb	docs: more swagger, update docs (#2907 ) * docs(swagger): finish convering gallery section Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs: add section to explain how to install models with local-ai run Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Minor docs adjustments Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-18 11:25:21 +02:00
Ettore Di Giacinto	b8b0c7ad0b	docs(swagger): core more localai/openai endpoints (#2904 ) * docs(swagger): core more localai/openai endpoints Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix swagger descriptions for backend_monitor.go Signed-off-by: Dave <dave@gray101.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Dave <dave@gray101.com> Co-authored-by: Dave <dave@gray101.com>	2024-07-18 00:38:41 -04:00
Ettore Di Giacinto	6564e7ea01	docs(swagger): cover p2p endpoints (#2862 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-15 08:31:38 +02:00
Ettore Di Giacinto	fbd0a270b3	docs(swagger): comment LocalAI gallery endpoints and rerankers (#2854 ) * docs(swagger): comment LocalAI gallery endpoints and rerankers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update core/http/endpoints/jina/rerank.go Co-authored-by: Dave <dave@gray101.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Update core/http/endpoints/jina/rerank.go Co-authored-by: Dave <dave@gray101.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Update core/http/endpoints/localai/gallery.go Co-authored-by: Dave <dave@gray101.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Update core/http/endpoints/localai/gallery.go Co-authored-by: Dave <dave@gray101.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Dave <dave@gray101.com>	2024-07-14 05:46:42 +02:00
Ettore Di Giacinto	642f6cee75	feat(webui): show also models without a config in the welcome page (#2772 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-11 19:55:01 +02:00
Ettore Di Giacinto	59ef426fbf	feat(model-list): be consistent, skip known files from listing (#2760 ) fix(model-list): be consistent, skip known files from listing This changeset does two things: - Removes the dependency of listing models from the OpenAI schema. - Tries to reduce confusion between ListModels() in model loader and in the service - now there is only one ListModels which is in services and does not depend anymore on the OpenAI schema - The OpenAI-schema functions were moved nearby the OpenAI specific endpoints that needs the schema - Drops the ListModel Service structure as there was no real need for it. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-10 15:28:39 +02:00
Ettore Di Giacinto	f120a0c9f9	docs(swagger): enhance coverage of APIs (#2753 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-09 23:09:49 +02:00
Ettore Di Giacinto	cca881ec49	feat(p2p): Federation and AI swarms (#2723 ) * Wip p2p enhancements * get online state * Pass-by token to show in the dashboard Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Style * Minor fixups * parametrize SearchID * Refactoring * Allow to expose/bind more services Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add federation * Display federated mode in the WebUI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make federated nodes visible from the WebUI * Fix version display * improve web page * live page update * visual enhancements * enhancements * visual enhancements --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-07-08 22:04:06 +02:00
Ettore Di Giacinto	6a2a10603c	fix(talk): identify the model by ID instead of name (#2685 ) This fixes a breakage in rendering the template. Now the models passed by to the renderer have the ID field rather then Name Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-29 23:31:50 +02:00
Ettore Di Giacinto	59af0e77af	feat(ui): allow to select between all the available models in the chat (#2657 ) feat(ui): let the chat to select from all the detected models Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-25 20:22:17 +02:00
Ettore Di Giacinto	03b1cf51fd	feat(whisper): add translate option (#2649 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-24 19:21:22 +02:00
Ettore Di Giacinto	a181dd0ebc	refactor: gallery inconsistencies (#2647 ) * refactor(gallery): move under core/ Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(unarchive): do not allow symlinks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-24 17:32:12 +02:00
Dave	12513ebae0	rf: centralize base64 image handling (#2595 ) contains simple fixes to warnings and errors, removes a broken / outdated test, runs go mod tidy, and as the actual change, centralizes base64 image handling Signed-off-by: Dave Lee <dave@gray101.com>	2024-06-24 08:34:36 +02:00
Sertaç Özercan	5866fc8ded	chore: fix go.mod module (#2635 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-06-23 08:24:36 +00:00
Ettore Di Giacinto	f569237a50	feat(oci): support OCI images and Ollama models (#2628 ) * Support specifying oci:// and ollama:// for model URLs Fixes: https://github.com/mudler/LocalAI/issues/2527 Fixes: https://github.com/mudler/LocalAI/issues/1028 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Lower watcher warnings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to install ollama models from CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not keep file ownership Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Skip test on darwin Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-22 08:17:41 +02:00
Ettore Di Giacinto	06de542032	feat(talk): display an informative box, better colors (#2600 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-18 15:10:01 +02:00
Ettore Di Giacinto	e96d2d7667	feat(ui): add page to talk with voice, transcription, and tts (#2520 ) * feat(ui): add page to talk with voice, transcription, and tts Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhance graphics and status reporting Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Better UX by blocking unvalid actions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-09 00:03:26 +02:00
Ettore Di Giacinto	aae7ad9d73	feat(llama.cpp): guess model defaults from file (#2522 ) * wip: guess informations from gguf file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update go mod Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Identify llama3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not try to guess the name, as reading gguf files can be expensive Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable guessing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-08 22:13:02 +02:00
Ettore Di Giacinto	3b7a78adda	fix(stream): do not break channel consumption (#2517 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-07 17:20:42 +02:00
Sertaç Özercan	0d62594099	fix: fix chat webui response parsing (#2515 ) fix: fix chat webui Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-06-07 17:20:31 +02:00
Dave	d072835796	feat:`OpaqueErrors` to hide error information (#2486 ) * adds a new configuration option to hide all error message information from http requests --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-06-05 08:45:24 +02:00
Dave	4e1463fec2	feat: fiber CSRF (#2482 ) new config option - enables or disables the fiber csrf middleware Signed-off-by: Dave Lee <dave@gray101.com>	2024-06-04 19:43:46 +00:00
Dave	2fc6fe806b	fix: `pkg/downloader` should respect basePath for `file://` urls (#2481 ) * pass basePath down to pkg/downloader Signed-off-by: Dave Lee <dave@gray101.com> * enforce Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-06-04 14:32:47 +00:00
Ettore Di Giacinto	34527737bb	feat(webui): enhance card visibility (#2473 ) Do not let the description text to clutter, also highlight the model names Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-03 17:07:26 +02:00
Chakib Benziane	b99182c8d4	TTS API improvements (#2308 ) * update doc on COQUI_LANGUAGE env variable Signed-off-by: blob42 <contact@blob42.xyz> * return errors from tts gRPC backend Signed-off-by: blob42 <contact@blob42.xyz> * handle speaker_id and language in coqui TTS backend Signed-off-by: blob42 <contact@blob42.xyz> * TTS endpoint: add optional language paramter Signed-off-by: blob42 <contact@blob42.xyz> * tts fix: empty language string breaks non-multilingual models Signed-off-by: blob42 <contact@blob42.xyz> * allow tts param definition in config file - consolidate TTS options under `tts` config entry Signed-off-by: blob42 <contact@blob42.xyz> * tts: update doc Signed-off-by: blob42 <contact@blob42.xyz> --------- Signed-off-by: blob42 <contact@blob42.xyz> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-06-01 18:26:27 +00:00
Ettore Di Giacinto	3f7212c660	feat(functions): better free string matching, allow to expect strings after JSON (#2445 ) Allow now any non-character, both as suffix and prefix when mixed grammars are enabled Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-31 09:36:27 +02:00
Ettore Di Giacinto	5b75bf16c7	models(gallery): add Codestral (#2442 ) models(gallery): add Coderstral Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-30 18:50:26 +02:00
Prajwal S Nayak	4d98dd9ce7	feat(image): support `response_type` in the OpenAI API request (#2347 ) * Change response_format type to string to match OpenAI Spec Signed-off-by: prajwal <prajwalnayak7@gmail.com> * updated response_type type to interface Signed-off-by: prajwal <prajwalnayak7@gmail.com> * feat: correctly parse generic struct Signed-off-by: mudler <mudler@localai.io> * add tests Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: prajwal <prajwalnayak7@gmail.com> Signed-off-by: mudler <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: mudler <mudler@localai.io>	2024-05-29 14:40:54 +02:00
Ettore Di Giacinto	669cd06dd9	feat(functions): allow parallel calls with mixed/no grammars (#2432 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-28 21:06:09 +02:00
Ettore Di Giacinto	491e1d752b	feat(functions): relax mixedgrammars (#2365 ) * feat(functions): relax mixedgrammars Extend even more the functionalities and when mixed mode is enabled, tolerate also both strings and JSON in the result - in this case we make sure that the JSON can be correctly parsed. This also updates the examples and the gallery model to configure the grammar. The changeset also breaks current function/grammar configuration as it reserves now a stanza in the YAML config. For example: ```yaml function: grammar: # This allows the grammar to also return messages mixed_mode: true # Suffix to add to the grammar # prefix: '<tool_call>\n' # Force parallel calls in the grammar # parallel_calls: true ``` Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor, add a way to disable mixed json and freestring Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-22 00:14:16 +02:00
Ettore Di Giacinto	8ccd5ab040	feat(webui): statically embed js/css assets (#2348 ) * feat(webui): statically embed js/css assets Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update font assets Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-19 18:24:27 +02:00
Ettore Di Giacinto	beb598e4f9	feat(functions): mixed JSON BNF grammars (#2328 ) feat(functions): support mixed JSON BNF grammar This PR provides new options to control how functions are extracted from the LLM, and also provides more control on how JSON grammars can be used (also in conjunction). New YAML settings introduced: - `grammar_message`: when enabled, the generated grammar can also decide to push strings and not only JSON objects. This allows the LLM to pick to either respond freely or using JSON. - `grammar_prefix`: Allows to prefix a string to the JSON grammar definition. - `replace_results`: Is a map that allows to replace strings in the LLM result. As an example, consider the following settings for Hermes-2-Pro-Mistral, which allow extracting both JSON results coming from the model, and the ones coming from the grammar: ```yaml function: # disable injecting the "answer" tool disable_no_action: true # This allows the grammar to also return messages grammar_message: true # Suffix to add to the grammar grammar_prefix: '<tool_call>\n' return_name_in_function_response: true # Without grammar uncomment the lines below # Warning: this is relying only on the capability of the # LLM model to generate the correct function call. # no_grammar: true # json_regex_match: "(?s)<tool_call>(.*?)</tool_call>" replace_results: "<tool_call>": "" "\'": "\"" ``` Note: To disable entirely grammars usage in the example above, uncomment the `no_grammar` and `json_regex_match`. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-15 20:03:18 +02:00
Ettore Di Giacinto	c89271b2e4	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 ) * feat(llama.cpp): support distributed llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: let tweak how chat messages are merged together Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Makefile: register to ALL_GRPC_BACKENDS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring, allow disable auto-detection of backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * minor fixups Signed-off-by: mudler <mudler@localai.io> * feat: add cmd to start rpc-server from llama.cpp Signed-off-by: mudler <mudler@localai.io> * ci: add ccache Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>	2024-05-15 01:17:02 +02:00
Ettore Di Giacinto	84e2407afa	feat(functions): allow to set JSON matcher (#2319 ) Signed-off-by: mudler <mudler@localai.io>	2024-05-14 09:39:20 +02:00
Ettore Di Giacinto	9d8c705fd9	feat(ui): display number of available models for installation (#2298 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-12 14:24:36 +02:00
Ettore Di Giacinto	efa32a2677	feat(grammar): support models with specific construct (#2291 ) When enabling grammar with functions, it might be useful to allow more flexibility to support models that are fine-tuned against returning function calls of the form of { "name": "function_name", "arguments" {...} } rather then { "function": "function_name", "arguments": {..} }. This might call out to a more generic approach later on, but for the moment being we can easily support both as we have just to specific different types. If needed we can expand on this later on Signed-off-by: mudler <mudler@localai.io>	2024-05-12 01:13:22 +02:00
Ettore Di Giacinto	5ff5f0b393	fix(ux): fix small glitches (#2265 ) also drop duplicates for displaying in-progress model ops Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-08 19:34:33 +02:00
Ettore Di Giacinto	6559ac11b1	feat(ui): prompt for chat, support vision, enhancements (#2259 ) * feat(ui): allow to set system prompt for chat Make also the models in the index clickable, and display as table Fixes #2257 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(vision): support also png with base64 input Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): support vision and upload of files Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * display the processed image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make trust remote code stand out Signed-off-by: mudler <mudler@localai.io> * feat(ui): track in progress job across index/model gallery Signed-off-by: mudler <mudler@localai.io> * minor fixups Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>	2024-05-08 00:42:34 +02:00
Dave	d3ddc9e4aa	UI: flag `trust_remote_code` to users // favicon support (#2253 ) * attempt to indicate trust_remote_code in some way * bonus: favicon support! --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-05-07 08:39:23 +02:00
Ettore Di Giacinto	fe055d4b36	feat(webui): ux improvements (#2247 ) * ux: change welcome when there are no models installed Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ux: filter Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ux: show tags in filter Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make tags clickable Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * allow to delete models from the list Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ui: display icon of installed models Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * gallery: remove gallery file when removing model Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(gallery): show a re-install button Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make filter buttons, rename Gallery field Signed-off-by: mudler <mudler@localai.io> * show again buttons at end of operations Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>	2024-05-07 01:17:07 +02:00
Dave	b52ff1249f	test: check the response URL during image gen in `app_test.go` (#2248 ) test: actually check the response URL from image gen Signed-off-by: Dave Lee <dave@gray101.com>	2024-05-05 18:46:33 +00:00
Ettore Di Giacinto	117c9873e1	fix(webui): display small navbar with smaller screens (#2240 ) Signed-off-by: mudler <mudler@localai.io>	2024-05-04 23:38:39 +02:00
Ettore Di Giacinto	530bec9c64	feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232 ) * feat(initializer): do not specify backends to autoload We can simply try to autoload the backends extracted in the asset dir. This will allow to build variants of the same backend (for e.g. with different instructions sets), so to have a single binary for all the variants. Signed-off-by: mudler <mudler@localai.io> * refactor(prepare): refactor out llama.cpp prepare steps Make it so are idempotent and that we can re-build Signed-off-by: mudler <mudler@localai.io> * [TEST] feat(build): build noavx version along Signed-off-by: mudler <mudler@localai.io> * build: make build parallel Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: do not override CMAKE_ARGS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: add fallback variant Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): fail if no token is set Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): rename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: do not autoload local-store Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: give priority between the listed backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: mudler <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-04 17:56:12 +02:00
Ettore Di Giacinto	54faaa87ea	fix(webui): correct documentation URL for text2img (#2233 ) Signed-off-by: mudler <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-05-04 00:25:13 +00:00
Ettore Di Giacinto	b58274b8a2	feat(ui): support multilineand style `ul` (#2226 ) * feat(ui/chat): handle multiline in the input field Signed-off-by: mudler <mudler@localai.io> * feat(ui/chat): correctly display multiline messages Signed-off-by: mudler <mudler@localai.io> * feat(ui/chat): add list style Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: mudler <mudler@localai.io>	2024-05-03 00:43:02 +02:00
Ettore Di Giacinto	2c5a46bc34	feat(ux): Add chat, tts, and image-gen pages to the WebUI (#2222 ) * feat(webui): Add chat page Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(webui): Add image-gen page Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(webui): Add tts page Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-02 21:14:10 +02:00
Dave	c4f958e11b	refactor(application): introduce application global state (#2072 ) * start breaking up the giant channel refactor now that it's better understood - easier to merge bites Signed-off-by: Dave Lee <dave@gray101.com> * add concurrency and base64 back in, along with new base64 tests. Signed-off-by: Dave Lee <dave@gray101.com> * Automatic rename of whisper.go's Result to TranscriptResult Signed-off-by: Dave Lee <dave@gray101.com> * remove pkg/concurrency - significant changes coming in split 2 Signed-off-by: Dave Lee <dave@gray101.com> * fix comments Signed-off-by: Dave Lee <dave@gray101.com> * add list_model service as another low-risk service to get it out of the way Signed-off-by: Dave Lee <dave@gray101.com> * split backend config loader into seperate file from the actual config struct. No changes yet, just reduce cognative load with smaller files of logical blocks Signed-off-by: Dave Lee <dave@gray101.com> * rename state.go ==> application.go Signed-off-by: Dave Lee <dave@gray101.com> * fix lost import? Signed-off-by: Dave Lee <dave@gray101.com> --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-29 17:42:37 +00:00
Ettore Di Giacinto	e8d44447ad	feat(gallery): support model deletion (#2173 ) * feat(gallery): op now supports deletion of models Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Wire things with WebUI(WIP) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * minor improvements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-28 23:42:46 +02:00
Ettore Di Giacinto	fb2a05ff43	feat(gallery): display job status also during navigation (#2151 ) * feat(gallery): keep showing progress also when refreshing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(intel-gpu): better defaults Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: make it thread-safe Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>	2024-04-27 09:08:33 +02:00
Dave	006306b183	fix: use bluemonday as recommended by blackfriday (#2142 ) use bluemonday as recommended by blackfriday Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-26 10:34:50 +02:00
Dave	c8dd8e5ef4	fix: reduce chmod permissions for created files and directories (#2137 ) quiet more security scanner issues: pass one of chmod restriction to remove group and other permissions Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-26 00:47:06 +02:00
Ettore Di Giacinto	b664edde29	feat(rerankers): Add new backend, support jina rerankers API (#2121 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-25 00:19:02 +02:00
Ettore Di Giacinto	4fffc47e77	deps(llama.cpp): update, use better model for function call tests (#2119 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-24 18:44:04 +02:00
Ettore Di Giacinto	ac56ac2b2d	fix(gallery): show a fake image if no there is no icon (#2111 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-23 20:10:58 +02:00
Ettore Di Giacinto	a09fe1b9ba	fix(gallery): set margin for images Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-23 20:00:20 +02:00
Ettore Di Giacinto	d344daf129	feat(models-ui): minor visual enhancements (#2109 ) Show image if present, URL, tags, and better display buttons Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-23 18:43:25 +02:00
cryptk	3411e072ca	Fix cleanup sonarqube findings (#2106 ) * fix: update dockerignore and gitignore to exclude sonarqube work dir Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove useless equality check Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use sonarqube Dockerfile recommendations Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-23 18:43:00 +02:00
Ettore Di Giacinto	0d8bf91699	feat: Galleries UI (#2104 ) * WIP: add models to webui Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Register routes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: don't cache models Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: fixup multiple installs (strings.Clone) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-23 09:22:58 +02:00
Ettore Di Giacinto	f3f6535aad	fix: rename fiber entrypoint from http/api to http/app (#2096 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-04-21 22:39:28 +02:00
Ettore Di Giacinto	284ad026b1	refactor(routes): split routes registration (#2077 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-21 01:19:57 +02:00
cryptk	8d30b39811	feat: fiber logs with zerlog and add trace level (#2082 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-20 10:43:37 +02:00
Ettore Di Giacinto	bbea62b907	feat(functions): support models with no grammar, add tests (#2068 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-18 22:43:12 +02:00
Ettore Di Giacinto	af9e5a2d05	Revert #1963 (#2056 ) * Revert "fix(fncall): fix regression introduced in #1963 (#2048)" This reverts commit `6b06d4e0af`. * Revert "fix: action-tmate back to upstream, dead code removal (#2038)" This reverts commit `fdec8a9d00`. * Revert "feat(grpc): return consumed token count and update response accordingly (#2035)" This reverts commit `e843d7df0e`. * Revert "refactor: backend/service split, channel-based llm flow (#1963)" This reverts commit `eed5706994`. * feat(grpc): return consumed token count and update response accordingly Fixes: #1920 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-17 23:33:49 +02:00
Dave	fdec8a9d00	fix: action-tmate back to upstream, dead code removal (#2038 ) cleanup: upstream action-tmate has taken my PR, drop master reference. Also remove dead code from api.go Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-16 01:46:36 +00:00
Dave	eed5706994	refactor: backend/service split, channel-based llm flow (#1963 ) Refactor: channel based llm flow and services split --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-13 09:45:34 +02:00
Ettore Di Giacinto	b2785ff06e	feat(gallery): support ConfigURLs (#2012 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-12 00:49:23 +02:00
Ludovic Leroux	12c0d9443e	feat: use tokenizer.apply_chat_template() in vLLM (#1990 ) Use tokenizer.apply_chat_template() in vLLM Signed-off-by: Ludovic LEROUX <ludovic@inpher.io>	2024-04-11 19:20:22 +02:00
Ettore Di Giacinto	8342553214	fix(llama.cpp): set better defaults for llama.cpp (#1961 ) fix(defaults): set better defaults for llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-06 22:56:45 +02:00
cryptk	b85dad0286	feat: first pass at improving logging (#1956 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-04 09:24:22 +02:00

1 2 3 4

170 Commits