LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-12-24 14:56:41 +00:00

Author	SHA1	Message	Date
LocalAI [bot]	0f134d557e	⬆️ Update ggerganov/whisper.cpp (#2507 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-06 23:21:25 +00:00
Ettore Di Giacinto	4c9623f50d	deps(whisper): update, add libcufft-dev (#2501 ) * arrow_up: Update ggerganov/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * fix(build): add libcufft-dev Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-06 08:41:04 +02:00
Ettore Di Giacinto	596cf76135	build(intel): bundle intel variants in single-binary (#2494 ) * wip: try to build also intel variants Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add dependencies * Select automatically intel backend --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-06 08:40:51 +02:00
LocalAI [bot]	a293aa1b79	⬆️ Update ggerganov/llama.cpp (#2493 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-06 00:02:51 +00:00
Ettore Di Giacinto	17cf6c4a4d	feat(amdgpu): try to build in single binary (#2485 ) * feat(amdgpu): try to build in single binary Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Release space from worker Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-06-05 08:44:15 +02:00
LocalAI [bot]	fab3e711ff	⬆️ Update ggerganov/llama.cpp (#2487 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-04 23:11:28 +00:00
LocalAI [bot]	67aa31faad	⬆️ Update ggerganov/llama.cpp (#2477 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-03 23:09:24 +00:00
LocalAI [bot]	5ddaa19914	⬆️ Update ggerganov/llama.cpp (#2467 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-02 21:34:29 +00:00
LocalAI [bot]	b588cae70e	⬆️ Update ggerganov/llama.cpp (#2465 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-01 22:31:32 +00:00
Chakib Benziane	b99182c8d4	TTS API improvements (#2308 ) * update doc on COQUI_LANGUAGE env variable Signed-off-by: blob42 <contact@blob42.xyz> * return errors from tts gRPC backend Signed-off-by: blob42 <contact@blob42.xyz> * handle speaker_id and language in coqui TTS backend Signed-off-by: blob42 <contact@blob42.xyz> * TTS endpoint: add optional language paramter Signed-off-by: blob42 <contact@blob42.xyz> * tts fix: empty language string breaks non-multilingual models Signed-off-by: blob42 <contact@blob42.xyz> * allow tts param definition in config file - consolidate TTS options under `tts` config entry Signed-off-by: blob42 <contact@blob42.xyz> * tts: update doc Signed-off-by: blob42 <contact@blob42.xyz> --------- Signed-off-by: blob42 <contact@blob42.xyz> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-06-01 18:26:27 +00:00
LocalAI [bot]	06b461b061	⬆️ Update ggerganov/llama.cpp (#2453 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-06-01 00:09:26 +02:00
LocalAI [bot]	3fe7e9f678	⬆️ Update ggerganov/whisper.cpp (#2452 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-31 21:59:48 +00:00
Ettore Di Giacinto	ff8a6962cd	build(Makefile): add back single target to build native llama-cpp (#2448 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-31 18:35:33 +02:00
LocalAI [bot]	5dc6bace49	⬆️ Update ggerganov/whisper.cpp (#2443 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-30 22:18:55 +00:00
LocalAI [bot]	3cd5918ae6	⬆️ Update ggerganov/llama.cpp (#2444 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-30 22:09:42 +00:00
LocalAI [bot]	b2fc92daa7	⬆️ Update ggerganov/whisper.cpp (#2438 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-30 06:07:28 +00:00
LocalAI [bot]	0787797961	⬆️ Update ggerganov/llama.cpp (#2437 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-29 23:15:36 +00:00
LocalAI [bot]	087bceccac	⬆️ Update ggerganov/llama.cpp (#2433 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-28 21:55:03 +00:00
LocalAI [bot]	577888f3c0	⬆️ Update ggerganov/llama.cpp (#2428 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-27 22:02:49 +00:00
LocalAI [bot]	1c80f628ff	⬆️ Update ggerganov/whisper.cpp (#2427 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-27 21:28:36 +00:00
Ettore Di Giacinto	10430a00bd	feat(hipblas): extend default hipblas GPU_TARGETS (#2426 ) Makefile: extend default hipblas GPU_TARGETS Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-27 22:35:11 +02:00
LocalAI [bot]	e9c28a1ed7	⬆️ Update ggerganov/llama.cpp (#2419 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-26 21:32:05 +00:00
LocalAI [bot]	593fb62bf0	⬆️ Update ggerganov/llama.cpp (#2409 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-26 08:43:50 +00:00
LocalAI [bot]	480834f75b	⬆️ Update ggerganov/whisper.cpp (#2408 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-26 08:05:15 +00:00
LocalAI [bot]	f8cea16c03	⬆️ Update ggerganov/llama.cpp (#2399 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-24 21:52:13 +00:00
LocalAI [bot]	dce63237f2	⬆️ Update ggerganov/llama.cpp (#2360 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-23 21:02:13 +00:00
LocalAI [bot]	c8d7d14a37	⬆️ Update go-skynet/go-bert.cpp (#1225 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-22 23:42:38 +00:00
LocalAI [bot]	c56bc0de98	⬆️ Update ggerganov/whisper.cpp (#2361 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-23 01:02:57 +02:00
Ettore Di Giacinto	3a9408363b	deps(llama.cpp): update and adapt API changes (#2381 ) deps(llama.cpp): update and rename function Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-23 01:02:11 +02:00
Ettore Di Giacinto	16474bfb40	build: add sha (#2356 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-20 18:02:19 +02:00
LocalAI [bot]	053531e434	⬆️ Update ggerganov/whisper.cpp (#2352 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-19 22:23:02 +00:00
LocalAI [bot]	b7ab4f25d9	⬆️ Update ggerganov/llama.cpp (#2351 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-19 22:22:03 +00:00
Ettore Di Giacinto	8ccd5ab040	feat(webui): statically embed js/css assets (#2348 ) * feat(webui): statically embed js/css assets Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * update font assets Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-19 18:24:27 +02:00
Ettore Di Giacinto	8ad669339e	add openvoice backend (#2334 ) Wip openvoice	2024-05-19 16:27:08 +02:00
LocalAI [bot]	5f35e85e86	⬆️ Update ggerganov/llama.cpp (#2342 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-18 21:06:29 +00:00
LocalAI [bot]	9ab8f8f5e0	⬆️ Update ggerganov/llama.cpp (#2339 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-17 21:13:01 +00:00
LocalAI [bot]	9a255d6453	⬆️ Update ggerganov/llama.cpp (#2337 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-16 21:53:19 +00:00
LocalAI [bot]	4e92569d45	⬆️ Update ggerganov/whisper.cpp (#2329 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-15 22:24:06 +00:00
LocalAI [bot]	b584dcf18a	⬆️ Update ggerganov/llama.cpp (#2316 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-15 22:20:37 +00:00
Ettore Di Giacinto	c89271b2e4	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 ) * feat(llama.cpp): support distributed llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: let tweak how chat messages are merged together Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Makefile: register to ALL_GRPC_BACKENDS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring, allow disable auto-detection of backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * minor fixups Signed-off-by: mudler <mudler@localai.io> * feat: add cmd to start rpc-server from llama.cpp Signed-off-by: mudler <mudler@localai.io> * ci: add ccache Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>	2024-05-15 01:17:02 +02:00
LocalAI [bot]	566b5cf2ee	⬆️ Update ggerganov/whisper.cpp (#2326 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-14 21:17:46 +00:00
Sertaç Özercan	a670318a9f	feat: auto select llama-cpp cuda runtime (#2306 ) * auto select cpu variant Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * remove cuda target for now Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix metal Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix path Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * cuda Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * auto select cuda Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * update test Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * select CUDA backend only if present Signed-off-by: mudler <mudler@localai.io> * ci: keep cuda bin in path Signed-off-by: mudler <mudler@localai.io> * Makefile: make dist now builds also cuda Signed-off-by: mudler <mudler@localai.io> * Keep pushing fallback in case auto-flagset/nvidia fails There could be other reasons for which the default binary may fail. For example we might have detected an Nvidia GPU, however the user might not have the drivers/cuda libraries installed in the system, and so it would fail to start. We keep the fallback of llama.cpp at the end of the llama.cpp backends to try to fallback loading in case things go wrong Signed-off-by: mudler <mudler@localai.io> * Do not build cuda on MacOS Signed-off-by: mudler <mudler@localai.io> * cleanup Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Sertac Ozercan <sozercan@gmail.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: mudler <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: mudler <mudler@localai.io>	2024-05-14 19:40:18 +02:00
LocalAI [bot]	4ac7956f68	⬆️ Update ggerganov/whisper.cpp (#2317 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-13 22:25:14 +00:00
Sertaç Özercan	e2c3ffb09b	feat: auto select llama-cpp cpu variant (#2305 ) * auto select cpu variant Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * remove cuda target for now Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix metal Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix path Signed-off-by: Sertac Ozercan <sozercan@gmail.com> --------- Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-05-13 11:37:52 +02:00
LocalAI [bot]	b4cb22f444	⬆️ Update ggerganov/llama.cpp (#2303 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-12 21:18:59 +00:00
LocalAI [bot]	dfc420706c	⬆️ Update ggerganov/llama.cpp (#2290 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-11 21:16:34 +00:00
LocalAI [bot]	93e581dfd0	⬆️ Update ggerganov/llama.cpp (#2285 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-10 21:09:22 +00:00
Ettore Di Giacinto	9b09eb005f	build: do not specify a BUILD_ID by default (#2284 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-10 16:01:55 +02:00
LocalAI [bot]	18a04246fa	⬆️ Update ggerganov/llama.cpp (#2281 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-09 22:18:49 +00:00
LocalAI [bot]	d651f390cd	⬆️ Update ggerganov/whisper.cpp (#2273 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-08 22:11:10 +00:00
LocalAI [bot]	eca5200fbd	⬆️ Update ggerganov/llama.cpp (#2272 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-08 21:34:56 +00:00
LocalAI [bot]	995aa5ed21	⬆️ Update ggerganov/llama.cpp (#2263 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-07 21:39:12 +00:00
LocalAI [bot]	581b894789	⬆️ Update ggerganov/llama.cpp (#2255 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-06 21:28:07 +00:00
LocalAI [bot]	c5475020fe	⬆️ Update ggerganov/llama.cpp (#2251 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-05 21:16:00 +00:00
Ettore Di Giacinto	c5798500cb	feat(single-build): generate single binaries for releases (#2246 ) * feat(single-build): generate single binaries for releases Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop old targets Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-05 17:20:51 +02:00
LocalAI [bot]	17e94fbcb1	⬆️ Update ggerganov/llama.cpp (#2239 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-04 21:26:22 +00:00
Ettore Di Giacinto	530bec9c64	feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232 ) * feat(initializer): do not specify backends to autoload We can simply try to autoload the backends extracted in the asset dir. This will allow to build variants of the same backend (for e.g. with different instructions sets), so to have a single binary for all the variants. Signed-off-by: mudler <mudler@localai.io> * refactor(prepare): refactor out llama.cpp prepare steps Make it so are idempotent and that we can re-build Signed-off-by: mudler <mudler@localai.io> * [TEST] feat(build): build noavx version along Signed-off-by: mudler <mudler@localai.io> * build: make build parallel Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: do not override CMAKE_ARGS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: add fallback variant Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): fail if no token is set Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): rename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: do not autoload local-store Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: give priority between the listed backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: mudler <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-04 17:56:12 +02:00
LocalAI [bot]	ac0f3d6e82	⬆️ Update ggerganov/whisper.cpp (#2230 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-03 22:16:26 +00:00
LocalAI [bot]	da0b6a89ae	⬆️ Update ggerganov/llama.cpp (#2229 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-03 21:39:28 +00:00
LocalAI [bot]	2cc1bd85af	⬆️ Update ggerganov/llama.cpp (#2224 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-02 21:23:40 +00:00
LocalAI [bot]	6a7a7996bb	⬆️ Update ggerganov/llama.cpp (#2213 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-01 21:19:44 +00:00
LocalAI [bot]	f90d56d371	⬆️ Update ggerganov/llama.cpp (#2203 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-30 21:53:31 +00:00
Chris Jowett	970cb3a219	chore: update go-stablediffusion to latest commit with Make jobserver fix Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-30 15:59:28 -05:00
LocalAI [bot]	29d7812344	⬆️ Update ggerganov/whisper.cpp (#2188 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 22:16:04 +00:00
cryptk	5fd46175dc	fix: ensure GNUMake jobserver is passed through to whisper.cpp build (#2187 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 16:40:50 -05:00
LocalAI [bot]	52a268c38c	⬆️ Update ggerganov/llama.cpp (#2189 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 21:36:30 +00:00
cryptk	93ca56086e	update go-tinydream to latest commit (#2182 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 15:17:09 +02:00
LocalAI [bot]	5fef3b0ff1	⬆️ Update ggerganov/whisper.cpp (#2177 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 22:32:45 +00:00
LocalAI [bot]	01860674c4	⬆️ Update ggerganov/llama.cpp (#2176 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 21:41:12 +00:00
cryptk	21974fe1d3	fix: swap to WHISPER_CUDA per deprecation message from whisper.cpp (#2170 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-28 17:51:53 +00:00
LocalAI [bot]	c3982212f9	⬆️ Update ggerganov/llama.cpp (#2159 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 21:32:43 +00:00
LocalAI [bot]	030d555995	⬆️ Update ggerganov/llama.cpp (#2150 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 02:18:28 +00:00
fakezeta	c9451cb604	Bump oneapi-basekit, optimum and openvino (#2139 ) * Bump oneapi-basekit, optimum and openvino * Changed PERFORMANCE HINT to CUMULATIVE_THROUGHPUT Minor latency change for first token but about 10-15% speedup on token generation.	2024-04-26 16:20:43 +02:00
LocalAI [bot]	365ef92530	⬆️ Update mudler/go-stable-diffusion (#2134 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:41:38 +00:00
LocalAI [bot]	5fceb876c4	⬆️ Update ggerganov/llama.cpp (#2133 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:40:41 +00:00
Ettore Di Giacinto	b664edde29	feat(rerankers): Add new backend, support jina rerankers API (#2121 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-25 00:19:02 +02:00
LocalAI [bot]	e16658b7ec	⬆️ Update ggerganov/llama.cpp (#2123 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 22:00:17 +00:00
LocalAI [bot]	d30280ed23	⬆️ Update ggerganov/whisper.cpp (#2122 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 21:55:30 +00:00
Ettore Di Giacinto	4fffc47e77	deps(llama.cpp): update, use better model for function call tests (#2119 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-24 18:44:04 +02:00
LocalAI [bot]	38c9abed8b	⬆️ Update ggerganov/llama.cpp (#2089 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-21 16:35:30 +00:00
Ettore Di Giacinto	284ad026b1	refactor(routes): split routes registration (#2077 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-21 01:19:57 +02:00
LocalAI [bot]	1e37101930	⬆️ Update ggerganov/llama.cpp (#2080 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-20 00:05:16 +00:00
LocalAI [bot]	e9448005a5	⬆️ Update ggerganov/llama.cpp (#2051 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-18 21:30:55 +00:00
cryptk	e9f090257c	fix: adjust some sources names to match the naming of their repositories (#2061 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-18 01:59:05 +00:00
Ettore Di Giacinto	af9e5a2d05	Revert #1963 (#2056 ) * Revert "fix(fncall): fix regression introduced in #1963 (#2048)" This reverts commit `6b06d4e0af`. * Revert "fix: action-tmate back to upstream, dead code removal (#2038)" This reverts commit `fdec8a9d00`. * Revert "feat(grpc): return consumed token count and update response accordingly (#2035)" This reverts commit `e843d7df0e`. * Revert "refactor: backend/service split, channel-based llm flow (#1963)" This reverts commit `eed5706994`. * feat(grpc): return consumed token count and update response accordingly Fixes: #1920 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-17 23:33:49 +02:00
LocalAI [bot]	af8c705ecd	⬆️ Update ggerganov/whisper.cpp (#2060 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-17 21:17:25 +00:00
LocalAI [bot]	5763dc1613	⬆️ Update ggerganov/whisper.cpp (#2050 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-16 21:37:50 +00:00
LocalAI [bot]	0cc1ad2188	⬆️ Update ggerganov/whisper.cpp (#2042 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 23:27:52 +00:00
LocalAI [bot]	cdece3879f	⬆️ Update ggerganov/llama.cpp (#2043 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 22:47:29 +00:00
LocalAI [bot]	de3a1a0a8e	⬆️ Update ggerganov/llama.cpp (#2033 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-14 23:35:44 +00:00
Ettore Di Giacinto	0fdff26924	feat(parler-tts): Add new backend (#2027 ) * feat(parler-tts): Add new backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): try downgrade protobuf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): add parler conda env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Revert "feat(parler-tts): try downgrade protobuf" This reverts commit bd5941d5cfc00676b45a99f71debf3c34249cf3c. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * deps: add grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: try to gen proto with same environment * workaround * Revert "fix: try to gen proto with same environment" This reverts commit `998c745e2f`. * Workaround fixup --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-04-13 18:59:21 +02:00
LocalAI [bot]	619f2517a4	⬆️ Update ggerganov/llama.cpp (#2028 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 13:47:39 +00:00
Dave	eed5706994	refactor: backend/service split, channel-based llm flow (#1963 ) Refactor: channel based llm flow and services split --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-13 09:45:34 +02:00
cryptk	1981154f49	fix: dont commit generated files to git (#1993 ) * fix: initial work towards not committing generated files to the repository Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: improve build docs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove unused folder from .dockerignore and .gitignore Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix extra backend tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix other tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more test fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: fix apple tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more extras tests fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add GOBIN to PATH in docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: extra tests and Dockerfile corrections Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove build dependency checks Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add golang protobuf compilers to tests-linux action Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: ensure protogen is run for extra backend installs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use newer protobuf Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more missing protoc binaries Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: missing dependencies during docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: don't install grpc compilers in the final stage if they aren't needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: python-grpc-tools in 22.04 repos is too old Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add a couple of extra build dependencies to Makefile Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: unbreak container rebuild functionality Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-13 09:37:32 +02:00
LocalAI [bot]	912d2dccfa	⬆️ Update ggerganov/llama.cpp (#2024 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 09:13:00 +02:00
LocalAI [bot]	677e20756b	⬆️ Update ggerganov/llama.cpp (#2014 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-12 00:49:41 +02:00
LocalAI [bot]	e152b07b74	⬆️ Update ggerganov/llama.cpp (#1991 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-11 09:22:07 +02:00
LocalAI [bot]	7e2f8bb408	⬆️ Update ggerganov/whisper.cpp (#1980 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:08:00 +02:00
LocalAI [bot]	951e39d36c	⬆️ Update ggerganov/llama.cpp (#1979 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:07:41 +02:00
LocalAI [bot]	195be10050	⬆️ Update ggerganov/llama.cpp (#1973 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 23:26:52 +02:00
LocalAI [bot]	efcca15d3f	⬆️ Update ggerganov/llama.cpp (#1970 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:47 +02:00
LocalAI [bot]	a153b628c2	⬆️ Update ggerganov/whisper.cpp (#1969 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:17 +02:00
LocalAI [bot]	ed13782986	⬆️ Update ggerganov/llama.cpp (#1964 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-07 10:32:10 +02:00
LocalAI [bot]	8aa5f5a660	⬆️ Update ggerganov/llama.cpp (#1960 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-06 19:15:25 +00:00
LocalAI [bot]	b2d9e3f704	⬆️ Update ggerganov/llama.cpp (#1959 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:55 +02:00
LocalAI [bot]	f744e1f931	⬆️ Update ggerganov/whisper.cpp (#1958 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:35 +02:00
LocalAI [bot]	3851b51d98	⬆️ Update ggerganov/llama.cpp (#1953 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-04 00:27:57 +02:00
LocalAI [bot]	4d4d76114d	⬆️ Update ggerganov/llama.cpp (#1941 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-02 09:16:04 +02:00
LocalAI [bot]	66f90f8dc1	⬆️ Update ggerganov/llama.cpp (#1937 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-01 08:59:23 +02:00
LocalAI [bot]	784657a652	⬆️ Update ggerganov/llama.cpp (#1934 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:38 +01:00
LocalAI [bot]	831efa8893	⬆️ Update ggerganov/whisper.cpp (#1933 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:16 +01:00
LocalAI [bot]	2bba62ca4d	⬆️ Update ggerganov/llama.cpp (#1928 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:52:01 +00:00
cryptk	93702e39d4	feat(build): adjust number of parallel make jobs (#1915 ) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile	2024-03-29 22:32:40 +01:00
LocalAI [bot]	a7fc89c207	⬆️ Update ggerganov/whisper.cpp (#1927 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:29:50 +01:00
Ettore Di Giacinto	123a5a2e16	feat(swagger): Add swagger API doc (#1926 ) * makefile(build): add minimal and api build target * feat(swagger): Add swagger	2024-03-29 22:29:33 +01:00
LocalAI [bot]	ab2f403dd0	⬆️ Update ggerganov/whisper.cpp (#1924 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:59 +01:00
LocalAI [bot]	b9c5e14e2c	⬆️ Update ggerganov/llama.cpp (#1923 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:38 +01:00
LocalAI [bot]	07c49ee4b8	⬆️ Update ggerganov/whisper.cpp (#1914 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 22:53:13 +00:00
LocalAI [bot]	07c4bdda7c	⬆️ Update ggerganov/llama.cpp (#1913 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 21:57:59 +00:00
cryptk	0c0efc871c	fix(build): better CI logging and correct some build failure modes in Makefile (#1899 ) * feat: group make output by target when running parallelized builds in CI * fix: quote GO_TAGS in makefile to fix handling of whitespace in value * fix: set CPATH to find opencv2 in it's commonly installed location * fix: add missing go mod dropreplace for go-llama.cpp * chore: remove opencv symlink from github workflows	2024-03-27 21:12:19 +01:00
Gianluca Boiano	7ef5f3b473	⬆️ Update M0Rf30/go-tiny-dream (#1911 )	2024-03-27 21:12:04 +01:00
LocalAI [bot]	b500ceaf73	⬆️ Update ggerganov/llama.cpp (#1904 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 23:21:54 +00:00
LocalAI [bot]	1395e505cd	⬆️ Update ggerganov/llama.cpp (#1897 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:34:10 +01:00
LocalAI [bot]	42a4c86dca	⬆️ Update ggerganov/whisper.cpp (#1896 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:33:46 +01:00
LocalAI [bot]	3e293f1465	⬆️ Update ggerganov/llama.cpp (#1889 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 21:12:18 +00:00
LocalAI [bot]	0106c58181	⬆️ Update ggerganov/llama.cpp (#1885 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 14:54:01 +01:00
LocalAI [bot]	a922119c41	⬆️ Update ggerganov/llama.cpp (#1881 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-23 09:23:28 +01:00
Richard Palethorpe	643d85d2cc	feat(stores): Vector store backend (#1795 ) Add simple vector store backend Signed-off-by: Richard Palethorpe <io@richiejp.com>	2024-03-22 21:14:04 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
LocalAI [bot]	dd84c29a3d	⬆️ Update ggerganov/whisper.cpp (#1875 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:56 +01:00
LocalAI [bot]	07468c8786	⬆️ Update ggerganov/llama.cpp (#1874 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:42 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Ettore Di Giacinto	e533dcf506	feat(functions/aio): all-in-one images, function template enhancements (#1862 ) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text	2024-03-21 01:12:20 +01:00
LocalAI [bot]	eeaf8c7ccd	⬆️ Update ggerganov/whisper.cpp (#1867 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:26:29 +00:00
LocalAI [bot]	7e34dfdae7	⬆️ Update ggerganov/llama.cpp (#1866 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:13:29 +00:00
LocalAI [bot]	e4bf51d5bd	⬆️ Update ggerganov/llama.cpp (#1864 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 09:05:53 +01:00
LocalAI [bot]	ead61bf9d5	⬆️ Update ggerganov/llama.cpp (#1857 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:03:17 +00:00
LocalAI [bot]	621541a92f	⬆️ Update ggerganov/whisper.cpp (#1508 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:23 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate	2024-03-18 19:19:43 +01:00
Ettore Di Giacinto	b202bfaaa0	deps(whisper.cpp): update, fix cublas build (#1846 ) fix(whisper.cpp): Add stubs and -lcuda	2024-03-18 15:56:53 +01:00
LocalAI [bot]	0eb0ac7dd0	⬆️ Update ggerganov/llama.cpp (#1848 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-18 08:57:58 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
LocalAI [bot]	8967ed1601	⬆️ Update ggerganov/llama.cpp (#1840 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-16 11:25:41 +00:00
LocalAI [bot]	5826fb8e6d	⬆️ Update mudler/go-piper (#1844 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-15 23:51:03 +00:00
Dave	db199f61da	fix: osx build default.metallib (#1837 ) fix: osx build default.metallib (#1837) * port osx fix from refactor pr to slim pr * manually bump llama.cpp version to unstick CI?	2024-03-15 08:18:58 +00:00
LocalAI [bot]	44adbd2c75	⬆️ Update go-skynet/go-llama.cpp (#1835 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 23:06:42 +00:00
Dave	45d520f913	fix: OSX Build Files for llama.cpp (#1836 ) bot ate my changes, seperate branch	2024-03-14 23:07:47 +01:00
LocalAI [bot]	f82065703d	⬆️ Update ggerganov/llama.cpp (#1827 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 08:39:39 +01:00
LocalAI [bot]	5c5f07c1e7	⬆️ Update ggerganov/llama.cpp (#1821 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-13 10:05:46 +01:00
LocalAI [bot]	8e57f4df31	⬆️ Update ggerganov/llama.cpp (#1818 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-11 00:02:37 +01:00
LocalAI [bot]	a08cc5adbb	⬆️ Update ggerganov/llama.cpp (#1816 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-10 09:32:09 +01:00
LocalAI [bot]	595a73fce4	⬆️ Update ggerganov/llama.cpp (#1813 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-09 09:27:06 +01:00
LocalAI [bot]	dc919e08e8	⬆️ Update ggerganov/llama.cpp (#1811 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-08 08:21:25 +01:00
Ettore Di Giacinto	5d1018495f	feat(intel): add diffusers/transformers support (#1746 ) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu	2024-03-07 14:37:45 +01:00
LocalAI [bot]	ad6fd7a991	⬆️ Update ggerganov/llama.cpp (#1805 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-06 23:28:31 +01:00
LocalAI [bot]	e022b5959e	⬆️ Update mudler/go-stable-diffusion (#1802 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 23:39:57 +00:00
LocalAI [bot]	db7f4955a1	⬆️ Update ggerganov/llama.cpp (#1801 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 21:50:27 +00:00
LocalAI [bot]	c8e29033c2	⬆️ Update ggerganov/llama.cpp (#1794 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 08:59:09 +01:00
LocalAI [bot]	d0bd961bde	⬆️ Update ggerganov/llama.cpp (#1791 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-04 09:44:21 +01:00
LocalAI [bot]	b60a3fc879	⬆️ Update ggerganov/llama.cpp (#1789 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-03 08:49:23 +01:00
LocalAI [bot]	daa0b8741c	⬆️ Update ggerganov/llama.cpp (#1785 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-01 22:38:24 +00:00
Dave	1c312685aa	refactor: move remaining api packages to core (#1731 ) * core 1 * api/openai/files fix * core 2 - core/config * move over core api.go and tests to the start of core/http * move over localai specific endpoints to core/http, begin the service/endpoint split there * refactor big chunk on the plane * refactor chunk 2 on plane, next step: port and modify changes to request.go * easy fixes for request.go, major changes not done yet * lintfix * json tag lintfix? * gitignore and .keep files * strange fix attempt: rename the config dir?	2024-03-01 16:19:53 +01:00
LocalAI [bot]	316de82f51	⬆️ Update ggerganov/llama.cpp (#1779 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-29 22:33:30 +00:00
LocalAI [bot]	c665898652	⬆️ Update donomii/go-rwkv.cpp (#1771 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:50:27 +00:00
LocalAI [bot]	f651a660aa	⬆️ Update ggerganov/llama.cpp (#1772 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-28 23:02:30 +01:00
LocalAI [bot]	c7e08813a5	⬆️ Update ggerganov/llama.cpp (#1767 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 23:12:51 +01:00
LocalAI [bot]	d21a6b33ab	⬆️ Update ggerganov/llama.cpp (#1756 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-27 18:07:51 +00:00
Ettore Di Giacinto	d6cf82aba3	fix(tests): re-enable tests after code move (#1764 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-27 15:04:19 +01:00
Ettore Di Giacinto	bc5f5aa538	deps(llama.cpp): update (#1759 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-26 13:18:44 +01:00
Sertaç Özercan	7f72a61104	ci: add stablediffusion to release (#1757 ) Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2024-02-25 23:06:18 +00:00
LocalAI [bot]	8e45d47740	⬆️ Update ggerganov/llama.cpp (#1753 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-25 10:03:19 +01:00
LocalAI [bot]	ff88c390bb	⬆️ Update ggerganov/llama.cpp (#1750 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-24 00:06:46 +01:00
LocalAI [bot]	d825821a22	⬆️ Update ggerganov/llama.cpp (#1740 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-23 00:07:15 +01:00
LocalAI [bot]	6fc122fa1a	⬆️ Update ggerganov/llama.cpp (#1705 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-22 09:33:23 +00:00
Ettore Di Giacinto	8292781045	deps(llama.cpp): update, support Gemma models (#1734 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 17:23:38 +01:00
Ettore Di Giacinto	54ec6348fa	deps(llama.cpp): update (#1714 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-21 11:35:44 +01:00
fenfir	fb0a4c5d9a	Build docker container for ROCm (#1595 ) * Dockerfile changes to build for ROCm * Adjust linker flags for ROCm * Update conda env for diffusers and transformers to use ROCm pytorch * Update transformers conda env for ROCm * ci: build hipblas images * fixup rebase * use self-hosted Signed-off-by: mudler <mudler@localai.io> * specify LD_LIBRARY_PATH only when BUILD_TYPE=hipblas --------- Signed-off-by: mudler <mudler@localai.io> Co-authored-by: mudler <mudler@localai.io>	2024-02-16 15:08:50 +01:00
Ettore Di Giacinto	5e155fb081	fix(python): pin exllama2 (#1711 ) fix(python): pin python deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-14 21:44:12 +01:00
Ettore Di Giacinto	39a6b562cf	fix(llama.cpp): downgrade to a known working version (#1706 ) sycl support is broken otherwise. See upstream issue: https://github.com/ggerganov/llama.cpp/issues/5469 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-14 10:28:06 +01:00
LocalAI [bot]	02f6e18adc	⬆️ Update ggerganov/llama.cpp (#1700 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-12 21:43:33 +00:00
LocalAI [bot]	4436e62cf1	⬆️ Update ggerganov/llama.cpp (#1698 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-12 09:56:04 +01:00
LocalAI [bot]	58cdf97361	⬆️ Update ggerganov/llama.cpp (#1694 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-11 10:01:11 +01:00
LocalAI [bot]	ef1306f703	⬆️ Update mudler/go-stable-diffusion (#1674 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 21:59:15 +00:00
LocalAI [bot]	3196967995	⬆️ Update ggerganov/llama.cpp (#1691 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 21:50:34 +00:00
LocalAI [bot]	fc8423392f	⬆️ Update ggerganov/llama.cpp (#1688 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-09 00:02:23 +01:00
Ettore Di Giacinto	ddd21f1644	feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689 ) * cleanup backends * switch image to ubuntu 22.04 * adapt commands for ubuntu * transformers cleanup * no contrib on ubuntu * Change test model to gguf * ci: disable bark tests (too cpu-intensive) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * refinements * use intel base image * Makefile: Add docker targets * Change test model --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-08 20:12:51 +01:00
Ettore Di Giacinto	e0632f2ce2	fix(llama.cpp): downgrade to fix sycl build Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-02-07 00:16:52 +01:00
LocalAI [bot]	d8b17795d7	⬆️ Update ggerganov/llama.cpp (#1683 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-06 09:26:01 +01:00
LocalAI [bot]	8ace0a9ba7	⬆️ Update ggerganov/llama.cpp (#1681 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-04 21:59:14 +00:00
Ettore Di Giacinto	98ad93d53e	Drop ggml-based gpt2 and starcoder (supported by llama.cpp) (#1679 ) * Drop ggml-based gpt2 and starcoder (supported by llama.cpp) * Update compatibility table	2024-02-04 13:15:51 +01:00
LocalAI [bot]	38e4ec0b2a	⬆️ Update ggerganov/llama.cpp (#1678 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-04 00:55:12 +01:00
Ettore Di Giacinto	df13ba655c	Drop old falcon backend (deprecated) (#1675 ) Drop old falcon backend	2024-02-03 13:01:13 +01:00
LocalAI [bot]	7678b25755	⬆️ Update ggerganov/llama.cpp (#1673 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-02 21:46:26 +00:00
LocalAI [bot]	c87ca4f320	⬆️ Update ggerganov/llama.cpp (#1669 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-02-02 19:14:03 +01:00
Ettore Di Giacinto	1c57f8d077	feat(sycl): Add support for Intel GPUs with sycl (#1647 ) (#1660 ) * feat(sycl): Add sycl support (#1647) * onekit: install without prompts * set cmake args only in grpc-server Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * cleanup * fixup sycl source env * Cleanup docs * ci: runs on self-hosted * fix typo * bump llama.cpp * llama.cpp: update server * adapt to upstream changes * adapt to upstream changes * docs: add sycl --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-02-01 19:21:52 +01:00
LocalAI [bot]	16cebf0390	⬆️ Update ggerganov/llama.cpp (#1665 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-30 23:38:05 +00:00
LocalAI [bot]	c1bae1ee81	⬆️ Update ggerganov/llama.cpp (#1656 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-30 00:43:36 +01:00
LocalAI [bot]	abd678e147	⬆️ Update ggerganov/llama.cpp (#1655 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-28 09:24:44 +01:00
LocalAI [bot]	f928899338	⬆️ Update ggerganov/llama.cpp (#1652 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-27 00:13:38 +01:00
LocalAI [bot]	ac19998e5e	⬆️ Update ggerganov/llama.cpp (#1644 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-01-26 00:13:39 +01:00

... 2 3 4 5 6 ...

708 Commits