2cc1bd85af
⬆️ Update ggerganov/llama.cpp ( #2224 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-05-02 21:23:40 +00:00
2c5a46bc34
feat(ux): Add chat, tts, and image-gen pages to the WebUI ( #2222 )
...
* feat(webui): Add chat page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(webui): Add image-gen page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(webui): Add tts page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-02 21:14:10 +02:00
f7f8b4804b
models(gallery): Add Hermes-2-Pro-Llama-3-8B-GGUF ( #2218 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-02 18:31:13 +02:00
e5bd9a76c7
models(gallery): add wizardlm2 ( #2209 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-02 18:31:02 +02:00
4690b534e0
feat: user defined inference device for CUDA and OpenVINO ( #2212 )
...
user defined inference device
configuration via main_gpu parameter
2024-05-02 09:54:29 +02:00
6a7a7996bb
⬆️ Update ggerganov/llama.cpp ( #2213 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-05-01 21:19:44 +00:00
962ebbaf77
models(gallery): fixup phi-3 sha
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-01 23:06:58 +02:00
f90d56d371
⬆️ Update ggerganov/llama.cpp ( #2203 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-30 21:53:31 +00:00
445cfd4db3
models(gallery): add guillaumetell ( #2195 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-30 23:24:41 +02:00
b24d44dc56
models(gallery): add suzume-llama-3-8B-multilingual-gguf ( #2194 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-30 23:24:28 +02:00
cd31f8d865
models(gallery): add lexifun ( #2193 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-30 23:24:13 +02:00
970cb3a219
chore: update go-stablediffusion to latest commit with Make jobserver fix
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-30 15:59:28 -05:00
f7aabf1b50
fix: bring everything onto the same GRPC version to fix tests ( #2199 )
...
fix: more places where we are installing grpc that need a version specified
fix: attempt to fix metal tests
fix: metal/brew is forcing an update, they don't have 1.58 available anymore
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-30 19:12:15 +00:00
e38610e521
feat: OpenVINO acceleration for embeddings in transformer backend ( #2190 )
...
OpenVINO acceleration for embeddings
New argument type: OVModelForFeatureExtraction
2024-04-30 10:13:04 +02:00
3754f154ee
feat: organize Dockerfile into distinct sections ( #2181 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-30 10:12:19 +02:00
29d7812344
⬆️ Update ggerganov/whisper.cpp ( #2188 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-29 22:16:04 +00:00
5fd46175dc
fix: ensure GNUMake jobserver is passed through to whisper.cpp build ( #2187 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-29 16:40:50 -05:00
52a268c38c
⬆️ Update ggerganov/llama.cpp ( #2189 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-29 21:36:30 +00:00
53c3842bc2
build(deps): bump dependabot/fetch-metadata from 2.0.0 to 2.1.0 ( #2186 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.0.0 to 2.1.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.0.0...v2.1.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-29 21:12:37 +00:00
c4f958e11b
refactor(application): introduce application global state ( #2072 )
...
* start breaking up the giant channel refactor now that it's better understood - easier to merge bites
Signed-off-by: Dave Lee <dave@gray101.com >
* add concurrency and base64 back in, along with new base64 tests.
Signed-off-by: Dave Lee <dave@gray101.com >
* Automatic rename of whisper.go's Result to TranscriptResult
Signed-off-by: Dave Lee <dave@gray101.com >
* remove pkg/concurrency - significant changes coming in split 2
Signed-off-by: Dave Lee <dave@gray101.com >
* fix comments
Signed-off-by: Dave Lee <dave@gray101.com >
* add list_model service as another low-risk service to get it out of the way
Signed-off-by: Dave Lee <dave@gray101.com >
* split backend config loader into separate file from the actual config struct. No changes yet, just reduce cognitive load with smaller files of logical blocks
Signed-off-by: Dave Lee <dave@gray101.com >
* rename state.go ==> application.go
Signed-off-by: Dave Lee <dave@gray101.com >
* fix lost import?
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-29 17:42:37 +00:00
147440b39b
docs: add reference for concurrent requests
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-29 18:31:50 +02:00
baff5ff8c2
models(gallery): add openvino models ( #2184 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-29 18:17:47 +02:00
ea13863221
models(gallery): add llama3-32k ( #2183 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-29 18:17:39 +02:00
93ca56086e
update go-tinydream to latest commit ( #2182 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-29 15:17:09 +02:00
11c48a0004
fix: security scanner warning noise: error handlers part 2 ( #2145 )
...
check off a few more error handlers
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-29 15:11:42 +02:00
b7ea9602f5
fix: undefined symbol: iJIT_NotifyEvent in import torch ##2153 ( #2179 )
...
* add extra index to Intel repository
* Update install.sh
2024-04-29 15:11:09 +02:00
982dc6a2bd
fix: github bump_docs.sh regex to drop emoji and other text ( #2180 )
...
fix: bump_docs regex
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-29 03:55:29 +00:00
74d903acca
[Documentations] Removed invalid numberings from troubleshooting mac ( #2174 )
...
* updated troubleshooting mac
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
* prepend -
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
---------
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
2024-04-29 02:21:51 +00:00
5fef3b0ff1
⬆️ Update ggerganov/whisper.cpp ( #2177 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-28 22:32:45 +00:00
0674893649
Update .env
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-28 23:56:10 +02:00
e8d44447ad
feat(gallery): support model deletion ( #2173 )
...
* feat(gallery): op now supports deletion of models
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Wire things with WebUI (WIP)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* minor improvements
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-28 23:42:46 +02:00
a24cd4fda0
docs: enhance and condense few sections ( #2178 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-28 23:41:59 +02:00
01860674c4
⬆️ Update ggerganov/llama.cpp ( #2176 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-28 21:41:12 +00:00
987b7ad42d
feat: only keep the build artifacts from the grpc build ( #2172 )
...
* feat: only keep the build artifacts from the grpc build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove separate Cache GRPC build step
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove docker inspect step, it is leftover from previous debugging
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-28 19:24:16 +00:00
21974fe1d3
fix: swap to WHISPER_CUDA per deprecation message from whisper.cpp ( #2170 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-28 17:51:53 +00:00
26e1892521
Issue-1720: Updated Build on mac documentations ( #2171 )
...
updated build on macs documentation
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
2024-04-28 19:38:02 +02:00
a78cd67737
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-28 19:30:23 +02:00
5e243ceaeb
docs: update gallery, add rerankers ( #2166 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-28 15:54:15 +02:00
1a0a6f60a7
docs: update model-gallery.md with correct gallery file ( #2163 )
...
* Update model-gallery.md with correct gallery file
The readme points to a file that hasn't been updated in months, so when new models are announced, users pointing to the old file won't get them. Point to the updated files instead.
Signed-off-by: QuinnPiers <167640194+QuinnPiers@users.noreply.github.com >
* Update model-gallery.md
second pass with more understanding
Signed-off-by: QuinnPiers <167640194+QuinnPiers@users.noreply.github.com >
* Update model-gallery.md
Signed-off-by: QuinnPiers <167640194+QuinnPiers@users.noreply.github.com >
* Update model-gallery.md
Signed-off-by: QuinnPiers <167640194+QuinnPiers@users.noreply.github.com >
---------
Signed-off-by: QuinnPiers <167640194+QuinnPiers@users.noreply.github.com >
2024-04-28 12:34:15 +02:00
3179c019af
Revert "⬆️ Update docs version mudler/LocalAI" ( #2165 )
...
* Revert "⬆️ Update docs version mudler/LocalAI (#2149)"
This reverts commit 56d843c263.
* Apply suggestions from code review
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-28 12:32:49 +02:00
a8089494fd
models(gallery): add biomistral-7b ( #2161 )
...
* models(gallery): add biomistral-7b
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* add <|end_of_text|> to llama3 as stopword
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-28 12:04:04 +02:00
a248ede222
models(gallery): add Undi95/Llama-3-LewdPlay-8B-evo-GGUF ( #2160 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-28 12:02:50 +02:00
0f0ae13ad0
models(gallery): add poppy porpoise ( #2158 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-28 12:01:01 +02:00
773d5d23d5
models(gallery): add solana ( #2157 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-28 11:57:22 +02:00
c3982212f9
⬆️ Update ggerganov/llama.cpp ( #2159 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-04-27 21:32:43 +00:00
7e6bf6e7a1
ci: add auto-label rule for gallery in labeler.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-27 19:52:26 +02:00
9fc0135991
feat: cleanup Dockerfile and make final image a little smaller ( #2146 )
...
* feat: cleanup Dockerfile and make final image a little smaller
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add build-essential to final stage
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: correct for another cause of GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: generate new GRPC cache automatically if needed
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-27 19:48:20 +02:00
164be58445
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-27 18:10:58 +02:00
1f8461767d
models(gallery): add average_normie ( #2155 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-27 17:59:31 +02:00
935f4c23f6
models(gallery): add SOVL ( #2154 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-27 17:53:56 +02:00