d454118887
fix(container-images): install uv as system package ( #4094 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-11-08 11:47:43 +01:00
e06daf437a
chore(Dockerfile): default to cmake from package manager ( #3746 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-10-07 16:42:56 +02:00
fbca9f82fd
fix(vllm): bump cmake - vllm requires it ( #3744 )
...
* fix(vllm): bump cmake - vllm requires it
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): try to increase coqui timeout
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-10-07 11:22:55 +02:00
90cacb9692
test: preliminary tests and merge fix for authv2 ( #3584 )
...
* add api key to existing app tests, add preliminary auth test
Signed-off-by: Dave Lee <dave@gray101.com >
* small fix, run test
Signed-off-by: Dave Lee <dave@gray101.com >
* status on non-opaque
Signed-off-by: Dave Lee <dave@gray101.com >
* tweak auth error
Signed-off-by: Dave Lee <dave@gray101.com >
* exp
Signed-off-by: Dave Lee <dave@gray101.com >
* quick fix on real laptop
Signed-off-by: Dave Lee <dave@gray101.com >
* add downloader version that allows providing an auth header
Signed-off-by: Dave Lee <dave@gray101.com >
* stash some devcontainer fixes during testing
Signed-off-by: Dave Lee <dave@gray101.com >
* s2
Signed-off-by: Dave Lee <dave@gray101.com >
* s
Signed-off-by: Dave Lee <dave@gray101.com >
* done with experiment
Signed-off-by: Dave Lee <dave@gray101.com >
* done with experiment
Signed-off-by: Dave Lee <dave@gray101.com >
* after merge fix
Signed-off-by: Dave Lee <dave@gray101.com >
* rename and fix
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-09-24 09:32:48 +02:00
51cba89682
fix(hipblas): do not push all variants to hipblas builds ( #3630 )
...
Like with CUDA builds, we don't need all the variants when we are
compiling against the accelerated variants - in this way we save space
and we avoid to exceed embedFS golang size limits.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-09-23 11:49:07 +02:00
25e7661de2
chore(exllama): drop exllama backend ( #3536 )
...
* chore(exllama): drop exllama backend
For polishing and cleaning up it makes now sense to drop exllama which
is completely unmaintained, and was only supporting the llamav1
architecture (nowadays it's superseded by llamav1) .
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gosec): fix CI
downgrade to latest known version of the gosec action
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-09-13 19:52:13 +00:00
300f2779e4
fix: speedup and improve cachability of docker build of builder-sd
( #3430 )
...
fix: speedup and improve cachability of docker build of `builder-sd` (#3430 )
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-09-10 08:57:16 +02:00
5d892f86ea
chore(cuda): reduce binary size ( #3379 )
...
fix(cuda): reduce binary size
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-08-26 14:47:36 +02:00
736df11454
fix(ci): pin to llvmlite 0.43 ( #3342 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-08-20 20:14:35 +02:00
aca2c4196a
ci(Dockerfile): try to install lvm-10 from Ubuntu repositories
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-08-20 19:36:11 +02:00
6aba6223c7
ci(Dockerfile): adjust deps from typos
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-08-20 19:21:47 +02:00
d02a0f6f01
ci: add llvm dependencies
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-08-20 18:27:10 +02:00
b06046fe4c
chore: install llvm 10
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-08-20 18:06:55 +02:00
9a8a249932
feat: devcontainer part 3 ( #3318 )
...
* stash initial fixes, attempt to open branch inside container
Signed-off-by: Dave Lee <dave@gray101.com >
* add yq, from inside DC
Signed-off-by: Dave Lee <dave@gray101.com >
* stash progress, rebuild container
Signed-off-by: Dave Lee <dave@gray101.com >
* snap
Signed-off-by: Dave Lee <dave@gray101.com >
* split builder into builder-sd, will speed up devcontainer build times and potentially help caching in other situations.
Signed-off-by: Dave Lee <dave@gray101.com >
* fix yq
Signed-off-by: Dave Lee <dave@gray101.com >
* fix paths
Signed-off-by: Dave Lee <dave@gray101.com >
* fix paths - new folder to bypass the .dockerignore which _should_ exclude the other files
Signed-off-by: Dave Lee <dave@gray101.com >
* fix
Signed-off-by: Dave Lee <dave@gray101.com >
* fix ]
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-08-20 12:16:21 +02:00
dfa183551e
fix: add llvm to extra images ( #3321 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-08-20 12:14:47 +02:00
9475a6fa05
chore: drop petals ( #3316 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-08-20 10:01:38 +02:00
57f7900210
feat: Initial Version of vscode DevContainer ( #3217 )
...
initial version of devcontainer
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-08-14 09:06:41 +02:00
a9757fb057
fix(cuda): downgrade to 12.0 to increase compatibility range ( #2994 )
...
* fix(cuda): downgrade to 12.0 to increase compatibility range
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* improve messaging
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-07-23 23:35:31 +02:00
bcd9e153ba
ci(Makefile): reduce binary size by compressing ( #2947 )
...
Makefile: try to reduce binary size
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-07-22 15:39:57 +02:00
38b3115a15
fix: cleanup indentation and remove duplicate dockerfile stanza ( #2889 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-07-16 01:51:15 +00:00
fc87507012
chore(deps): Update Dependencies ( #2538 )
...
* chore(deps): Update dependencies
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
* chore(deps): Upgrade github.com/imdario/mergo to dario.cat/mergo
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
* remove version identifiers for MeloTTS
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
---------
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
Signed-off-by: Dave <dave@gray101.com >
Co-authored-by: Dave <dave@gray101.com >
2024-07-12 19:54:08 +00:00
fd0bc21c3e
fix abseil test issue [attempt 3] ( #2769 )
...
* use a sed hack to jam a missing line in place for grpc's abseil version.
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-07-11 01:40:54 +00:00
2845baecd5
fix(cuda): downgrade default version from 12.5 to 12.4 ( #2707 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-07-09 23:13:29 +02:00
e591ff2e74
fix(initializer): do select backends that exist ( #2694 )
...
we were not checking if the binary exists before picking these up from
the asset dir.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-07-01 22:50:36 +02:00
e84b31935c
feat(vulkan): add vulkan support to the llama.cpp backend ( #2648 )
...
feat(vulkan): add vulkan support to llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-06-24 20:04:58 +02:00
43f0688a95
feat: Upgrade to CUDA 12.5 ( #2601 )
...
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
2024-06-19 17:50:49 +02:00
b4d4c0a18f
chore(deps): Update Dockerfile ( #2532 )
...
Signed-off-by: Rene Leonhardt <65483435+reneleonhardt@users.noreply.github.com >
2024-06-10 08:40:02 +00:00
4c9623f50d
deps(whisper): update, add libcufft-dev ( #2501 )
...
* arrow_up: Update ggerganov/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(build): add libcufft-dev
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2024-06-06 08:41:04 +02:00
c603b95ac7
ci: pin build-time protoc ( #2461 )
...
ci: pin protoc
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-06-01 18:59:15 +02:00
2bbc52fcc8
feat(build): add arm64 core containers ( #2421 )
...
ci: add arm64 container images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-28 10:34:59 +02:00
9f5c274321
feat(images): do not install python deps in the core image ( #2425 )
...
do not install python deps in the core image
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-27 22:07:48 +02:00
3200a6655e
fix: gpu fetch device info ( #2403 )
...
* fix: gpu fetch device info
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* use pciutils package
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
---------
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
2024-05-26 09:56:06 +02:00
371d0cc1f7
ci: generate specific image for intel builds ( #2374 )
...
ci: fix intel images until are fixed upstream
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-22 23:35:39 +02:00
f91e4e5c03
ci: correctly build p2p in GO_TAGS ( #2369 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-22 10:15:36 +02:00
fdb45153fe
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference ( #2343 )
...
* feat(llama.cpp): Enable decentralized, distributed inference
As https://github.com/mudler/LocalAI/pull/2324 introduced distributed inferencing thanks to
@rgerganov implementation in https://github.com/ggerganov/llama.cpp/pull/6829 in upstream llama.cpp, now
it is possible to distribute the workload to remote llama.cpp gRPC server.
This changeset now uses mudler/edgevpn to establish a secure, distributed network between the nodes using a shared token.
The token is generated automatically when starting the server with the `--p2p` flag, and can be used by starting the workers
with `local-ai worker p2p-llama-cpp-rpc` by passing the token via environment variable (TOKEN) or with args (--token).
As per how mudler/edgevpn works, a network is established between the server and the workers with dht and mdns discovery protocols,
the llama.cpp rpc server is automatically started and exposed to the underlying p2p network so the API server can connect on.
When the HTTP server is started, it will discover the workers in the network and automatically create the port-forwards to the service locally.
Then llama.cpp is configured to use the services.
This feature is behind the "p2p" GO_FLAGS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* go mod tidy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: add p2p tag
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* better message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-20 19:17:59 +02:00
8ad669339e
add openvoice backend ( #2334 )
...
Wip openvoice
2024-05-19 16:27:08 +02:00
c89271b2e4
feat(llama.cpp): add distributed llama.cpp inferencing ( #2324 )
...
* feat(llama.cpp): support distributed llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: let tweak how chat messages are merged together
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* refactor
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Makefile: register to ALL_GRPC_BACKENDS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* refactoring, allow disable auto-detection of backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* minor fixups
Signed-off-by: mudler <mudler@localai.io >
* feat: add cmd to start rpc-server from llama.cpp
Signed-off-by: mudler <mudler@localai.io >
* ci: add ccache
Signed-off-by: mudler <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: mudler <mudler@localai.io >
2024-05-15 01:17:02 +02:00
28a421cb1d
feat: migrate python backends from conda to uv ( #2215 )
...
* feat: migrate diffusers backend from conda to uv
- replace conda with UV for diffusers install (prototype for all
extras backends)
- add ability to build docker with one/some/all extras backends
instead of all or nothing
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate autogtpq bark coqui from conda to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: convert exllama over to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate exllama2 to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate mamba to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate parler to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate petals to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: fix tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate rerankers to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate sentencetransformers to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: install uv for tests-linux
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: make sure file exists before installing on intel images
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate transformers backend to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate transformers-musicgen to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate vall-e-x to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate vllm to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add uv install to the rest of test-extra.yml
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: adjust file perms on all install/run/test scripts
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add missing acclerate dependencies
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add some more missing dependencies to python backends
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: parler tests venv py dir fix
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: correct filename for transformers-musicgen tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: adjust the pwd for valle tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: cleanup and optimization work for uv migration
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add setuptools to requirements-install for mamba
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: more size optimization work
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: make installs and tests more consistent, cleanup some deps
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: cleanup
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: mamba backend is cublas only
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: uncomment lines in makefile
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-05-10 15:08:08 +02:00
a0aa5d01a1
feat: update ROCM and use smaller image ( #2196 )
...
* feat: update ROCM and use smaller image
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add call to ldconfig to fix AMDs broken library packages
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-05-03 18:46:49 +02:00
3754f154ee
feat: organize Dockerfile into distinct sections ( #2181 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-30 10:12:19 +02:00
987b7ad42d
feat: only keep the build artifacts from the grpc build ( #2172 )
...
* feat: only keep the build artifacts from the grpc build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove separate Cache GRPC build step
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove docker inspect step, it is leftover from previous debugging
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-28 19:24:16 +00:00
9fc0135991
feat: cleanup Dockerfile and make final image a little smaller ( #2146 )
...
* feat: cleanup Dockerfile and make final image a little smaller
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add build-essential to final stage
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: correct for another cause of GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: generate new GRPC cache automatically if needed
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-27 19:48:20 +02:00
b664edde29
feat(rerankers): Add new backend, support jina rerankers API ( #2121 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-25 00:19:02 +02:00
3411e072ca
Fix cleanup sonarqube findings ( #2106 )
...
* fix: update dockerignore and gitignore to exclude sonarqube work dir
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: remove useless equality check
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use sonarqube Dockerfile recommendations
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-23 18:43:00 +02:00
13012cfa70
feat: better control of GRPC docker cache ( #2070 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-18 16:19:36 -04:00
0fdff26924
feat(parler-tts): Add new backend ( #2027 )
...
* feat(parler-tts): Add new backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(parler-tts): try downgrade protobuf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(parler-tts): add parler conda env
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "feat(parler-tts): try downgrade protobuf"
This reverts commit bd5941d5cfc00676b45a99f71debf3c34249cf3c.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* deps: add grpc
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: try to gen proto with same environment
* workaround
* Revert "fix: try to gen proto with same environment"
This reverts commit 998c745e2f
.
* Workaround fixup
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Dave <dave@gray101.com >
2024-04-13 18:59:21 +02:00
1981154f49
fix: dont commit generated files to git ( #1993 )
...
* fix: initial work towards not committing generated files to the repository
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: improve build docs
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: remove unused folder from .dockerignore and .gitignore
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: attempt to fix extra backend tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: attempt to fix other tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more test fixes
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: fix apple tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more extras tests fixes
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add GOBIN to PATH in docker build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: extra tests and Dockerfile corrections
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: remove build dependency checks
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add golang protobuf compilers to tests-linux action
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: ensure protogen is run for extra backend installs
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use newer protobuf
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more missing protoc binaries
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: missing dependencies during docker build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: don't install grpc compilers in the final stage if they aren't needed
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: python-grpc-tools in 22.04 repos is too old
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add a couple of extra build dependencies to Makefile
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: unbreak container rebuild functionality
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-13 09:37:32 +02:00
93702e39d4
feat(build): adjust number of parallel make jobs ( #1915 )
...
* feat(build): adjust number of parallel make jobs
* fix: update make on MacOS from brew to support --output-sync argument
* fix: cache grpc with version as part of key to improve validity of cache hits
* fix: use gmake for tests-apple to use the updated GNU make version
* fix: actually use the new make version for tests-apple
* feat: parallelize tests-extra
* feat: attempt to cache grpc build for docker images
* fix: don't quote GRPC version
* fix: don't cache go modules, we have limited cache space, better used elsewhere
* fix: release with the same version of go that we test with
* fix: don't fail on exporting cache layers
* fix: remove deprecated BUILD_GRPC docker arg from Makefile
2024-03-29 22:32:40 +01:00
6cf99527f8
docs(aio): Add All-in-One images docs ( #1887 )
...
* docs(aio): Add AIO images docs
* add image generation link to quickstart
* while reviewing I noticed this one link was missing, so quickly adding it.
Signed-off-by: Dave <dave@gray101.com >
Co-authored-by: Dave <dave@gray101.com >
2024-03-25 02:01:30 +00:00
ed5734ae25
test/fix: OSX Test Repair ( #1843 )
...
* test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently.
* fix testPrompt slightly
* Sad Experiment: Test GH runner without metal?
* break apart CGO_LDFLAGS
* switch runner
* upstream llama.cpp disables Metal on Github CI!
* missed a dir from clean-tests
* CGO_LDFLAGS
* tmate failure + NO_ACCELERATE
* whisper.cpp has a metal fix
* do the exact opposite of the name of this branch, but keep it around for unrelated fixes?
* add back newlines
* add tmate to linux for testing
* update fixtures
* timeout for tmate
2024-03-18 19:19:43 +01:00