mirror of
https://github.com/mudler/LocalAI.git
synced 2024-12-19 04:37:53 +00:00
e49ea0123b
feat(llama.cpp): add flash_attn and no_kv_offload Signed-off-by: Ettore Di Giacinto <mudler@localai.io> |
||
---|---|---|
.. | ||
CMakeLists.txt | ||
grpc-server.cpp | ||
json.hpp | ||
Makefile | ||
prepare.sh | ||
utils.hpp |