LocalAI/backend/cpp
Ettore Di Giacinto d4c1746c7d
feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache (#4329)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-12-06 10:23:59 +01:00
..
grpc fix: speedup git submodule update with --single-branch (#2847) 2024-07-13 22:32:25 +02:00
llama feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache (#4329) 2024-12-06 10:23:59 +01:00