LocalAI/embedded/models/llava.yaml

backend: llama-cpp
context_size: 4096
f16: true

gpu_layers: 90
mmap: true
name: llava

roles:
  user: "USER:"
  assistant: "ASSISTANT:"
  system: "SYSTEM:"

mmproj: bakllava-mmproj.gguf
parameters:
  model: bakllava.gguf
  temperature: 0.2
  top_k: 40
  top_p: 0.95
  seed: -1
mirostat: 2
mirostat_eta: 1.0
mirostat_tau: 1.0

template:
  chat: |
    A chat between a curious human and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the human's questions.
    {{.Input}}
    ASSISTANT:

download_files:
- filename: bakllava.gguf
  uri: huggingface://mys/ggml_bakllava-1/ggml-model-q4_k.gguf
- filename: bakllava-mmproj.gguf
  uri: huggingface://mys/ggml_bakllava-1/mmproj-model-f16.gguf

usage: |
    curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
        "model": "llava",
        "messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'
feat: embedded model configurations, add popular model examples, refactoring (#1532) * move downloader out * separate startup functions for preloading configuration files * docs: add popular model examples Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * shorteners * Add llava * Add mistral-openorca * Better link to build section * docs: update * fixup * Drop code dups * Minor fixups * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * ci: try to cache gRPC build during tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: do not build all images for tests, just necessary * ci: cache gRPC also in release pipeline * fixes * Update model_preload_test.go Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2024-01-05 22:16:33 +00:00			`backend: llama-cpp`
			`context_size: 4096`
			`f16: true`

			`gpu_layers: 90`
			`mmap: true`
			`name: llava`

			`roles:`
			`user: "USER:"`
			`assistant: "ASSISTANT:"`
			`system: "SYSTEM:"`

			`mmproj: bakllava-mmproj.gguf`
			`parameters:`
			`model: bakllava.gguf`
			`temperature: 0.2`
			`top_k: 40`
			`top_p: 0.95`
feat(transformers): support also text generation (#1630) * feat(transformers): support also text generation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * embedded: set seed -1 --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-01-23 22:07:31 +00:00			`seed: -1`
fix(doc/examples): set defaults to mirostat (#1820) The default sampler on some models don't return enough candidates which leads to a false sense of randomness. Tracing back the code it looks that with the temperature sampler there might not be enough candidates to pick from, and since the seed and "randomness" take effect while picking a good candidate this yields to the same results over and over. Fixes https://github.com/mudler/LocalAI/issues/1723 by updating the examples and documentation to use mirostat instead. 2024-03-11 18:49:03 +00:00			`mirostat: 2`
			`mirostat_eta: 1.0`
			`mirostat_tau: 1.0`
feat: embedded model configurations, add popular model examples, refactoring (#1532) * move downloader out * separate startup functions for preloading configuration files * docs: add popular model examples Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * shorteners * Add llava * Add mistral-openorca * Better link to build section * docs: update * fixup * Drop code dups * Minor fixups * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * ci: try to cache gRPC build during tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: do not build all images for tests, just necessary * ci: cache gRPC also in release pipeline * fixes * Update model_preload_test.go Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> 2024-01-05 22:16:33 +00:00
			`template:`
			`chat: \|`
			`A chat between a curious human and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the human's questions.`
			`{{.Input}}`
			`ASSISTANT:`

			`download_files:`
			`- filename: bakllava.gguf`
			`uri: huggingface://mys/ggml_bakllava-1/ggml-model-q4_k.gguf`
			`- filename: bakllava-mmproj.gguf`
feat: more embedded models, coqui fixes, add model usage and description (#1556) * feat: add model descriptions and usage * remove default model gallery * models: add embeddings and tts * docs: update table * docs: updates * images: cleanup pip cache after install * images: always run apt-get clean * ux: improve gRPC connection errors * ux: improve some messages * fix: fix coqui when no AudioPath is passed by * embedded: add more models * Add usage * Reorder table 2024-01-07 23:37:02 +00:00			`uri: huggingface://mys/ggml_bakllava-1/mmproj-model-f16.gguf`

			`usage: \|`
			`curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{`
			`"model": "llava",`
			`"messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'`