chore(model gallery): add gemma-3-4b-it (#5008)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Ettore Di Giacinto 2025-03-13 09:47:01 +01:00 committed by GitHub
parent 87ca801f00
commit 8d16a0a536

@@ -38,6 +38,20 @@
    - filename: gemma-3-12b-it-Q4_K_M.gguf
      sha256: 7bb69bff3f48a7b642355d64a90e481182a7794707b3133890646b1efa778ff5
      uri: huggingface://ggml-org/gemma-3-12b-it-GGUF/gemma-3-12b-it-Q4_K_M.gguf
- !!merge <<: *gemma3
  name: "gemma-3-4b-it"
  urls:
    - https://ai.google.dev/gemma/docs/core
    - https://huggingface.co/ggml-org/gemma-3-4b-it-GGUF
  description: |
    Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. Gemma 3 models are multimodal, handling text and image input and generating text output, with open weights for both pre-trained variants and instruction-tuned variants. Gemma 3 has a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions. Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Their relatively small size makes it possible to deploy them in environments with limited resources such as laptops, desktops or your own cloud infrastructure, democratizing access to state of the art AI models and helping foster innovation for everyone. Gemma-3-4b-it is a 4 billion parameter model.
  overrides:
    parameters:
      model: gemma-3-4b-it-Q4_K_M.gguf
  files:
    - filename: gemma-3-4b-it-Q4_K_M.gguf
      sha256: 882e8d2db44dc554fb0ea5077cb7e4bc49e7342a1f0da57901c0802ea21a0863
      uri: huggingface://ggml-org/gemma-3-4b-it-GGUF/gemma-3-4b-it-Q4_K_M.gguf
- &phi4
  url: "github:mudler/LocalAI/gallery/phi-4-chat.yaml@master"
  name: "phi-4"