chore(model gallery): add fusechat-gemma-2-9b-instruct

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-12-24 06:46:39 +00:00 · 2024-12-14 11:25:23 +01:00 · 2024-12-14 11:25:23 +01:00 · a55da2aa40
commit a55da2aa40
parent fc4a714992
1 changed files with 15 additions and 0 deletions
--- a/gallery/index.yaml
+++ b/gallery/index.yaml
@ -5479,6 +5479,21 @@
    - filename: BgGPT-Gemma-2-2.6B-IT-v1.0.Q4_K_M.gguf
      sha256: 1e92fe80ccad80e97076ee26b002c2280f075dfe2507d534b46a4391a077f319
      uri: huggingface://QuantFactory/BgGPT-Gemma-2-2.6B-IT-v1.0-GGUF/BgGPT-Gemma-2-2.6B-IT-v1.0.Q4_K_M.gguf
 - !!merge <<: *gemma
  name: "fusechat-gemma-2-9b-instruct"
  icon: "https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct/resolve/main/FuseChat-3.0.png"
  urls:
    - https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct
    - https://huggingface.co/bartowski/FuseChat-Gemma-2-9B-Instruct-GGUF
  description: |
    We present FuseChat-3.0, a series of models crafted to enhance performance by integrating the strengths of multiple source LLMs into more compact target LLMs. To achieve this fusion, we utilized four powerful source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. For the target LLMs, we employed three widely-used smaller models—Llama-3.1-8B-Instruct, Gemma-2-9B-It, and Qwen-2.5-7B-Instruct—along with two even more compact models—Llama-3.2-3B-Instruct and Llama-3.2-1B-Instruct. The implicit model fusion process involves a two-stage training pipeline comprising Supervised Fine-Tuning (SFT) to mitigate distribution discrepancies between target and source LLMs, and Direct Preference Optimization (DPO) for learning preferences from multiple source LLMs. The resulting FuseChat-3.0 models demonstrated substantial improvements in tasks related to general conversation, instruction following, mathematics, and coding. Notably, when Llama-3.1-8B-Instruct served as the target LLM, our fusion approach achieved an average improvement of 6.8 points across 14 benchmarks. Moreover, it showed significant improvements of 37.1 and 30.1 points on instruction-following test sets AlpacaEval-2 and Arena-Hard respectively. We have released the FuseChat-3.0 models on Huggingface, stay tuned for the forthcoming dataset and code.
  overrides:
    parameters:
      model: FuseChat-Gemma-2-9B-Instruct-Q4_K_M.gguf
  files:
    - filename: FuseChat-Gemma-2-9B-Instruct-Q4_K_M.gguf
      sha256: f5aef201be68f344bebff3433af87aac6428fd227adfd7e468c8bfbcf9660ece
      uri: huggingface://bartowski/FuseChat-Gemma-2-9B-Instruct-GGUF/FuseChat-Gemma-2-9B-Instruct-Q4_K_M.gguf
 - &llama3
  url: "github:mudler/LocalAI/gallery/llama3-instruct.yaml@master"
  icon: https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png