LocalAI/aio/cpu/text-to-text.yaml

name: gpt-4
mmap: true
parameters:
  model: huggingface://NousResearch/Hermes-2-Pro-Mistral-7B-GGUF/Hermes-2-Pro-Mistral-7B.Q2_K.gguf

template:
  chat_message: |
    <|im_start|>{{if eq .RoleName "assistant"}}assistant{{else if eq .RoleName "system"}}system{{else if eq .RoleName "tool"}}tool{{else if eq .RoleName "user"}}user{{end}}
    {{- if .FunctionCall }}
    <tool_call>
    {{- else if eq .RoleName "tool" }}
    <tool_response>
    {{- end }}
    {{- if .Content}}
    {{.Content }}
    {{- end }}
    {{- if .FunctionCall}}
    {{toJson .FunctionCall}}
    {{- end }}
    {{- if .FunctionCall }}
    </tool_call>
    {{- else if eq .RoleName "tool" }}
    </tool_response>
    {{- end }}<|im_end|>
  # https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B-GGUF#prompt-format-for-function-calling
  function: |
    <|im_start|>system
    You are a function calling AI model. You are provided with function signatures within <tools></tools> XML tags. You may call one or more functions to assist with the user query. Don't make assumptions about what values to plug into functions. Here are the available tools:
    <tools>
    {{range .Functions}}
    {'type': 'function', 'function': {'name': '{{.Name}}', 'description': '{{.Description}}', 'parameters': {{toJson .Parameters}} }}
    {{end}}
    </tools>
    Use the following pydantic model json schema for each tool call you will make:
    {'title': 'FunctionCall', 'type': 'object', 'properties': {'arguments': {'title': 'Arguments', 'type': 'object'}, 'name': {'title': 'Name', 'type': 'string'}}, 'required': ['arguments', 'name']}
    For each function call return a json object with function name and arguments within <tool_call></tool_call> XML tags as follows:
    <tool_call>
    {'arguments': <args-dict>, 'name': <function-name>}
    </tool_call><|im_end|>
    {{.Input -}}
    <|im_start|>assistant
    <tool_call>
  chat: |
    {{.Input -}}
    <|im_start|>assistant
  completion: |
    {{.Input}}
context_size: 4096
f16: true
stopwords:
- <|im_end|>
- <dummy32000>
- "\n</tool_call>"
- "\n\n\n"
usage: |
      curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
          "model": "gpt-4",
          "messages": [{"role": "user", "content": "How are you doing?", "temperature": 0.1}]
      }'
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`name: gpt-4`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`mmap: true`
			`parameters:`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`model: huggingface://NousResearch/Hermes-2-Pro-Mistral-7B-GGUF/Hermes-2-Pro-Mistral-7B.Q2_K.gguf`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00
			`template:`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`chat_message: \|`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`<\|im_start\|>{{if eq .RoleName "assistant"}}assistant{{else if eq .RoleName "system"}}system{{else if eq .RoleName "tool"}}tool{{else if eq .RoleName "user"}}user{{end}}`
models(llama3): add llama3 to embedded models (#2074) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-19 16:23:44 +00:00			`{{- if .FunctionCall }}`
			`<tool_call>`
			`{{- else if eq .RoleName "tool" }}`
			`<tool_response>`
			`{{- end }}`
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines (#1966) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-07 16:23:47 +00:00			`{{- if .Content}}`
models(llama3): add llama3 to embedded models (#2074) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-19 16:23:44 +00:00			`{{.Content }}`
			`{{- end }}`
			`{{- if .FunctionCall}}`
			`{{toJson .FunctionCall}}`
			`{{- end }}`
			`{{- if .FunctionCall }}`
			`</tool_call>`
			`{{- else if eq .RoleName "tool" }}`
			`</tool_response>`
models(gallery): add new models to the gallery (#2124) * models: add reranker and parler-tts-mini Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: chatml im_end should not have a newline Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(noromaid): add Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add 70b, add dolphin2.9 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add unholy-8b Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add therapyllama3, aura Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-24 23:28:02 +00:00			`{{- end }}<\|im_end\|>`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`# https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B-GGUF#prompt-format-for-function-calling`
			`function: \|`
			`<\|im_start\|>system`
			`You are a function calling AI model. You are provided with function signatures within <tools></tools> XML tags. You may call one or more functions to assist with the user query. Don't make assumptions about what values to plug into functions. Here are the available tools:`
			`<tools>`
			`{{range .Functions}}`
			`{'type': 'function', 'function': {'name': '{{.Name}}', 'description': '{{.Description}}', 'parameters': {{toJson .Parameters}} }}`
			`{{end}}`
			`</tools>`
			`Use the following pydantic model json schema for each tool call you will make:`
			`{'title': 'FunctionCall', 'type': 'object', 'properties': {'arguments': {'title': 'Arguments', 'type': 'object'}, 'name': {'title': 'Name', 'type': 'string'}}, 'required': ['arguments', 'name']}`
			`For each function call return a json object with function name and arguments within <tool_call></tool_call> XML tags as follows:`
			`<tool_call>`
			`{'arguments': <args-dict>, 'name': <function-name>}`
models(gallery): add new models to the gallery (#2124) * models: add reranker and parler-tts-mini Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: chatml im_end should not have a newline Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(noromaid): add Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add 70b, add dolphin2.9 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add unholy-8b Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(llama3): add therapyllama3, aura Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-24 23:28:02 +00:00			`</tool_call><\|im_end\|>`
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines (#1966) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-07 16:23:47 +00:00			`{{.Input -}}`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`<\|im_start\|>assistant`
			`<tool_call>`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`chat: \|`
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines (#1966) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-07 16:23:47 +00:00			`{{.Input -}}`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`<\|im_start\|>assistant`
			`completion: \|`
			`{{.Input}}`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`context_size: 4096`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`f16: true`
			`stopwords:`
			`- <\|im_end\|>`
			`- <dummy32000>`
fix(hermes-2-pro-mistral): add stopword for toolcall (#1939) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-01 09:48:35 +00:00			`- "\n</tool_call>"`
fix(hermes-2-pro-mistral): correct stopwords (#1947) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> 2024-04-02 13:38:00 +00:00			`- "\n\n\n"`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`usage: \|`
			`curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{`
fix(grammar): respect JSONmode and grammar from user input (#1935) * fix(grammar): Fix JSON mode and custom grammar * tests(aio): add jsonmode test * tests(aio): add functioncall test * fix(aio): use hermes-2-pro-mistral as llm for CPU profile * add phi-2-orange 2024-03-31 11:04:09 +00:00			`"model": "gpt-4",`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`"messages": [{"role": "user", "content": "How are you doing?", "temperature": 0.1}]`
			`}'`