mirror of
https://github.com/mudler/LocalAI.git
synced 2025-05-06 18:48:24 +00:00
Also change icons on GPT vision page Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
39 lines
1.9 KiB
Markdown
39 lines
1.9 KiB
Markdown
|
|
+++
|
|
disableToc = false
|
|
title = "🥽 GPT Vision"
|
|
weight = 14
|
|
url = "/features/gpt-vision/"
|
|
+++
|
|
|
|
LocalAI supports understanding images by using [LLaVA](https://llava.hliu.cc/), and implements the [GPT Vision API](https://platform.openai.com/docs/guides/vision) from OpenAI.
|
|
|
|

|
|
|
|
## Usage
|
|
|
|
OpenAI docs: https://platform.openai.com/docs/guides/vision
|
|
|
|
To let LocalAI understand and reply with what sees in the image, use the `/v1/chat/completions` endpoint, for example with curl:
|
|
|
|
```bash
|
|
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
|
|
"model": "llava",
|
|
"messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'
|
|
```
|
|
|
|
Grammars and function tools can be used as well in conjunction with vision APIs:
|
|
|
|
```bash
|
|
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
|
|
"model": "llava", "grammar": "root ::= (\"yes\" | \"no\")",
|
|
"messages": [{"role": "user", "content": [{"type":"text", "text": "Is there some grass in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'
|
|
```
|
|
|
|
### Setup
|
|
|
|
All-in-One images have already shipped the llava model as `gpt-4-vision-preview`, so no setup is needed in this case.
|
|
|
|
To setup the LLaVa models, follow the full example in the [configuration examples](https://github.com/mudler/LocalAI/blob/master/examples/configurations/README.md#llava).
|
|
|