LocalAI/gpt-vision.md at db926896bd7a0d51f8d94fc7c5a78dfbf45b0dda

mirror of https://github.com/mudler/LocalAI.git synced 2024-12-29 17:08:52 +00:00

Ettore Di Giacinto ba5ab26f2e

docs: Add llava, update hot topics (#1322 )

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2023-11-23 18:54:55 +01:00

1.2 KiB

Raw Blame History

+++ disableToc = false title = "🆕 GPT Vision" weight = 2 +++

{{% notice note %}} Available only on master builds {{% /notice %}}

LocalAI supports understanding images by using LLaVA, and implements the GPT Vision API from OpenAI.

Usage

OpenAI docs: https://platform.openai.com/docs/guides/vision

To let LocalAI understand and reply with what sees in the image, use the /v1/chat/completions endpoint, for example with curl:

curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
     "model": "llava",
     "messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'

Setup

To setup the LLaVa models, follow the full example in the configuration examples.

1.2 KiB Raw Blame History

Usage

Setup

1.2 KiB

Raw Blame History