ExternalVendorCode/LocalAI

Fork 0

mirror of https://github.com/mudler/LocalAI.git synced 2025-06-21 08:09:21 +00:00

Go to file

Ettore Di Giacinto badbd212f7

Explorer deployment / build-linux (push) Waiting to run

Details

GPU tests / ubuntu-latest (1.21.x) (push) Waiting to run

Details

generate and publish GRPC docker caches / generate_caches (ubuntu:22.04, linux/amd64,linux/arm64, ubuntu-latest) (push) Waiting to run

Details

generate and publish intel docker caches / generate_caches (intel/oneapi-basekit:2025.0.0-0-devel-ubuntu22.04, linux/amd64, ubuntu-latest) (push) Waiting to run

Details

build container images / hipblas-jobs (-aio-gpu-hipblas, rocm/dev-ubuntu-22.04:6.1, hipblas, true, ubuntu:22.04, extras, latest-gpu-hipblas, latest-aio-gpu-hipblas, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -hipblas-ffmpeg) (push) Waiting to run

Details

build container images / hipblas-jobs (rocm/dev-ubuntu-22.04:6.1, hipblas, false, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -hipblas-core) (push) Waiting to run

Details

build container images / hipblas-jobs (rocm/dev-ubuntu-22.04:6.1, hipblas, false, ubuntu:22.04, extras, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -hipblas) (push) Waiting to run

Details

build container images / hipblas-jobs (rocm/dev-ubuntu-22.04:6.1, hipblas, true, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -hipblas-ffmpeg-core) (push) Waiting to run

Details

build container images / self-hosted-jobs (-aio-gpu-intel-f16, quay.io/go-skynet/intel-oneapi-base:latest, sycl_f16, true, ubuntu:22.04, extras, latest-gpu-intel-f16, latest-aio-gpu-intel-f16, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -sycl-f16-ffmpeg) (push) Waiting to run

Details

build container images / self-hosted-jobs (-aio-gpu-intel-f32, quay.io/go-skynet/intel-oneapi-base:latest, sycl_f32, true, ubuntu:22.04, extras, latest-gpu-intel-f32, latest-aio-gpu-intel-f32, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -sycl-f32-ffmpeg) (push) Waiting to run

Details

build container images / self-hosted-jobs (-aio-gpu-nvidia-cuda-11, ubuntu:22.04, cublas, 11, 7, true, extras, latest-gpu-nvidia-cuda-11, latest-aio-gpu-nvidia-cuda-11, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -cublas-cuda11-ffmpeg) (push) Waiting to run

Details

build container images / self-hosted-jobs (-aio-gpu-nvidia-cuda-12, ubuntu:22.04, cublas, 12, 0, true, extras, latest-gpu-nvidia-cuda-12, latest-aio-gpu-nvidia-cuda-12, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -cublas-cuda12-ffmpeg) (push) Waiting to run

Details

build container images / self-hosted-jobs (quay.io/go-skynet/intel-oneapi-base:latest, sycl_f16, false, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -sycl-f16-core) (push) Waiting to run

Details

build container images / self-hosted-jobs (quay.io/go-skynet/intel-oneapi-base:latest, sycl_f16, true, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -sycl-f16-ffmpeg-core) (push) Waiting to run

Details

build container images / self-hosted-jobs (quay.io/go-skynet/intel-oneapi-base:latest, sycl_f32, false, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -sycl-f32-core) (push) Waiting to run

Details

build container images / self-hosted-jobs (quay.io/go-skynet/intel-oneapi-base:latest, sycl_f32, true, ubuntu:22.04, core, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -sycl-f32-ffmpeg-core) (push) Waiting to run

Details

build container images / self-hosted-jobs (ubuntu:22.04, , , extras, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, ) (push) Waiting to run

Details

build container images / self-hosted-jobs (ubuntu:22.04, , true, extras, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, auto, -ffmpeg) (push) Waiting to run

Details

build container images / self-hosted-jobs (ubuntu:22.04, cublas, 11, 7, , extras, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda11) (push) Waiting to run

Details

build container images / self-hosted-jobs (ubuntu:22.04, cublas, 12, 0, , extras, --jobs=3 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda12) (push) Waiting to run

Details

build container images / core-image-build (-aio-cpu, ubuntu:22.04, , true, core, latest-cpu, latest-aio-cpu, --jobs=4 --output-sync=target, linux/amd64,linux/arm64, arc-runner-set, auto, -ffmpeg-core) (push) Waiting to run

Details

build container images / core-image-build (ubuntu:22.04, cublas, 11, 7, , core, --jobs=4 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda11-core) (push) Waiting to run

Details

build container images / core-image-build (ubuntu:22.04, cublas, 11, 7, true, core, --jobs=4 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda11-ffmpeg-core) (push) Waiting to run

Details

build container images / core-image-build (ubuntu:22.04, cublas, 12, 0, , core, --jobs=4 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda12-core) (push) Waiting to run

Details

build container images / core-image-build (ubuntu:22.04, cublas, 12, 0, true, core, --jobs=4 --output-sync=target, linux/amd64, arc-runner-set, false, -cublas-cuda12-ffmpeg-core) (push) Waiting to run

Details

build container images / core-image-build (ubuntu:22.04, vulkan, true, core, latest-vulkan-ffmpeg-core, --jobs=4 --output-sync=target, linux/amd64, arc-runner-set, false, -vulkan-ffmpeg-core) (push) Waiting to run

Details

Security Scan / tests (push) Waiting to run

Details

Tests extras backends / tests-transformers (push) Waiting to run

Details

Tests extras backends / tests-sentencetransformers (push) Waiting to run

Details

Tests extras backends / tests-rerankers (push) Waiting to run

Details

Tests extras backends / tests-diffusers (push) Waiting to run

Details

Tests extras backends / tests-parler-tts (push) Waiting to run

Details

Tests extras backends / tests-openvoice (push) Waiting to run

Details

Tests extras backends / tests-transformers-musicgen (push) Waiting to run

Details

Tests extras backends / tests-vallex (push) Waiting to run

Details

Tests extras backends / tests-coqui (push) Waiting to run

Details

tests / tests-linux (1.21.x) (push) Waiting to run

Details

tests / tests-aio-container (push) Waiting to run

Details

tests / tests-apple (1.21.x) (push) Waiting to run

Details

chore(model gallery): add tq2.5-14b-neon-v1 (#4441 )

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2024-12-20 16:11:16 +01:00

.bruno/LocalAI Test Requests

chore: drop examples folder now that LocalAI-examples has been created (#4017 )

2024-10-30 09:10:33 +01:00

.devcontainer

feat: devcontainer part 4 (#3339 )

2024-08-20 19:25:22 +02:00

.devcontainer-scripts

test: preliminary tests and merge fix for authv2 (#3584 )

2024-09-24 09:32:48 +02:00

.github

chore(ci): set auto-labeler for dependencies

2024-12-04 18:35:54 +01:00

.vscode

feat: Initial Version of vscode DevContainer (#3217 )

2024-08-14 09:06:41 +02:00

aio

feat(backends): Drop bert.cpp (#4272 )

2024-11-27 16:34:28 +01:00

backend

fix(openvoice): pin numpy before installing torch (#4439 )

2024-12-20 10:34:23 +01:00

configuration

refactor: move remaining api packages to core (#1731 )

2024-03-01 16:19:53 +01:00

core

feat: stream tokens usage (#4415 )

2024-12-18 09:48:50 +01:00

custom-ca-certs

feat(certificates): add support for custom CA certificates (#880 )

2023-11-01 20:10:14 +01:00

docs

chore(docs): patch p2p detail in env and docs (#4434 )

2024-12-19 15:19:31 +01:00

embedded

feat(backends): Drop bert.cpp (#4272 )

2024-11-27 16:34:28 +01:00

examples

chore: create examples/README to redirect to the new repository

2024-10-30 09:11:32 +01:00

gallery

chore(model gallery): add tq2.5-14b-neon-v1 (#4441 )

2024-12-20 16:11:16 +01:00

internal

feat: cleanups, small enhancements

2023-07-04 18:58:19 +02:00

models

Add docker-compose

2023-04-13 01:13:14 +02:00

pkg

feat: stream tokens usage (#4415 )

2024-12-18 09:48:50 +01:00

prompt-templates

Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )

2023-09-22 11:22:17 +02:00

scripts

chore(scripts): handle summarization errors (#4271 )

2024-11-26 14:51:55 +01:00

swagger

feat(swagger): update swagger (#4211 )

2024-11-20 23:10:51 +01:00

tests

fix(rwkv model): add stoptoken (#4283 )

2024-11-28 09:34:35 +01:00

.dockerignore

feat: Initial Version of vscode DevContainer (#3217 )

2024-08-14 09:06:41 +02:00

.editorconfig

feat(stores): Vector store backend (#1795 )

2024-03-22 21:14:04 +01:00

.env

chore(docs): patch p2p detail in env and docs (#4434 )

2024-12-19 15:19:31 +01:00

.gitattributes

chore(linguist): add *.hpp files to linguist-vendored (#4154 )

2024-11-14 14:12:16 +01:00

.gitignore

feat(bark-cpp): add new bark.cpp backend (#4287 )

2024-11-28 22:16:44 +01:00

.gitmodules

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

.yamllint

fix: yamlint warnings and errors (#2131 )

2024-04-25 17:25:56 +00:00

assets.go

feat: Update gpt4all, support multiple implementations in runtime (#472 )

2023-06-01 23:38:52 +02:00

CONTRIBUTING.md

Update CONTRIBUTING.md (#3723 )

2024-10-03 20:03:35 +02:00

docker-compose.yaml

feat: Initial Version of vscode DevContainer (#3217 )

2024-08-14 09:06:41 +02:00

Dockerfile

fix(container-images): install uv as system package (#4094 )

2024-11-08 11:47:43 +01:00

Dockerfile.aio

feat(aio): entrypoint, update workflows (#1872 )

2024-03-21 22:09:04 +01:00

Earthfile

Rename project to LocalAI (#35 )

2023-04-19 18:43:10 +02:00

Entitlements.plist

Feat: OSX Local Codesigning (#1319 )

2023-11-23 15:22:54 +01:00

entrypoint.sh

deps(llama.cpp): bump to latest, update build variables (#2669 )

2024-06-27 23:10:04 +02:00

go.mod

feat(template): read jinja templates from gguf files (#4332 )

2024-12-08 13:50:33 +01:00

go.sum

feat(template): read jinja templates from gguf files (#4332 )

2024-12-08 13:50:33 +01:00

LICENSE

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

main.go

chore: fix go.mod module (#2635 )

2024-06-23 08:24:36 +00:00

Makefile

chore: ⬆️ Update ggerganov/llama.cpp to d408bb9268a988c5a60a5746d3a6430386e7604d (#4437 )

2024-12-19 23:03:47 +00:00

README.md

Update README.md

2024-12-04 11:31:08 +01:00

renovate.json

ci: manually update deps

2023-05-04 15:01:29 +02:00

SECURITY.md

Create SECURITY.md

2024-02-29 19:53:04 +01:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 🖼️ Models 🚀 Roadmap 🥽 Demo 🌍 Explorer 🛫 Examples

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI (Elevenlabs, Anthropic... ) API specifications for local AI inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU. It is created and maintained by Ettore Di Giacinto.

Run the installer script:

curl https://localai.io/install.sh | sh

Or run with docker:

# CPU only image:
docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-cpu

# Nvidia GPU:
docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-gpu-nvidia-cuda-12

# CPU and GPU image (bigger size):
docker run -ti --name local-ai -p 8080:8080 localai/localai:latest

# AIO images (it will pre-download a set of models ready for use, see https://localai.io/basics/container/)
docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu

To load models:

# From the model gallery (see available models with `local-ai models list`, in the WebUI from the model tab, or visiting https://models.localai.io)
local-ai run llama-3.2-1b-instruct:q4_k_m
# Start LocalAI with the phi-2 model directly from huggingface
local-ai run huggingface://TheBloke/phi-2-GGUF/phi-2.Q8_0.gguf
# Install and run a model from the Ollama OCI registry
local-ai run ollama://gemma:2b
# Run a model from a configuration file
local-ai run https://gist.githubusercontent.com/.../phi-2.yaml
# Install and run a model from a standard OCI registry (e.g., Docker Hub)
local-ai run oci://localai/phi-2:latest

💻 Getting started

📰 Latest project news

Dec 2024: stablediffusion.cpp backend (ggml) added ( https://github.com/mudler/LocalAI/pull/4289 )
Nov 2024: Bark.cpp backend added ( https://github.com/mudler/LocalAI/pull/4287 )
Nov 2024: Voice activity detection models (VAD) added to the API: https://github.com/mudler/LocalAI/pull/4204
Oct 2024: examples moved to LocalAI-examples
Aug 2024: 🆕 FLUX-1, P2P Explorer
July 2024: 🔥🔥 🆕 P2P Dashboard, LocalAI Federated mode and AI Swarms: https://github.com/mudler/LocalAI/pull/2723
June 2024: 🆕 You can browse now the model gallery without LocalAI! Check out https://models.localai.io
June 2024: Support for models from OCI registries: https://github.com/mudler/LocalAI/pull/2628
May 2024: 🔥🔥 Decentralized P2P llama.cpp: https://github.com/mudler/LocalAI/pull/2343 (peer2peer llama.cpp!) 👉 Docs https://localai.io/features/distribute/
May 2024: 🔥🔥 Openvoice: https://github.com/mudler/LocalAI/pull/2334
May 2024: 🆕 Function calls without grammars and mixed mode: https://github.com/mudler/LocalAI/pull/2328
May 2024: 🔥🔥 Distributed inferencing: https://github.com/mudler/LocalAI/pull/2324
May 2024: Chat, TTS, and Image generation in the WebUI: https://github.com/mudler/LocalAI/pull/2222
April 2024: Reranker API: https://github.com/mudler/LocalAI/pull/2121

Roadmap items: List of issues

🔥🔥 Hot topics (looking for help):

Multimodal with vLLM and Video understanding: https://github.com/mudler/LocalAI/pull/3729
Realtime API https://github.com/mudler/LocalAI/issues/3714
🔥🔥 Distributed, P2P Global community pools: https://github.com/mudler/LocalAI/issues/3113
WebUI improvements: https://github.com/mudler/LocalAI/issues/2156
Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273
Moderation endpoint: https://github.com/mudler/LocalAI/issues/999
Vulkan: https://github.com/mudler/LocalAI/issues/1647
Anthropic API: https://github.com/mudler/LocalAI/issues/1808

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI-alike tools API
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🥽 Vision API
📈 Reranker API
🆕🖧 P2P Inferencing
🌍 Integrated WebUI!

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

https://github.com/Jirubizu/localai-admin
https://github.com/go-skynet/LocalAI-frontend
QA-Pilot(An interactive chat project that leverages LocalAI LLMs for rapid understanding and navigation of GitHub code repository) https://github.com/reid41/QA-Pilot

Model galleries

https://github.com/go-skynet/model-gallery

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Terminal utility https://github.com/djcopley/ShellOracle
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation / https://github.com/valentinfrlch/ha-gpt4vision
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Shell-Pilot(Interact with LLM using LocalAI models via pure shell scripts on your Linux or MacOS system) https://github.com/reid41/shell-pilot
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Another Telegram Bot https://github.com/JackBekket/Hellper
Auto-documentation https://github.com/JackBekket/Reflexia
Github bot which answer on issues, with code and documentation as context https://github.com/JackBekket/GitHelper
Github Actions: https://github.com/marketplace/actions/start-localai
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project covering CI expenses, and our Sponsor list:

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto mudler@localai.io

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

Description

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Readme MIT 105 MiB

Languages

Go 88.2%

Python 3.2%

JavaScript 2.9%

HTML 2.7%

Makefile 1%

Other 1.9%

README.md Unescape Escape

LocalAI

📰 Latest project news

🔥🔥 Hot topics (looking for help):

🚀 Features

💻 Usage

🔗 Community and integrations

🔗 Resources

📖 🎥 Media, Blogs, Social

Citation

❤️ Sponsors

🌟 Star history

📖 License

🙇 Acknowledgements

🤗 Contributors

README.md