mirror of https://github.com/mudler/LocalAI.git synced 2025-05-28 21:14:15 +00:00

Go to file

Ettore Di Giacinto ac4a94dd44

feat(build): bundle libs for arm64 and x86 linux binaries (#2572 )

This PR bundles further libs into the arm64 and x86_64 binaries

This can be improved by a lot - it's far from perfect, however in this PR I wanted to collect the required libs, and give a simple baseline to improve later upon. It is quite challenging to do this exercise with CI only - but it's the fastest way I see now. 

I hope that after the list is initially built we can further improve this down the line and remove some of the technical debt left here to speedup things and do not get stuck in the middle of CI cycles.

In this PR:

- The x86_64 binary now bundles hipblas, nvidia and intel libraries too to avoid any dependency to be installed in the host
- Similarly, for the arm64 we now bundle all the required assets

## What's left

We should be also able to cross-compile Nvidia for arm64 - however I didn't succeed so far so I've left that open. Similarly I might have missed some libraries, but we will see with bug reports and testing around with the new binaries. I've tested on my arm64 board and I could finally start things up.

An open point still is shipping libraries for e.g. tts and stablediffusion. this is not done yet, however with the same methodology we should be able to extend support also for these two backends in the binary.

2024-06-16 09:10:44 +02:00

.github

feat(build): bundle libs for arm64 and x86 linux binaries (#2572 )

2024-06-16 09:10:44 +02:00

.vscode

feat: first pass at improving logging (#1956 )

2024-04-04 09:24:22 +02:00

aio

models(gallery): add mistral-0.3 and command-r, update functions (#2388 )

2024-05-23 19:16:08 +02:00

backend

bugfix: CUDA acceleration not working (#2475 )

2024-06-03 22:41:42 +02:00

configuration

refactor: move remaining api packages to core (#1731 )

2024-03-01 16:19:53 +01:00

core

feat(guesser): identify gemma models (#2561 )

2024-06-13 19:12:37 +02:00

custom-ca-certs

feat(certificates): add support for custom CA certificates (#880 )

2023-11-01 20:10:14 +01:00

docs

Fix standard image latest Docker tags (#2574 )

2024-06-15 22:08:30 +02:00

embedded

fix: pkg/downloader should respect basePath for file:// urls (#2481 )

2024-06-04 14:32:47 +00:00

examples

docs: Update semantic-todo/README.md (#2294 )

2024-05-12 09:02:11 +02:00

gallery

models(gallery): add firefly-gemma-7b (#2576 )

2024-06-15 23:07:20 +02:00

internal

feat: cleanups, small enhancements

2023-07-04 18:58:19 +02:00

models

Add docker-compose

2023-04-13 01:13:14 +02:00

pkg

feat(darwin): embed grpc libs (#2567 )

2024-06-14 08:51:25 +02:00

prompt-templates

Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )

2023-09-22 11:22:17 +02:00

swagger

feat(swagger): update swagger (#2464 )

2024-06-01 22:04:01 +00:00

tests

test: e2e /reranker endpoint (#2211 )

2024-06-07 18:45:52 +00:00

.dockerignore

feat: migrate python backends from conda to uv (#2215 )

2024-05-10 15:08:08 +02:00

.editorconfig

feat(stores): Vector store backend (#1795 )

2024-03-22 21:14:04 +01:00

.env

feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )

2024-05-15 01:17:02 +02:00

.gitattributes

Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838 )

2023-07-30 15:27:43 +02:00

.gitignore

feat(gallery): show available models in website, allow local-ai models install to install from galleries (#2555 )

2024-06-13 00:47:16 +02:00

.gitmodules

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

.yamllint

fix: yamlint warnings and errors (#2131 )

2024-04-25 17:25:56 +00:00

assets.go

feat: Update gpt4all, support multiple implementations in runtime (#472 )

2023-06-01 23:38:52 +02:00

CONTRIBUTING.md

Update CONTRIBUTING.md

2024-04-12 15:27:40 +02:00

docker-compose.yaml

fix(docker-compose): update docker compose file (#1824 )

2024-03-13 17:57:45 +01:00

Dockerfile

chore(deps): Update Dockerfile (#2532 )

2024-06-10 08:40:02 +00:00

Dockerfile.aio

feat(aio): entrypoint, update workflows (#1872 )

2024-03-21 22:09:04 +01:00

Earthfile

Rename project to LocalAI (#35 )

2023-04-19 18:43:10 +02:00

Entitlements.plist

Feat: OSX Local Codesigning (#1319 )

2023-11-23 15:22:54 +01:00

entrypoint.sh

fix: use exec in entrypoint scripts to fix signal handling (#1943 )

2024-04-02 09:15:44 +02:00

go.mod

feat(llama.cpp): guess model defaults from file (#2522 )

2024-06-08 22:13:02 +02:00

go.sum

feat(llama.cpp): guess model defaults from file (#2522 )

2024-06-08 22:13:02 +02:00

LICENSE

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

main.go

feat(util): add util command to print GGUF informations (#2528 )

2024-06-09 19:27:42 +02:00

Makefile

⬆️ Update ggerganov/llama.cpp (#2575 )

2024-06-15 23:45:10 +00:00

README.md

Add integrations (#2535 )

2024-06-10 19:18:47 +02:00

renovate.json

ci: manually update deps

2023-05-04 15:01:29 +02:00

SECURITY.md

Create SECURITY.md

2024-02-29 19:53:04 +01:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models 🚀 Roadmap

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI (Elevenlabs, Anthropic... ) API specifications for local AI inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU. It is created and maintained by Ettore Di Giacinto.

docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu
# Alternative images:
# - if you have an Nvidia GPU:
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-aio-gpu-nvidia-cuda-12
# - without preconfigured models
# docker run -ti --name local-ai -p 8080:8080 localai/localai:latest
# - without preconfigured models for Nvidia GPUs
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-gpu-nvidia-cuda-12

💻 Getting started

🔥🔥 Hot topics / Roadmap

Roadmap

🔥🔥 Decentralized llama.cpp: https://github.com/mudler/LocalAI/pull/2343 (peer2peer llama.cpp!) 👉 Docs https://localai.io/features/distribute/
🔥🔥 Openvoice: https://github.com/mudler/LocalAI/pull/2334
🆕 Function calls without grammars and mixed mode: https://github.com/mudler/LocalAI/pull/2328
🔥🔥 Distributed inferencing: https://github.com/mudler/LocalAI/pull/2324
Chat, TTS, and Image generation in the WebUI: https://github.com/mudler/LocalAI/pull/2222
Reranker API: https://github.com/mudler/LocalAI/pull/2121

Hot topics (looking for contributors):

WebUI improvements: https://github.com/mudler/LocalAI/issues/2156
Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273
Moderation endpoint: https://github.com/mudler/LocalAI/issues/999
Vulkan: https://github.com/mudler/LocalAI/issues/1647

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI-alike tools API
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🥽 Vision API
📈 Reranker API
🆕🖧 P2P Inferencing

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

https://github.com/Jirubizu/localai-admin
https://github.com/go-skynet/LocalAI-frontend
QA-Pilot(An interactive chat project that leverages LocalAI LLMs for rapid understanding and navigation of GitHub code repository) https://github.com/reid41/QA-Pilot

Model galleries

https://github.com/go-skynet/model-gallery

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Terminal utility https://github.com/djcopley/ShellOracle
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation / https://github.com/valentinfrlch/ha-gpt4vision
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Shell-Pilot(Interact with LLM using LocalAI models via pure shell scripts on your Linux or MacOS system) https://github.com/reid41/shell-pilot
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project covering CI expenses, and our Sponsor list:

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto mudler@localai.io

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

Description

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Readme MIT 95 MiB

Languages

Go 88.6%

Python 3.1%

JavaScript 2.8%

HTML 2.5%

Makefile 1%

Other 1.8%

README.md Unescape Escape

LocalAI

🔥🔥 Hot topics / Roadmap

🚀 Features

💻 Usage

🔗 Community and integrations

🔗 Resources

📖 🎥 Media, Blogs, Social

Citation

❤️ Sponsors

🌟 Star history

📖 License

🙇 Acknowledgements

🤗 Contributors

README.md