mirror of https://github.com/mudler/LocalAI.git synced 2025-06-18 06:58:09 +00:00

Go to file

Ludovic Leroux 0135e1e3b9 fix: vllm - use AsyncLLMEngine to allow true streaming mode (#1749 )

* fix: use vllm AsyncLLMEngine to bring true stream

Current vLLM implementation uses the LLMEngine, which was designed for offline batch inference, which results in the streaming mode outputing all blobs at once at the end of the inference.

This PR reworks the gRPC server to use asyncio and gRPC.aio, in combination with vLLM's AsyncLLMEngine to bring true stream mode.

This PR also passes more parameters to vLLM during inference (presence_penalty, frequency_penalty, stop, ignore_eos, seed, ...).

* Remove unused import

2024-02-24 11:48:45 +01:00

.github

Build docker container for ROCm (#1595 )

2024-02-16 15:08:50 +01:00

.vscode

feat: Add more test-cases and remove dev container (#433 )

2023-05-30 13:01:55 +02:00

api

feat(upload-api): do not display error if uploadedFiles.json is not present

2024-02-22 00:15:08 +01:00

backend

fix: vllm - use AsyncLLMEngine to allow true streaming mode (#1749 )

2024-02-24 11:48:45 +01:00

core

MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 )

2024-02-21 01:21:19 +00:00

custom-ca-certs

feat(certificates): add support for custom CA certificates (#880 )

2023-11-01 20:10:14 +01:00

docs

⬆️ Update docs version mudler/LocalAI (#1718 )

2024-02-16 15:11:53 +01:00

embedded

examples(mistral-openorca): add stopword

2024-02-22 00:15:08 +01:00

examples

examples(phi-2): strip newline at the end of the prompt template

2024-02-21 23:17:51 +01:00

internal

feat: cleanups, small enhancements

2023-07-04 18:58:19 +02:00

metrics

Revert "[Refactor]: Core/API Split" (#1550 )

2024-01-05 18:04:46 +01:00

models

Add docker-compose

2023-04-13 01:13:14 +02:00

pkg

MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 )

2024-02-21 01:21:19 +00:00

prompt-templates

Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )

2023-09-22 11:22:17 +02:00

tests

MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 )

2024-02-21 01:21:19 +00:00

.dockerignore

Remove .git from .dockerignore

2023-07-06 21:25:10 +02:00

.env

feat: initial watchdog implementation (#1341 )

2023-11-26 18:36:23 +01:00

.gitattributes

Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838 )

2023-07-30 15:27:43 +02:00

.gitignore

Revert "[Refactor]: Core/API Split" (#1550 )

2024-01-05 18:04:46 +01:00

.gitmodules

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

assets.go

feat: Update gpt4all, support multiple implementations in runtime (#472 )

2023-06-01 23:38:52 +02:00

CONTRIBUTING.md

Add the CONTRIBUTING.md (#1098 )

2023-09-24 14:54:55 +02:00

docker-compose.yaml

fix: update docker-compose.yaml (#1131 )

2023-10-05 22:13:18 +02:00

Dockerfile

Build docker container for ROCm (#1595 )

2024-02-16 15:08:50 +01:00

Earthfile

Rename project to LocalAI (#35 )

2023-04-19 18:43:10 +02:00

Entitlements.plist

Feat: OSX Local Codesigning (#1319 )

2023-11-23 15:22:54 +01:00

entrypoint.sh

feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends (#1689 )

2024-02-08 20:12:51 +01:00

go.mod

Initial implementation of upload files api. (#1703 )

2024-02-18 10:12:02 +00:00

go.sum

Initial implementation of upload files api. (#1703 )

2024-02-18 10:12:02 +00:00

LICENSE

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

main.go

MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 )

2024-02-21 01:21:19 +00:00

Makefile

⬆️ Update ggerganov/llama.cpp (#1750 )

2024-02-24 00:06:46 +01:00

README.md

Update README.md (#1739 )

2024-02-22 16:35:06 +01:00

renovate.json

ci: manually update deps

2023-05-04 15:01:29 +02:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models 🚀 Roadmap

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI API specifications for local inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU.

🔥🔥 Hot topics / Roadmap

Roadmap

Parallel function calling: https://github.com/mudler/LocalAI/pull/1726
Upload file API: https://github.com/mudler/LocalAI/pull/1703
Tools API support: https://github.com/mudler/LocalAI/pull/1715
LLaVa 1.6: https://github.com/mudler/LocalAI/pull/1714
ROCm container images: https://github.com/mudler/LocalAI/pull/1595
Intel GPU support (sycl): https://github.com/mudler/LocalAI/issues/1653
Deprecation of old backends: https://github.com/mudler/LocalAI/issues/1651
Mamba support: https://github.com/mudler/LocalAI/pull/1589
Start and share models with config file: https://github.com/mudler/LocalAI/pull/1522
🐸 Coqui: https://github.com/mudler/LocalAI/pull/1489
Img2vid https://github.com/mudler/LocalAI/pull/1442

Hot topics (looking for contributors):

Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

💻 Getting started

For a detailed step-by-step introduction, refer to the Getting Started guide. For those in a hurry, here's a straightforward one-liner to launch a LocalAI instance with phi-2 using docker:

docker run -ti -p 8080:8080 localai/localai:v2.7.0-ffmpeg-core phi-2

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI functions 🆕
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🆕 Vision API

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

Model galleries

https://github.com/go-skynet/model-gallery

UI / Management Programs

LocalAI Manager

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

🆕 New! LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:


Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on lamdalabs!

And a huge shout-out to individuals sponsoring the project by donating hardware or backing the project.

Sponsor list
JDAM00 (donating HW for the CI)

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

Description

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Readme MIT 105 MiB

Languages

Go 88.2%

Python 3.2%

JavaScript 2.9%

HTML 2.7%

Makefile 1%

Other 1.9%

README.md Unescape Escape

LocalAI

🔥🔥 Hot topics / Roadmap

💻 Getting started

🚀 Features

💻 Usage

🔗 Community and integrations

🔗 Resources

📖 🎥 Media, Blogs, Social

Citation

❤️ Sponsors

🌟 Star history

📖 License

🙇 Acknowledgements

🤗 Contributors

README.md