mirror of https://github.com/mudler/LocalAI.git synced 2025-06-11 19:51:43 +00:00

Go to file

Ettore Di Giacinto 491e1d752b feat(functions): relax mixedgrammars (#2365 )

* feat(functions): relax mixedgrammars

Extend even more the functionalities and when mixed mode is enabled,
tolerate also both strings and JSON in the result - in this case we make
sure that the JSON can be correctly parsed.

This also updates the examples and the gallery model to configure the
grammar.

The changeset also breaks current function/grammar configuration as it
reserves now a stanza in the YAML config.

For example:

```yaml
function:
  grammar:
    # This allows the grammar to also return messages
    mixed_mode: true
    # Suffix to add to the grammar
    # prefix: '<tool_call>\n'
    # Force parallel calls in the grammar
    # parallel_calls: true
```

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactor, add a way to disable mixed json and freestring

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fix linting issues

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2024-05-22 00:14:16 +02:00

.github

dependencies(grpcio): bump to fix CI issues (#2362 )

2024-05-21 14:33:47 +02:00

.vscode

feat: first pass at improving logging (#1956 )

2024-04-04 09:24:22 +02:00

aio

feat(functions): relax mixedgrammars (#2365 )

2024-05-22 00:14:16 +02:00

backend

dependencies(grpcio): bump to fix CI issues (#2362 )

2024-05-21 14:33:47 +02:00

configuration

refactor: move remaining api packages to core (#1731 )

2024-03-01 16:19:53 +01:00

core

feat(functions): relax mixedgrammars (#2365 )

2024-05-22 00:14:16 +02:00

custom-ca-certs

feat(certificates): add support for custom CA certificates (#880 )

2023-11-01 20:10:14 +01:00

docs

Update openai-functions.md

2024-05-10 17:09:51 +02:00

embedded

feat(webui): statically embed js/css assets (#2348 )

2024-05-19 18:24:27 +02:00

examples

docs: Update semantic-todo/README.md (#2294 )

2024-05-12 09:02:11 +02:00

gallery

feat(functions): relax mixedgrammars (#2365 )

2024-05-22 00:14:16 +02:00

internal

feat: cleanups, small enhancements

2023-07-04 18:58:19 +02:00

models

Add docker-compose

2023-04-13 01:13:14 +02:00

pkg

feat(functions): relax mixedgrammars (#2365 )

2024-05-22 00:14:16 +02:00

prompt-templates

Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )

2023-09-22 11:22:17 +02:00

swagger

feat(swagger): update swagger (#2302 )

2024-05-12 21:00:18 +00:00

tests

fix: security scanner warning noise: error handlers part 2 (#2145 )

2024-04-29 15:11:42 +02:00

.dockerignore

feat: migrate python backends from conda to uv (#2215 )

2024-05-10 15:08:08 +02:00

.editorconfig

feat(stores): Vector store backend (#1795 )

2024-03-22 21:14:04 +01:00

.env

feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )

2024-05-15 01:17:02 +02:00

.gitattributes

Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838 )

2023-07-30 15:27:43 +02:00

.gitignore

feat: migrate python backends from conda to uv (#2215 )

2024-05-10 15:08:08 +02:00

.gitmodules

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

.yamllint

fix: yamlint warnings and errors (#2131 )

2024-04-25 17:25:56 +00:00

assets.go

feat: Update gpt4all, support multiple implementations in runtime (#472 )

2023-06-01 23:38:52 +02:00

CONTRIBUTING.md

Update CONTRIBUTING.md

2024-04-12 15:27:40 +02:00

docker-compose.yaml

fix(docker-compose): update docker compose file (#1824 )

2024-03-13 17:57:45 +01:00

Dockerfile

feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343 )

2024-05-20 19:17:59 +02:00

Dockerfile.aio

feat(aio): entrypoint, update workflows (#1872 )

2024-03-21 22:09:04 +01:00

Earthfile

Rename project to LocalAI (#35 )

2023-04-19 18:43:10 +02:00

Entitlements.plist

Feat: OSX Local Codesigning (#1319 )

2023-11-23 15:22:54 +01:00

entrypoint.sh

fix: use exec in entrypoint scripts to fix signal handling (#1943 )

2024-04-02 09:15:44 +02:00

go.mod

feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343 )

2024-05-20 19:17:59 +02:00

go.sum

feat(llama.cpp): Totally decentralized, private, distributed, p2p inference (#2343 )

2024-05-20 19:17:59 +02:00

LICENSE

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

main.go

fix: security scanner warning noise: error handlers part 1 (#2141 )

2024-04-26 10:34:31 +02:00

Makefile

build: add sha (#2356 )

2024-05-20 18:02:19 +02:00

README.md

Update README.md

2024-05-19 16:37:10 +02:00

renovate.json

ci: manually update deps

2023-05-04 15:01:29 +02:00

SECURITY.md

Create SECURITY.md

2024-02-29 19:53:04 +01:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models 🚀 Roadmap

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI (Elevenlabs, Anthropic... ) API specifications for local AI inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU. It is created and maintained by Ettore Di Giacinto.

docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu
# Alternative images:
# - if you have an Nvidia GPU:
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-aio-gpu-nvidia-cuda-12
# - without preconfigured models
# docker run -ti --name local-ai -p 8080:8080 localai/localai:latest
# - without preconfigured models for Nvidia GPUs
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-gpu-nvidia-cuda-12

💻 Getting started

🔥🔥 Hot topics / Roadmap

Roadmap

🔥🔥 Decentralized llama.cpp: https://github.com/mudler/LocalAI/pull/2343 (peer2peer llama.cpp!)
🔥🔥 Openvoice: https://github.com/mudler/LocalAI/pull/2334
🆕 Function calls without grammars and mixed mode: https://github.com/mudler/LocalAI/pull/2328
🔥🔥 Distributed inferencing: https://github.com/mudler/LocalAI/pull/2324
Chat, TTS, and Image generation in the WebUI: https://github.com/mudler/LocalAI/pull/2222
Reranker API: https://github.com/mudler/LocalAI/pull/2121

Hot topics (looking for contributors):

WebUI improvements: https://github.com/mudler/LocalAI/issues/2156
Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273
Moderation endpoint: https://github.com/mudler/LocalAI/issues/999
Vulkan: https://github.com/mudler/LocalAI/issues/1647

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI functions 🆕
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🥽 Vision API
🆕 Reranker API

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

Model galleries

https://github.com/go-skynet/model-gallery

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Terminal utility https://github.com/djcopley/ShellOracle
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

🆕 New! LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:


Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on lamdalabs!

And a huge shout-out to individuals sponsoring the project by donating hardware or backing the project.

Sponsor list
JDAM00 (donating HW for the CI)

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

Description

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Readme MIT 104 MiB

Languages

Go 88.3%

Python 3.2%

JavaScript 3%

HTML 2.6%

Makefile 1%

Other 1.8%

README.md Unescape Escape

LocalAI

🔥🔥 Hot topics / Roadmap

🚀 Features

💻 Usage

🔗 Community and integrations

🔗 Resources

📖 🎥 Media, Blogs, Social

Citation

❤️ Sponsors

🌟 Star history

📖 License

🙇 Acknowledgements

🤗 Contributors

README.md