mirror of https://github.com/mudler/LocalAI.git synced 2025-05-29 05:24:15 +00:00


fix(utf8): prevent multi-byte utf8 characters from being mangled (#981)

**Description**

This PR fixes #677 using the [suggested solution](https://github.com/go-skynet/LocalAI/issues/677#issuecomment-1695939097) from @yantoz.

before:
```
❯ curl -N http://localhost:57541/v1/completions -H "Content-Type: application/json" -d '{
     "model": "ggml-model-q4_0.bin",
     "prompt": "",
     "max_tokens": 32,
     "temperature": 0.7,
     "stream": true
   }'
data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":" |"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":" I"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"'"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"m"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}
```

now:
```
❯ curl -N http://localhost:57541/v1/completions -H "Content-Type: application/json" -d '{
     "model": "ggml-model-q4_0.bin",
     "prompt": "",
     "max_tokens": 32,
     "temperature": 0.7,
     "stream": true
   }'
data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"😂"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":" "}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"|"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":" "}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"I"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"'"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"m"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}
```
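The four `\ufffd` chunks in the "before" output are what you would expect if each byte of a 4-byte UTF-8 sequence were emitted as its own stream event. The patch itself is not shown on this page, so the following is only a minimal sketch of the general technique (hold back incomplete byte sequences until the rune is complete), not the code merged in this PR. The `runeBuffer` type and its `Push` method are hypothetical names introduced for illustration.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// runeBuffer (hypothetical) accumulates raw token bytes and releases
// only complete UTF-8 sequences.
type runeBuffer struct {
	pending []byte
}

// Push appends chunk and returns the longest prefix made of complete runes.
// An incomplete trailing sequence stays buffered until a later call.
// Bytes that can never form a valid rune are passed through unchanged,
// because utf8.FullRune treats an invalid encoding as a full width-1 rune.
func (b *runeBuffer) Push(chunk []byte) string {
	b.pending = append(b.pending, chunk...)
	n := 0
	for n < len(b.pending) && utf8.FullRune(b.pending[n:]) {
		_, size := utf8.DecodeRune(b.pending[n:])
		n += size
	}
	out := string(b.pending[:n])
	// Keep only the unfinished tail for the next call.
	b.pending = append(b.pending[:0], b.pending[n:]...)
	return out
}

func main() {
	var buf runeBuffer
	// U+1F602 (😂) encoded as UTF-8 and split into one byte per token,
	// followed by a plain ASCII token.
	chunks := [][]byte{{0xF0}, {0x9F}, {0x98}, {0x82}, []byte(" |")}
	for _, c := range chunks {
		if s := buf.Push(c); s != "" {
			fmt.Printf("%q\n", s) // only complete runes reach the client
		}
	}
}
```

With this approach the first three pushes produce nothing, and the emoji is released as a single chunk once its last byte arrives.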

**Notes for Reviewers**


**[Signed commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [X] Yes, I signed my commits.
 


Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

2023-08-30 23:56:59 +00:00

| Name | Last commit | Date |
| --- | --- | --- |
| .github | feat: bump llama.cpp, add gguf support (#943) | 2023-08-24 01:18:58 +02:00 |
| .vscode | feat: Add more test-cases and remove dev container (#433) | 2023-05-30 13:01:55 +02:00 |
| api | fix(utf8): prevent multi-byte utf8 characters from being mangled (#981) | 2023-08-30 23:56:59 +00:00 |
| cmd/grpc | feat: add llama-stable backend (#932) | 2023-08-20 16:35:42 +02:00 |
| examples | initial draft of an importable Insomnia profile for developers (#942) | 2023-08-23 18:39:27 +02:00 |
| extra | fix(diffusers): correctly check alpha (#967) | 2023-08-27 15:35:59 +02:00 |
| internal | feat: cleanups, small enhancements | 2023-07-04 18:58:19 +02:00 |
| models | Add docker-compose | 2023-04-13 01:13:14 +02:00 |
| pkg | fix(llama): resolve lora adapters correctly from the model file (#964) | 2023-08-27 10:11:32 +02:00 |
| prompt-templates | feat(llama2): add template for chat messages (#782) | 2023-07-22 11:31:39 -04:00 |
| tests | feat(llama2): add template for chat messages (#782) | 2023-07-22 11:31:39 -04:00 |
| .dockerignore | Remove .git from .dockerignore | 2023-07-06 21:25:10 +02:00 |
| .env | docs: base-Update comments in .env for cublas, openblas, clblas (#867) | 2023-08-07 08:22:42 +00:00 |
| .gitattributes | Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838) | 2023-07-30 15:27:43 +02:00 |
| .gitignore | Feat: rwkv improvements: (#937) | 2023-08-22 18:48:06 +02:00 |
| assets.go | feat: Update gpt4all, support multiple implementations in runtime (#472) | 2023-06-01 23:38:52 +02:00 |
| docker-compose.yaml | images: cleanup, drop .dev Dockerfile (#437) | 2023-05-30 15:58:10 +02:00 |
| Dockerfile | feat: bump llama.cpp, add gguf support (#943) | 2023-08-24 01:18:58 +02:00 |
| Earthfile | Rename project to LocalAI (#35) | 2023-04-19 18:43:10 +02:00 |
| entrypoint.sh | Added CPU information to entrypoint.sh (#794) | 2023-07-23 19:27:55 +00:00 |
| go.mod | fix(deps): update github.com/go-skynet/go-llama.cpp digest to bf3f946 (#979) | 2023-08-30 23:02:19 +02:00 |
| go.sum | fix(deps): update github.com/go-skynet/go-llama.cpp digest to bf3f946 (#979) | 2023-08-30 23:02:19 +02:00 |
| LICENSE | docs: update docs/license(clarification) and point to new website (#415) | 2023-05-29 23:09:19 +02:00 |
| main.go | feat: add --single-active-backend to allow only one backend active at the time (#925) | 2023-08-19 01:49:33 +02:00 |
| Makefile | fix(deps): update go-llama.cpp (#980) | 2023-08-30 23:01:55 +02:00 |
| README.md | readme: link to hot topics in the website | 2023-08-07 00:31:46 +02:00 |
| renovate.json | ci: manually update deps | 2023-05-04 15:01:29 +02:00 |

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models

LocalAI is a drop-in replacement REST API that is compatible with the OpenAI API specifications for local inferencing. It lets you run LLMs (and other kinds of models) locally or on-prem on consumer-grade hardware, and it supports multiple model families that are compatible with the ggml format. It does not require a GPU.


In a nutshell:

- Local, OpenAI drop-in alternative REST API. You own your data.
- NO GPU required. NO Internet access is required either.
  - Optional GPU acceleration is available for llama.cpp-compatible LLMs. See also the build section.
- Supports multiple models.
- 🏃 Once loaded the first time, it keeps models in memory for faster inference.
- ⚡ Doesn't shell out; it uses C++ bindings for faster inference and better performance.

LocalAI was created by Ettore Di Giacinto and is a community-driven project focused on making AI accessible to anyone. Contributions, feedback, and PRs are welcome!

Note that this started as a fun weekend project to try to build the necessary pieces for a full AI assistant like ChatGPT. The community is growing fast, and we are working hard to make it better and more stable. If you want to help, please consider contributing (see below)!

🔥🔥 Hot topics / Roadmap

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI functions 🆕
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface

💻 Usage

Check out the Getting started section in our documentation.

💡 Example: Use GPT4ALL-J model

See the documentation

🔗 Resources

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:


Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on Lambda Labs!

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project; special thanks to our contributors! 🤗

Description

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Readme MIT 95 MiB

Languages

Go 88.6%

Python 3.1%

JavaScript 2.8%

HTML 2.5%

Makefile 1%

Other 1.8%
