LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-05-31 22:40:45 +00:00

Author	SHA1	Message	Date
Ettore Di Giacinto	a28ab18987	feat(vllm): Allow to set quantization (#1094) This is particularly useful to set AWQ. Description: Follow-up of #1015. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-22 15:52:38 +02:00
Ettore Di Giacinto	bdf3f95346	feat(python-grpc): allow to set max workers with PYTHON_GRPC_MAX_WORKERS (#1081) Description: This allows customizing the maximum number of gRPC workers for Python backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-19 21:30:39 +02:00
Ettore Di Giacinto	453e9c5da9	fix(vllm): set default top_p with vllm (#1078) Description: This PR fixes vllm when called with a request with an empty top_p. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-19 18:10:23 +02:00
Ettore Di Giacinto	8ccf5b2044	feat(speculative-sampling): allow to specify a draft model in the model config (#1052) Description: This PR fixes #1013. It adds `draft_model` and `n_draft` to the model YAML config in order to load models with speculative sampling. This should also be compatible with grammars. Example: ```yaml backend: llama context_size: 1024 name: my-model-name parameters: model: foo-bar n_draft: 16 draft_model: model-name ``` Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-14 17:44:16 +02:00
Ettore Di Giacinto	c0bb5c4bf6	feat(vllm): Initial vllm backend implementation Related to: https://github.com/go-skynet/LocalAI/issues/1015 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-09 17:03:23 +02:00
Ettore Di Giacinto	ee59e7d45f	fix(vall-e-x): make audiopath relative to models (#1012) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-05 19:33:36 +02:00
Ettore Di Giacinto	605c319157	feat(diffusers): don't set seed in params and respect device (#1010) Description: Follow-up of #998. Respect the device used to load the model, and do not specify a seed in the parameters; instead, configure the generator as described in https://huggingface.co/docs/diffusers/using-diffusers/reusing_seeds Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-04 19:38:38 +02:00
Ettore Di Giacinto	dc307a1cc0	feat: add vall-e-x (#1007) Description: This PR fixes #985. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-04 19:25:23 +02:00
Max Cohen	f9d2bd24eb	Allow to manually set the seed for the SD pipeline (#998) Description: Enable setting the seed for the stable diffusion pipeline. This is done through an additional `seed` parameter in the request, such as: ```bash curl http://localhost:8080/v1/images/generations \ -H "Content-Type: application/json" \ -d '{"model": "stablediffusion", "prompt": "prompt", "n": 1, "step": 51, "size": "512x512", "seed": 3}' ``` Notes for Reviewers: When the `seed` parameter is not sent, `request.seed` defaults to `0`, making it difficult to detect an actual seed of `0`. Is there a way to change the default to `-1`, for instance?	2023-09-04 19:10:55 +02:00
Ettore Di Giacinto	158c7867e7	fix(diffusers): correctly check alpha (#967) Description: Loras that have no alpha would crash otherwise. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-27 15:35:59 +02:00
Ettore Di Giacinto	02704e38d3	feat(diffusers): Add lora (#965) Description: This PR fixes #914. Now diffusers respects the `lora_adapter` configuration parameter. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-08-27 10:11:16 +02:00
Ettore Di Giacinto	44bc7aa3d0	feat: Allow to load lora adapters for llama.cpp (#955) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-25 21:58:46 +02:00
Ettore Di Giacinto	afdc0ebfd7	feat: add --single-active-backend to allow only one backend active at the time (#925) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-19 01:49:33 +02:00
Ettore Di Giacinto	1079b18ff7	feat(diffusers): be consistent with pipelines, support also depthimg2img (#926) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-18 22:06:24 +02:00
Ettore Di Giacinto	2bacd0180d	feat(diffusers): add img2img and clip_skip, support more kernels schedulers (#906) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-17 23:38:59 +02:00
Ettore Di Giacinto	ede71d398c	feat(diffusers): overcome prompt limit (#904) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-16 22:24:52 +02:00
Ettore Di Giacinto	37700f2d98	feat(diffusers): add DPMSolverMultistepScheduler++, DPMSolverMultistepSchedulerSDE++, guidance_scale (#903) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-16 01:11:42 +02:00
Ettore Di Giacinto	a96c3bc885	feat(diffusers): various enhancements (#895)	2023-08-14 23:12:00 +02:00
Ettore Di Giacinto	ff3ab5fcca	feat: Add exllama (#881) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-11 00:49:40 +02:00
Ettore Di Giacinto	8c781a6a44	feat: Add Diffusers (#874) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-09 08:38:51 +02:00
Ettore Di Giacinto	219751bb21	fix: cut prompt from AutoGPTQ answers Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-08 01:27:38 +02:00
Ettore Di Giacinto	bb7772a364	fix: byte utf-8 encode results from autogptq Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-08 01:20:07 +02:00
Ettore Di Giacinto	3c8fc37c56	feat: Add UseFastTokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-08 01:10:05 +02:00
Ettore Di Giacinto	b09bae3443	fix: autogptq requirements Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-08 00:22:15 +02:00
Ettore Di Giacinto	433605e282	feat: add initial Bark backend implementation Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-07 22:53:28 +02:00
Ettore Di Giacinto	a843e64fc2	feat: add initial AutoGPTQ backend implementation	2023-08-07 22:53:28 +02:00
Ettore Di Giacinto	5ca21ee398	feat: add ngqa and RMSNormEps parameters (#860) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-03 00:51:08 +02:00
Ettore Di Giacinto	096d98c3d9	fix: add rope settings during model load, fix CUDA (#821) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-27 21:56:05 +02:00
Ettore Di Giacinto	b96e30e66c	fix: use bytes in gRPC proto instead of strings (#813) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-27 18:41:04 +02:00
Ettore Di Giacinto	569c1d1163	feat: add rope settings and negative prompt, drop grammar backend (#797) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-25 19:05:27 +02:00
Ettore Di Giacinto	982a7e86a8	feat: add huggingface embeddings backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-20 22:10:42 +02:00

31 Commits
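
Commit bdf3f95346 above makes the number of gRPC workers for Python backends configurable through `PYTHON_GRPC_MAX_WORKERS`. The following is a minimal sketch of how such an environment variable might be read and applied; it is not taken from the commit itself. The helper names `max_workers_from_env` and `make_executor`, the default of 1, and the fallback on invalid values are all assumptions made for illustration.

```python
import os
from concurrent.futures import ThreadPoolExecutor

# Variable name as given in the commit message.
ENV_VAR = "PYTHON_GRPC_MAX_WORKERS"


def max_workers_from_env(env=None, default=1):
    """Return the worker count from ENV_VAR, or `default` if unset or invalid.

    Hypothetical helper: the default of 1 and the silent fallback on
    non-numeric or non-positive values are assumptions, not LocalAI behavior.
    """
    env = os.environ if env is None else env
    raw = env.get(ENV_VAR, "")
    try:
        value = int(raw)
    except ValueError:
        return default
    return value if value > 0 else default


def make_executor(env=None):
    """Build the thread pool that a gRPC Python server would run on.

    With grpcio installed, such a pool is typically passed as
    grpc.server(ThreadPoolExecutor(max_workers=...)).
    """
    return ThreadPoolExecutor(max_workers=max_workers_from_env(env))
```

Passing a plain dictionary instead of `os.environ` keeps the helper easy to exercise without touching the real process environment.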