mirror of
https://github.com/ParisNeo/lollms-webui.git
synced 2025-01-18 10:46:27 +00:00
Upgraded Code
This commit is contained in:
parent
5665482912
commit
7748b8368d
252
README.md
252
README.md
@ -8,257 +8,7 @@
|
||||
[![Follow me on Twitter](https://img.shields.io/twitter/follow/SpaceNerduino?style=social)](https://twitter.com/SpaceNerduino)
|
||||
[![Follow Me on YouTube](https://img.shields.io/badge/Follow%20Me%20on-YouTube-red?style=flat&logo=youtube)](https://www.youtube.com/user/Parisneo)
|
||||
|
||||
This is a Flask web application that provides a chat UI for interacting with [llamacpp](https://github.com/ggerganov/llama.cpp), gpt-j, gpt-q as well as Hugging face based language models uch as [GPT4all](https://github.com/nomic-ai/gpt4all), vicuna etc...
|
||||
|
||||
Follow us on our [Discord Server](https://discord.gg/4rR282WJb6).
|
||||
|
||||
Watch Install Video (Outdated, please use "New UI video") [Old Install Video](https://youtu.be/6kKv6ESnwMk)
|
||||
|
||||
Watch Usage Videos [Usage Video](https://youtu.be/DCBefhJUUh4)
|
||||
|
||||
Watch Settings Video [Settings Video](https://youtu.be/7KwR2vdt1t4)
|
||||
|
||||
Watch New UI Video [New UI + Install](https://youtu.be/M7NFajCyZKs)
|
||||
|
||||
![image](https://i.gyazo.com/ef94a5ac9169467a1aec228ef8c36c66.gif)
|
||||
|
||||
GPT4All is an exceptional language model, designed and developed by Nomic-AI, a proficient company dedicated to natural language processing. The app uses Nomic-AI's advanced library to communicate with the cutting-edge GPT4All model, which operates locally on the user's PC, ensuring seamless and efficient communication.
|
||||
|
||||
If you are interested in learning more about this groundbreaking project, visit their [Github Repository](https://github.com/nomic-ai/gpt4all), where you can find comprehensive information regarding the app's functionalities and technical details. Moreover, you can delve deeper into the training process and database by going through their detailed Technical report, available for download at [Technical report](https://s3.amazonaws.com/static.nomic.ai/gpt4all/2023_GPT4All_Technical_Report.pdf).
|
||||
|
||||
One of the app's impressive features is that it allows users to send messages to the chatbot and receive instantaneous responses in real time, ensuring a seamless user experience. Additionally, the app facilitates the exportation of the entire chat history in either text or JSON format, providing greater flexibility to the users.
|
||||
|
||||
It's worth noting that the model has recently been launched, and it's expected to evolve across time, enabling it to become even better in the future. This web UI is designed to provide the community with easy and fully localized access to a chatbot that will continue to improve and adapt across time.
|
||||
# Features
|
||||
|
||||
- Chat with locally hosted AI inside a web browser
|
||||
- Create, edit, and share your AI's personality
|
||||
- Audio in and audio out with many options for language and voices (only Chrome web browser is supported at this time)
|
||||
- History of discussion with resume functionality
|
||||
- Add new discussion, rename discussion, remove discussion
|
||||
- Export database to json format
|
||||
- Export discussion to text format
|
||||
|
||||
# Installation and running
|
||||
|
||||
Make sure that your CPU supports `AVX2` instruction set. Without it, this application won't run out of the box (for the pyllamacpp backend). To check your CPU features, please visit the website of your CPU manufacturer for more information and look for `Instruction set extension: AVX2`.
|
||||
> **Note**
|
||||
>
|
||||
>Default model `gpt4all-lora-quantized-ggml.bin` is roughly 4GB in size.
|
||||
|
||||
## Windows 10 and 11
|
||||
|
||||
### Automatic install
|
||||
|
||||
> **Note**
|
||||
>
|
||||
>It is mandatory to have python [3.10](https://www.python.org/downloads/release/python-31010/) (The official one, not the one from Microsoft Store) and [git](https://git-scm.com/download/win) installed.
|
||||
|
||||
1. [Go to the latest release section](https://github.com/nomic-ai/gpt4all-ui/releases)
|
||||
2. Download the `webui.bat` if you are on windows or `webui.sh` if you are on linux/mac. Put this file in a folder for example `/gpt4all-ui/`, because when you run it, all the necessary files will be downloaded into that folder.
|
||||
3. Run the script and wait. It should install everything and start the chatbot. Chatbot will be avaliable from web browser `http://localhost:9600`.
|
||||
> **Note**
|
||||
> During installtion, it may ask you to download a model. Feel free to accept or to download your own models depending on the backends you are using.
|
||||
|
||||
|
||||
Once installed, you can run the app by using `webui.bat` or `webui.sh`. The script will check for any new updates
|
||||
|
||||
[If you want to use a more advanced install procedure, please click here](docs/usage/AdvancedInstallInstructions.md)
|
||||
|
||||
## Docker Compose
|
||||
Make sure to put models the inside the `models` directory.
|
||||
After that, you can simply use docker-compose or podman-compose to build and start the application:
|
||||
|
||||
Build
|
||||
```bash
|
||||
docker compose -f docker-compose.yml build
|
||||
```
|
||||
|
||||
Start
|
||||
```bash
|
||||
docker compose -f docker-compose.yml up
|
||||
```
|
||||
|
||||
Stop ` Ctrl ` + ` C `
|
||||
|
||||
Start detached (runs in background)
|
||||
```bash
|
||||
docker compose -f docker-compose.yml up -d
|
||||
```
|
||||
|
||||
Stop detached (one that runs in background)
|
||||
```bash
|
||||
docker compose stop
|
||||
```
|
||||
|
||||
After that, you can open the application in your browser on http://localhost:9600
|
||||
|
||||
Now you're ready to work!
|
||||
|
||||
# Supported backends
|
||||
Two backends are now supported:
|
||||
|
||||
1- [The llama_cpp backend by Abdeladim](https://github.com/abdeladim-s/pyllamacpp)
|
||||
2- [The GPT-j backend by Abdeladim](https://github.com/abdeladim-s/pygptj)
|
||||
3- [The GPT-j backend by marella](https://github.com/marella/gpt4all-j)
|
||||
4- Hugging face's Transformers (under construction)
|
||||
|
||||
# Supported models
|
||||
You can also refuse to download the model during the install procedure and download it manually.
|
||||
|
||||
**For now, we support ggml models that work "out-of-the-box" (tested on Windows 11 and Ubuntu 22.04.2), such as:**
|
||||
|
||||
## LLama_cpp models
|
||||
- [GPT4ALL 7B](https://huggingface.co/ParisNeo/GPT4All/resolve/main/gpt4all-lora-quantized-ggml.bin) or visit [repository](https://huggingface.co/ParisNeo/GPT4All)
|
||||
- [GPT4ALL 7B unfiltered](https://huggingface.co/ParisNeo/GPT4All/resolve/main/gpt4all-lora-unfiltered-quantized.new.bin) or visit [repository](https://huggingface.co/ParisNeo/GPT4All)
|
||||
- [Vicuna 7B rev 1](https://huggingface.co/eachadea/legacy-ggml-vicuna-7b-4bit/resolve/main/ggml-vicuna-7b-4bit-rev1.bin) or visit [repository](https://huggingface.co/eachadea/legacy-ggml-vicuna-7b-4bit)
|
||||
- [Vicuna 13B rev 1](https://huggingface.co/eachadea/ggml-vicuna-13b-4bit/resolve/main/ggml-vicuna-13b-4bit-rev1.bin) or visit [repository](https://huggingface.co/eachadea/ggml-vicuna-13b-4bit)
|
||||
|
||||
- [ggml-gpt4all-j-v1.3-groovy](https://gpt4all.io/models/ggml-gpt4all-j-v1.3-groovy.bin)
|
||||
- [ggml-gpt4all-j-v1.2-jazzy](https://gpt4all.io/models/ggml-gpt4all-j-v1.2-jazzy.bin)
|
||||
- [ggml-gpt4all-l13b-snoozy](https://gpt4all.io/models/ggml-gpt4all-l13b-snoozy.bin)
|
||||
|
||||
- [ggml-gpt4all-j-v1.1-breezy](https://gpt4all.io/models/ggml-gpt4all-j-v1.1-breezy.bin)
|
||||
- [ggml-gpt4all-j](https://gpt4all.io/models/ggml-gpt4all-j.bin)
|
||||
- [ggml-vicuna-7b-1.1-q4_2](https://gpt4all.io/models/ggml-vicuna-7b-1.1-q4_2.bin)
|
||||
- [ggml-vicuna-13b-1.1-q4_2](https://gpt4all.io/models/ggml-vicuna-13b-1.1-q4_2.bin)
|
||||
|
||||
- [wizard-vicuna-13B.ggml.q4_2](https://huggingface.co/TheBloke/wizard-vicuna-13B-GGML/resolve/main/wizard-vicuna-13B.ggml.q4_2.bin)
|
||||
- [WizardLM-7B-uncensored.ggml.q4_0](https://huggingface.co/TheBloke/WizardLM-7B-uncensored-GGML/resolve/main/WizardLM-7B-uncensored.ggml.q4_0.bin)
|
||||
- [wizardLM-7B.ggml.q4_2.bin](https://huggingface.co/TheBloke/wizardLM-7B-GGML/resolve/main/wizardLM-7B.ggml.q4_2.bin)
|
||||
|
||||
|
||||
|
||||
We also support GPT-j models out of the box
|
||||
|
||||
## GPT-j models
|
||||
- [GPT-j 7B](https://gpt4all.io/models/ggml-gpt4all-j.bin)
|
||||
|
||||
|
||||
**These models don't work "out-of-the-box" and need to be converted to the right ggml type:**
|
||||
## LLAMACPP models
|
||||
- [Vicuna 7B](https://huggingface.co/eachadea/legacy-ggml-vicuna-7b-4bit/resolve/main/ggml-vicuna-7b-4bit.bin) or visit [repository](https://huggingface.co/eachadea/legacy-ggml-vicuna-7b-4bit)
|
||||
- [Vicuna 13B q4 v0](https://huggingface.co/eachadea/ggml-vicuna-13b-1.1/resolve/main/ggml-vicuna-13b-1.1-q4_0.bin) or visit [repository](https://huggingface.co/eachadea/ggml-vicuna-13b-1.1/)
|
||||
- [Vicuna 13B q4 v1](https://huggingface.co/eachadea/ggml-vicuna-13b-1.1/resolve/main/ggml-vicuna-13b-1.1-q4_1.bin) or visit [repository](https://huggingface.co/eachadea/ggml-vicuna-13b-1.1/)
|
||||
- [ALPACA 7B](https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml/resolve/main/ggml-alpaca-7b-q4.bin) or visit [repository](https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml/)
|
||||
|
||||
Just download the model into the `models/<backend name>` folder and start using the tool.
|
||||
# Personalities
|
||||
|
||||
You can find hundreds of personalities in my personal [Personalities repository](https://github.com/ParisNeo/PyAIPersonality). This new personalities format can be used for any third party applications, it builds a simple structure and format to define personalities. This format is evolutive and new fields and assets will be added in the future like personality voice or 3d animated character with prebaked motions that should allow AI to be more alive. The format is baked to support old versions while adding new capabilities for new versions making it ideal as a personality defintition format.
|
||||
|
||||
### How to Install Personalities from the Zoo
|
||||
|
||||
1. Navigate to the root directory of your repository.
|
||||
2. Run either `installations/add_personality.bat` or `installations/add_personality.sh`, depending on your operating system.
|
||||
3. Select the desired language, category, and personality from the provided options.
|
||||
4. The selected personality will be added to the list of available options.
|
||||
5. Choose the current personality:
|
||||
- Option 1: Use the UI by going to "Settings" and selecting "Personalities".
|
||||
- Option 2: Update the configuration file `configs/default_local.yaml` with the appropriate language, category, and personality name.
|
||||
|
||||
Note: Ensure that you have the necessary permissions and dependencies installed before performing the above steps.
|
||||
|
||||
```
|
||||
|
||||
Please don't forget to take time and give a Star if you like the project. This helps the visibility of the project.
|
||||
# Build custom personalities and share them
|
||||
|
||||
To build a new personality, create a new file with the name of the personality inside the `personalities` folder. You can look at `gpt4all` personality as an example. Then you can fill the fields with the description, conditionning, etc. of your personality. Add a logo to your personality (avatar). Then save the file. I personally use stable diffusion to generate the avatars.
|
||||
|
||||
|
||||
|
||||
You can launch the application using the personality in two ways:
|
||||
- Change it permanently by putting the name of the personality inside your configuration file
|
||||
- Use the `--personality` or `-p` option to give the personality name to be used
|
||||
|
||||
If you deem your personality worthy of sharing, you can share the it by adding it to the [GPT4all personalities](https://github.com/ParisNeo/GPT4All_Personalities) repository. Just fork the repo, add your file, and do a pull request.
|
||||
|
||||
# Advanced Usage
|
||||
|
||||
If you want more control on your launch, you can activate your environment:
|
||||
|
||||
On Windows:
|
||||
```cmd
|
||||
env/Scripts/activate.bat
|
||||
```
|
||||
|
||||
On Linux/MacOs:
|
||||
```cmd
|
||||
source venv/bin/activate
|
||||
```
|
||||
|
||||
Now you are ready to customize your Bot.
|
||||
|
||||
To run the Flask server, execute the following command:
|
||||
```bash
|
||||
python app.py [--config CONFIG] [--personality PERSONALITY] [--port PORT] [--host HOST] [--temp TEMP] [--n_threads N_THREADS] [--n_predict N_PREDICT] [--top_k TOP_K] [--top_p TOP_P] [--repeat_penalty REPEAT_PENALTY] [--repeat_last_n REPEAT_LAST_N] [--ctx_size CTX_SIZE]
|
||||
```
|
||||
|
||||
On Linux/MacOS more details can be found [here](docs/usage/Linux_Osx_Usage.md)
|
||||
|
||||
## Options
|
||||
* `--config`: the configuration file to be used. It contains default configurations. The script parameters will override the configurations inside the configuration file. It must be placed in configs folder (default: default.yaml)
|
||||
* `--personality`: the personality file name. It contains the definition of the pezrsonality of the chatbot and should be placed in personalities folder. The default personality is `gpt4all_chatbot.yaml`
|
||||
* `--model`: the name of the model to be used. The model should be placed in models folder (default: gpt4all-lora-quantized.bin)
|
||||
* `--seed`: the random seed for reproductibility. If fixed, it is possible to reproduce the outputs exactly (default: random)
|
||||
* `--port`: the port on which to run the server (default: 9600)
|
||||
* `--host`: the host address at which to run the server (default: localhost). To expose application to local network, set this to 0.0.0.0.
|
||||
* `--temp`: the sampling temperature for the model (default: 0.1)
|
||||
* `--n_threads`: the number of threads to be used (default:8)
|
||||
* `--n-predict`: the number of tokens to predict at a time (default: 128)
|
||||
* `--top-k`: the number of top-k candidates to consider for sampling (default: 40)
|
||||
* `--top-p`: the cumulative probability threshold for top-p sampling (default: 0.90)
|
||||
* `--repeat-penalty`: the penalty to apply for repeated n-grams (default: 1.3)
|
||||
* `--repeat-last-n`: the number of tokens to use for detecting repeated n-grams (default: 64)
|
||||
* `--ctx-size`: the maximum context size to use for generating responses (default: 2048)
|
||||
|
||||
Note: All options are optional and have default values.
|
||||
|
||||
Once the server is running, open your web browser and navigate to http://localhost:9600 (or http://your-host-name:your-port-number if you have selected different values for those) to access the chatbot UI. To use the app, open a web browser and navigate to this URL.
|
||||
|
||||
Make sure to adjust the default values and descriptions of the options to match your specific application.
|
||||
|
||||
# Contribute
|
||||
|
||||
This is an open-source project by the community and for the community. Our chatbot is a UI wrapper for Nomic AI's model, which enables natural language processing and machine learning capabilities.
|
||||
|
||||
We welcome contributions from anyone who is interested in improving our chatbot. Whether you want to report a bug, suggest a feature, or submit a pull request, we encourage you to get involved and help us make our chatbot even better.
|
||||
|
||||
Before contributing, please take a moment to review our [code of conduct](./CODE_OF_CONDUCT.md). We expect all contributors to abide by this code of conduct, which outlines our expectations for respectful communication, collaborative development, and innovative contributions.
|
||||
|
||||
### Reporting Bugs
|
||||
|
||||
If you find a bug or other issue with our chatbot, please report it by [opening an issue](https://github.com/nomic-ai/gpt4all-ui/issues/new). Be sure to provide as much detail as possible, including steps to reproduce the issue and any relevant error messages.
|
||||
|
||||
### Suggesting Features
|
||||
|
||||
If you have an idea for a new feature or improvement to our chatbot, we encourage you to [open an issue](https://github.com/nomic-ai/gpt4all-ui/issues/new) to discuss it. We welcome feedback and ideas from the community and will consider all suggestions that align with our project goals.
|
||||
|
||||
### Contributing Code
|
||||
|
||||
If you want to contribute code to our chatbot, please follow these steps:
|
||||
|
||||
1. Fork the repository and create a new branch for your changes.
|
||||
2. Make your changes and ensure that they follow our coding conventions.
|
||||
3. Test your changes to ensure that they work as expected.
|
||||
4. Submit a pull request with a clear description of your changes and the problem they solve.
|
||||
|
||||
We will review your pull request as soon as possible and provide feedback on any necessary changes. We appreciate your contributions and look forward to working with you!
|
||||
|
||||
Please note that all contributions are subject to review and approval by our project maintainers. We reserve the right to reject any contribution that does not align with our project goals or standards.
|
||||
|
||||
# Future Plans
|
||||
|
||||
Here are some of the future plans for this project:
|
||||
|
||||
**Enhanced control of chatbot parameters:** We plan to improve the UI of the chatbot to allow users to control the parameters of the chatbot such as temperature and other variables. This will give users more control over the chatbot's responses, and allow for a more customized experience.
|
||||
|
||||
**Extension system for plugins:** We are also working on an extension system that will allow developers to create plugins for the chatbot. These plugins will be able to add new features and capabilities to the chatbot, and allow for greater customization of the chatbot's behavior.
|
||||
|
||||
**Enhanced UI with themes and skins:** Additionally, we plan to enhance the UI of the chatbot to allow for themes and skins. This will allow users to personalize the appearance of the chatbot and make it more visually appealing.
|
||||
|
||||
We are excited about these future plans for the project and look forward to implementing them in the near future. Stay tuned for updates!
|
||||
# The new version is beeing worked on. Once it is ready, you'll have an uptodate README and a video that explain it all.
|
||||
|
||||
# License
|
||||
|
||||
|
@ -19,7 +19,7 @@ __copyright__ = "Copyright 2023, "
|
||||
__license__ = "Apache 2.0"
|
||||
|
||||
|
||||
class GPTBackend:
|
||||
class LLMBackend:
|
||||
file_extension='*.bin'
|
||||
backend_path = Path(__file__).parent
|
||||
def __init__(self, config:dict, inline:bool) -> None:
|
||||
@ -72,7 +72,7 @@ class GPTBackend:
|
||||
"""Lists the models for this backend
|
||||
"""
|
||||
models_dir = Path('./models')/config["backend"] # replace with the actual path to the models folder
|
||||
return [f.name for f in models_dir.glob(GPTBackend.file_extension)]
|
||||
return [f.name for f in models_dir.glob(LLMBackend.file_extension)]
|
||||
@staticmethod
|
||||
def get_available_models():
|
||||
# Create the file path relative to the child class's directory
|
||||
|
144
backends/backend_template/__init__.py
Normal file
144
backends/backend_template/__init__.py
Normal file
@ -0,0 +1,144 @@
|
||||
######
|
||||
# Project : GPT4ALL-UI
|
||||
# File : backend.py
|
||||
# Author : ParisNeo with the help of the community
|
||||
# Underlying backend : Abdeladim's pygptj backend
|
||||
# Supported by Nomic-AI
|
||||
# license : Apache 2.0
|
||||
# Description :
|
||||
# This is an interface class for GPT4All-ui backends.
|
||||
|
||||
# This backend is a wrapper to marella's backend
|
||||
# Follow him on his github project : https://github.com/marella/ctransformers
|
||||
|
||||
######
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
from ctransformers import AutoModelForCausalLM
|
||||
|
||||
__author__ = "parisneo"
|
||||
__github__ = "https://github.com/nomic-ai/gpt4all-ui"
|
||||
__copyright__ = "Copyright 2023, "
|
||||
__license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GPTJ"
|
||||
|
||||
class GPTJ(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
||||
Args:
|
||||
config (dict): The configuration file
|
||||
"""
|
||||
super().__init__(config, False)
|
||||
if 'gpt2' in self.config['model']:
|
||||
model_type='gpt2'
|
||||
elif 'gptj' in self.config['model']:
|
||||
model_type='gptj'
|
||||
elif 'gpt_neox' in self.config['model']:
|
||||
model_type='gpt_neox'
|
||||
elif 'dolly-v2' in self.config['model']:
|
||||
model_type='dolly-v2'
|
||||
elif 'starcoder' in self.config['model']:
|
||||
model_type='starcoder'
|
||||
elif 'llama' in self.config['model'] or 'wizardLM' in self.config['model']:
|
||||
model_type='llama'
|
||||
elif 'mpt' in self.config['model']:
|
||||
model_type='mpt'
|
||||
else:
|
||||
print("The model you are using is not supported by this backend")
|
||||
return
|
||||
|
||||
|
||||
if self.config["use_avx2"]:
|
||||
self.model = AutoModelForCausalLM.from_pretrained(
|
||||
f"./models/c_transformers/{self.config['model']}", model_type=model_type
|
||||
)
|
||||
else:
|
||||
self.model = AutoModelForCausalLM.from_pretrained(
|
||||
f"./models/c_transformers/{self.config['model']}", model_type=model_type, lib = "avx"
|
||||
)
|
||||
|
||||
def tokenize(self, prompt):
|
||||
"""
|
||||
Tokenizes the given prompt using the model's tokenizer.
|
||||
|
||||
Args:
|
||||
prompt (str): The input prompt to be tokenized.
|
||||
|
||||
Returns:
|
||||
list: A list of tokens representing the tokenized prompt.
|
||||
"""
|
||||
return self.model.tokenize(prompt.encode())
|
||||
|
||||
def detokenize(self, tokens_list):
|
||||
"""
|
||||
Detokenizes the given list of tokens using the model's tokenizer.
|
||||
|
||||
Args:
|
||||
tokens_list (list): A list of tokens to be detokenized.
|
||||
|
||||
Returns:
|
||||
str: The detokenized text as a string.
|
||||
"""
|
||||
return self.model.detokenize(tokens_list)
|
||||
|
||||
def generate(self,
|
||||
prompt:str,
|
||||
n_predict: int = 128,
|
||||
new_text_callback: Callable[[str], None] = bool,
|
||||
verbose: bool = False,
|
||||
**gpt_params ):
|
||||
"""Generates text out of a prompt
|
||||
|
||||
Args:
|
||||
prompt (str): The prompt to use for generation
|
||||
n_predict (int, optional): Number of tokens to prodict. Defaults to 128.
|
||||
new_text_callback (Callable[[str], None], optional): A callback function that is called everytime a new text element is generated. Defaults to None.
|
||||
verbose (bool, optional): If true, the code will spit many informations about the generation process. Defaults to False.
|
||||
"""
|
||||
try:
|
||||
output = ""
|
||||
self.model.reset()
|
||||
tokens = self.model.tokenize(prompt)
|
||||
count = 0
|
||||
for tok in self.model.generate(
|
||||
tokens,
|
||||
top_k=self.config['top_k'],
|
||||
top_p=self.config['top_p'],
|
||||
temperature=self.config['temperature'],
|
||||
repetition_penalty=self.config['repeat_penalty'],
|
||||
seed=self.config['seed'],
|
||||
batch_size=1,
|
||||
threads = self.config['n_threads'],
|
||||
reset=True,
|
||||
):
|
||||
|
||||
|
||||
if count >= n_predict or self.model.is_eos_token(tok):
|
||||
break
|
||||
word = self.model.detokenize(tok)
|
||||
if new_text_callback is not None:
|
||||
if not new_text_callback(word):
|
||||
break
|
||||
output += word
|
||||
count += 1
|
||||
|
||||
|
||||
except Exception as ex:
|
||||
print(ex)
|
||||
return output
|
||||
|
||||
@staticmethod
|
||||
def get_available_models():
|
||||
# Create the file path relative to the child class's directory
|
||||
backend_path = Path(__file__).parent
|
||||
file_path = backend_path/"models.yaml"
|
||||
|
||||
with open(file_path, 'r') as file:
|
||||
yaml_data = yaml.safe_load(file)
|
||||
|
||||
return yaml_data
|
45
backends/backend_template/install.py
Normal file
45
backends/backend_template/install.py
Normal file
@ -0,0 +1,45 @@
|
||||
import subprocess
|
||||
from pathlib import Path
|
||||
import requests
|
||||
from tqdm import tqdm
|
||||
|
||||
class Install:
|
||||
def __init__(self, api):
|
||||
# Get the current directory
|
||||
current_dir = Path(__file__).resolve().parent
|
||||
install_file = current_dir / ".installed"
|
||||
|
||||
if not install_file.exists():
|
||||
print("-------------- Template backend -------------------------------")
|
||||
print("This is the first time you are using this backend.")
|
||||
print("Installing ...")
|
||||
"""
|
||||
try:
|
||||
print("Checking pytorch")
|
||||
import torch
|
||||
import torchvision
|
||||
if torch.cuda.is_available():
|
||||
print("CUDA is supported.")
|
||||
else:
|
||||
print("CUDA is not supported. Reinstalling PyTorch with CUDA support.")
|
||||
self.reinstall_pytorch_with_cuda()
|
||||
except Exception as ex:
|
||||
self.reinstall_pytorch_with_cuda()
|
||||
"""
|
||||
|
||||
# Step 2: Install dependencies using pip from requirements.txt
|
||||
requirements_file = current_dir / "requirements.txt"
|
||||
subprocess.run(["pip", "install", "--upgrade", "--no-cache-dir", "-r", str(requirements_file)])
|
||||
|
||||
# Create ther models folder
|
||||
models_folder = Path("./models/c_transformers")
|
||||
models_folder.mkdir(exist_ok=True, parents=True)
|
||||
|
||||
#Create the install file
|
||||
with open(install_file,"w") as f:
|
||||
f.write("ok")
|
||||
print("Installed successfully")
|
||||
|
||||
def reinstall_pytorch_with_cuda(self):
|
||||
subprocess.run(["pip", "install", "torch", "torchvision", "torchaudio", "--no-cache-dir", "--index-url", "https://download.pytorch.org/whl/cu117"])
|
||||
|
52
backends/backend_template/models.yaml
Normal file
52
backends/backend_template/models.yaml
Normal file
@ -0,0 +1,52 @@
|
||||
- LLAMA: 'true'
|
||||
description: GGML format model files for the original LLaMa
|
||||
filename: llama-7b.ggmlv3.q4_0.bin
|
||||
license: Non commercial
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/LLaMa-7B-GGML/resolve/main/
|
||||
sha256: ec2f2d1f0dfb73b72a4cbac7fa121abbe04c37ab327125a38248f930c0f09ddf
|
||||
- LLAMA: 'true'
|
||||
description: GGML format model files for Wizard LM 7B model
|
||||
filename: wizardLM-7B.ggmlv3.q4_0.bin
|
||||
license: Non commercial
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/wizardLM-7B-GGML/resolve/main/
|
||||
sha256: ea35e30a7c140485b856d0919284ce59e4ca47c1b8af037ea8b7ba05ef291c43
|
||||
- LLAMA: 'true'
|
||||
description: GGML format model files for Wizard LM 7B model
|
||||
filename: koala-7b.ggml.unquantized.pr613.bin
|
||||
license: Non commercial
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/koala-7b-ggml-unquantized/resolve/main/
|
||||
sha256: c478ceced3b38800cb768225b1e759a32c9e89bd33606fb38eeff3b811e28371
|
||||
|
||||
|
||||
|
||||
|
||||
- MPT-7B: 'true'
|
||||
description: MPT-7B
|
||||
filename: mpt-7b.ggmlv3.q5_1.bin
|
||||
license: Apache-2.0
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/MPT-7B-GGML/resolve/main/
|
||||
sha256: c947c38405921a199c603fed2ed63486811777ba370bb51c40c3132e5cfe84e5
|
||||
- MPT-7B-Instruct: 'true'
|
||||
description: MPT-7B-Instruct
|
||||
filename: mpt-7b-instruct.ggmlv3.q5_1.bin
|
||||
license: Apache-2.0
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/MPT-7B-Instruct-GGML/resolve/main/
|
||||
sha256: a4d17a39ac277d48a3d55aa74b36a4e6e1b891b58937a838243fad549d26c686
|
||||
- MPT-7B-Storywriter: 'true'
|
||||
description: MPT-7B-Storywriter
|
||||
filename: mpt-7b-storywriter.ggmlv3.q5_1.bin
|
||||
license: Apache-2.0
|
||||
owner_link: https://huggingface.co/TheBloke/
|
||||
owner: TheBloke
|
||||
server: https://huggingface.co/TheBloke/MPT-7B-Storywriter-GGML/resolve/main/
|
||||
sha256: 3b7dd7aa7508cc8cb4e262fe4b93214826f38d18d04059075e05837457f5402
|
3
backends/backend_template/requirements.txt
Normal file
3
backends/backend_template/requirements.txt
Normal file
@ -0,0 +1,3 @@
|
||||
ctransformers
|
||||
transformers
|
||||
accelerate
|
@ -14,7 +14,7 @@
|
||||
######
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
from ctransformers import AutoModelForCausalLM
|
||||
|
||||
@ -23,9 +23,9 @@ __github__ = "https://github.com/nomic-ai/gpt4all-ui"
|
||||
__copyright__ = "Copyright 2023, "
|
||||
__license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GPTJ"
|
||||
backend_name = "CTRansformers"
|
||||
|
||||
class GPTJ(GPTBackend):
|
||||
class CTRansformers(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
@ -15,7 +15,7 @@
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from gpt4all import GPT4All
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
|
||||
__author__ = "parisneo"
|
||||
@ -25,7 +25,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GPT4ALL"
|
||||
|
||||
class GPT4ALL(GPTBackend):
|
||||
class GPT4ALL(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
|
||||
def __init__(self, config:dict) -> None:
|
||||
|
@ -15,7 +15,7 @@
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from pygptj.model import Model
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
|
||||
__author__ = "parisneo"
|
||||
__github__ = "https://github.com/nomic-ai/gpt4all-ui"
|
||||
@ -24,7 +24,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GptJ"
|
||||
|
||||
class GptJ(GPTBackend):
|
||||
class GptJ(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
@ -15,7 +15,7 @@
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from gpt4allj import Model
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
|
||||
__author__ = "parisneo"
|
||||
@ -25,7 +25,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GPTJ"
|
||||
|
||||
class GPTJ(GPTBackend):
|
||||
class GPTJ(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
@ -11,7 +11,7 @@ from pathlib import Path
|
||||
from typing import Callable
|
||||
from transformers import AutoTokenizer, TextGenerationPipeline
|
||||
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import torch
|
||||
import yaml
|
||||
|
||||
@ -22,7 +22,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "GPTQ"
|
||||
|
||||
class GPTQ(GPTBackend):
|
||||
class GPTQ(LLMBackend):
|
||||
file_extension='*'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a GPTQ backend
|
||||
|
@ -14,7 +14,7 @@
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from llama_cpp import Llama
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
import random
|
||||
|
||||
@ -25,7 +25,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "LLAMACPP"
|
||||
|
||||
class LLAMACPP(GPTBackend):
|
||||
class LLAMACPP(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
@ -14,7 +14,7 @@
|
||||
from pathlib import Path
|
||||
from typing import Callable
|
||||
from pyllamacpp.model import Model
|
||||
from api.backend import GPTBackend
|
||||
from api.backend import LLMBackend
|
||||
import yaml
|
||||
|
||||
__author__ = "parisneo"
|
||||
@ -24,7 +24,7 @@ __license__ = "Apache 2.0"
|
||||
|
||||
backend_name = "PyLLAMACPP"
|
||||
|
||||
class PyLLAMACPP(GPTBackend):
|
||||
class PyLLAMACPP(LLMBackend):
|
||||
file_extension='*.bin'
|
||||
def __init__(self, config:dict) -> None:
|
||||
"""Builds a LLAMACPP backend
|
||||
|
Loading…
Reference in New Issue
Block a user