ecoute/README.md

78 lines
2.8 KiB
Markdown
Raw Normal View History

2023-05-14 00:33:48 +00:00
# 🎧 Ecoute
2023-05-14 00:36:54 +00:00
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.
2023-05-14 00:33:48 +00:00
2023-05-16 23:53:22 +00:00
## 📖 Demo
2023-05-14 00:33:48 +00:00
2023-05-18 02:27:13 +00:00
https://github.com/SevaSk/ecoute/assets/50382291/8ac48927-8a26-49fd-80e9-48f980986208
2023-05-16 23:53:22 +00:00
Ecoute is designed to help users in their conversations by providing live transcriptions and generating contextually relevant responses. By leveraging the power of OpenAI's GPT-3.5, Ecoute aims to make communication more efficient and enjoyable.
2023-05-14 00:33:48 +00:00
2023-05-14 00:30:19 +00:00
## 🚀 Getting Started
2023-05-12 02:08:52 +00:00
2023-05-17 00:13:12 +00:00
Follow these steps to set up and run Ecoute on your local machine.
2023-05-13 01:18:44 +00:00
2023-05-14 00:30:19 +00:00
### 📋 Prerequisites
2023-05-13 01:18:58 +00:00
2023-05-14 00:30:19 +00:00
- Python 3.x
- An OpenAI API key
2023-05-17 00:13:12 +00:00
- Windows OS (Not tested on others)
2023-05-21 01:00:21 +00:00
- FFmpeg
2023-05-13 19:49:49 +00:00
2023-05-14 00:30:19 +00:00
### 🔧 Installation
1. Clone the repository:
```
git clone https://github.com/SevaSk/ecoute
```
2. Navigate to the `ecoute` folder:
```
cd ecoute
```
3. Install the required packages:
```
pip install -r requirements.txt
```
2023-05-14 14:42:12 +00:00
2023-05-14 00:30:19 +00:00
4. Create a `keys.py` file and add your OpenAI API key:
```
echo 'OPENAI_API_KEY = "API KEY"' > keys.py
```
Replace `API KEY` with your actual OpenAI API key.
2023-05-17 00:13:12 +00:00
### 🎬 Running Ecoute
2023-05-14 00:30:19 +00:00
Run the main script:
```
2023-05-13 01:18:27 +00:00
python main.py
2023-05-14 00:30:19 +00:00
```
2023-05-17 00:13:12 +00:00
Now, Ecoute will start transcribing your microphone input and speaker output in real-time, and provide a suggested response based on the conversation. It may take a couple of seconds to warm up before the transcription becomes real-time.
2023-05-14 00:30:19 +00:00
2023-05-16 23:53:22 +00:00
### ⚠️ Limitations
2023-05-17 00:13:12 +00:00
While Ecoute provides real-time transcription and response suggestions, there are several known limitations to its functionality that you should be aware of:
2023-05-16 23:53:22 +00:00
**Default Mic and Speaker:** Ecoute is currently configured to listen only to the default microphone and speaker set in your system. It will not detect sound from other devices or systems. If you wish to use a different mic or speaker, you will need to set it as your default device in your system settings.
**Whisper Model**: We utilize the 'tiny' version of the Whisper ASR model, due to its low resource consumption and fast response times. However, this model may not be as accurate as the larger models in transcribing certain types of speech, including accents or uncommon words.
2023-05-17 00:13:12 +00:00
**Language**: The Whisper model used in Ecoute is set to English. As a result, it may not accurately transcribe non-English languages or dialects. We are actively working to add multi-language support to future versions of the program.
2023-05-16 23:53:22 +00:00
2023-05-14 00:30:19 +00:00
## 📖 License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
## 🤝 Contributing
2023-05-17 00:13:12 +00:00
Contributions are welcome! Feel free to open issues or submit pull requests to improve Ecoute.