ecoute/README.md


# 🎧 Ecoute

Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.

## 📖 Demo

https://github.com/SevaSk/ecoute/assets/50382291/8ac48927-8a26-49fd-80e9-48f980986208

Ecoute is designed to help users in their conversations by providing live transcriptions and generating contextually relevant responses. By leveraging the power of OpenAI's GPT-3.5, Ecoute aims to make communication more efficient and enjoyable.

## 🚀 Getting Started

Follow these steps to set up and run Ecoute on your local machine.

### 📋 Prerequisites

- Python >=3.8.0
- An OpenAI API key
- Windows OS (Not tested on others)
- FFmpeg 

If FFmpeg is not installed in your system, you can follow the steps below to install it.

First, you need to install Chocolatey, a package manager for Windows. Open your PowerShell as Administrator and run the following command:
```
Set-ExecutionPolicy Bypass -Scope Process -Force; [System.Net.ServicePointManager]::SecurityProtocol = [System.Net.ServicePointManager]::SecurityProtocol -bor 3072; iex ((New-Object System.Net.WebClient).DownloadString('https://community.chocolatey.org/install.ps1'))
```
Once Chocolatey is installed, you can install FFmpeg by running the following command in your PowerShell:
```
choco install ffmpeg-full
```
Please ensure that you run these commands in a PowerShell window with administrator privileges. If you face any issues during the installation, you can visit the official Chocolatey and FFmpeg websites for troubleshooting.

### 🔧 Installation

1. Clone the repository:

   ```
   git clone https://github.com/SevaSk/ecoute
   ```

2. Navigate to the `ecoute` folder:

   ```
   cd ecoute
   ```

3. Install the required packages:

   ```
   pip install -r requirements.txt
   ```
   
4. Create a `keys.py` file in the ecoute directory and add your OpenAI API key:

   - Option 1: You can utilize a command on your command prompt. Run the following command, ensuring to replace "API KEY" with your actual OpenAI API key:

      ```
      python -c "with open('keys.py', 'w', encoding='utf-8') as f: f.write('OPENAI_API_KEY=\"API KEY\"')"
      ```

   - Option 2: You can create the keys.py file manually. Open up your text editor of choice and enter the following content:
   
      ```
      OPENAI_API_KEY="API KEY"
      ```
      Replace "API KEY" with your actual OpenAI API key. Save this file as keys.py within the ecoute directory.

### 🎬 Running Ecoute

Run the main script:

```
python main.py
```

For a better and faster version, use:

```
python main.py --api
```

Upon initiation, Ecoute will begin transcribing your microphone input and speaker output in real-time, generating a suggested response based on the conversation. Please note that it might take a few seconds for the system to warm up before the transcription becomes real-time.

The --api flag significantly enhances transcription speed and accuracy, and it's expected to be the default option in future releases. However, keep in mind that using the Whisper API will consume more OpenAI credits than using the local model. This increased cost is attributed to the advanced features and capabilities that the Whisper API provides. Despite the additional cost, the considerable improvements in speed and transcription accuracy might make it a worthwhile investment for your use case.

### ⚠️ Limitations

While Ecoute provides real-time transcription and response suggestions, there are several known limitations to its functionality that you should be aware of:

**Default Mic and Speaker:** Ecoute is currently configured to listen only to the default microphone and speaker set in your system. It will not detect sound from other devices or systems. If you wish to use a different mic or speaker, you will need to set it as your default device in your system settings.

**Whisper Model**: If the --api flag is not used, we utilize the 'tiny' version of the Whisper ASR model, due to its low resource consumption and fast response times. However, this model may not be as accurate as the larger models in transcribing certain types of speech, including accents or uncommon words.

**Language**: The Whisper model used in Ecoute is set to English. As a result, it may not accurately transcribe non-English languages or dialects. We are actively working to add multi-language support to future versions of the program.

## 📖 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🤝 Contributing

Contributions are welcome! Feel free to open issues or submit pull requests to improve Ecoute.
Update README.md 2023-05-14 00:33:48 +00:00
			`# 🎧 Ecoute`

Update README.md 2023-05-14 00:36:54 +00:00			`Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.`
Update README.md 2023-05-14 00:33:48 +00:00
Update README.md 2023-05-16 23:53:22 +00:00			`## 📖 Demo`
Update README.md 2023-05-14 00:33:48 +00:00
Update README.md 2023-05-18 02:27:13 +00:00			`https://github.com/SevaSk/ecoute/assets/50382291/8ac48927-8a26-49fd-80e9-48f980986208`
Update README.md 2023-05-16 23:53:22 +00:00
			`Ecoute is designed to help users in their conversations by providing live transcriptions and generating contextually relevant responses. By leveraging the power of OpenAI's GPT-3.5, Ecoute aims to make communication more efficient and enjoyable.`
Update README.md 2023-05-14 00:33:48 +00:00
Update README.md 2023-05-14 00:30:19 +00:00			`## 🚀 Getting Started`
Create README.md 2023-05-12 02:08:52 +00:00
Update README.md 2023-05-17 00:13:12 +00:00			`Follow these steps to set up and run Ecoute on your local machine.`
Update README.md 2023-05-13 01:18:44 +00:00
Update README.md 2023-05-14 00:30:19 +00:00			`### 📋 Prerequisites`
Update README.md 2023-05-13 01:18:58 +00:00
Update README.md The current project's Python requirement (>=3.0,<4.0) is not compatible with some of the required packages Python requirement: - torch requires Python >=3.8.0, so it will not be satisfied for Python >=3.0,<3.8.0 2023-05-31 03:44:03 +00:00			`- Python >=3.8.0`
Update README.md 2023-05-14 00:30:19 +00:00			`- An OpenAI API key`
Update README.md 2023-05-17 00:13:12 +00:00			`- Windows OS (Not tested on others)`
Update README.md 2023-05-21 05:41:44 +00:00			`- FFmpeg`

			`If FFmpeg is not installed in your system, you can follow the steps below to install it.`

			`First, you need to install Chocolatey, a package manager for Windows. Open your PowerShell as Administrator and run the following command:`
			```
			`Set-ExecutionPolicy Bypass -Scope Process -Force; [System.Net.ServicePointManager]::SecurityProtocol = [System.Net.ServicePointManager]::SecurityProtocol -bor 3072; iex ((New-Object System.Net.WebClient).DownloadString('https://community.chocolatey.org/install.ps1'))`
			```
Update README.md 2023-05-21 05:42:17 +00:00			`Once Chocolatey is installed, you can install FFmpeg by running the following command in your PowerShell:`
Update README.md 2023-05-21 05:41:44 +00:00			```
Update README.md 2023-05-21 05:50:47 +00:00			`choco install ffmpeg-full`
Update README.md 2023-05-21 05:41:44 +00:00			```
			`Please ensure that you run these commands in a PowerShell window with administrator privileges. If you face any issues during the installation, you can visit the official Chocolatey and FFmpeg websites for troubleshooting.`
Update README.md 2023-05-13 19:49:49 +00:00
Update README.md 2023-05-14 00:30:19 +00:00			`### 🔧 Installation`

			`1. Clone the repository:`

			```
			`git clone https://github.com/SevaSk/ecoute`
			```

			2. Navigate to the `ecoute` folder:

			```
			`cd ecoute`
			```

			`3. Install the required packages:`

			```
			`pip install -r requirements.txt`
			```
Update README.md 2023-05-14 14:42:12 +00:00
Update README.md 2023-05-26 16:13:42 +00:00			4. Create a `keys.py` file in the ecoute directory and add your OpenAI API key:
Update README.md 2023-05-14 00:30:19 +00:00
Update README.md 2023-05-31 01:46:58 +00:00			`- Option 1: You can utilize a command on your command prompt. Run the following command, ensuring to replace "API KEY" with your actual OpenAI API key:`
Update README.md 2023-05-31 01:42:41 +00:00
			```
			`python -c "with open('keys.py', 'w', encoding='utf-8') as f: f.write('OPENAI_API_KEY=\"API KEY\"')"`
			```

Update README.md 2023-05-31 01:38:33 +00:00			`- Option 2: You can create the keys.py file manually. Open up your text editor of choice and enter the following content:`
Update README.md 2023-05-31 01:42:41 +00:00
			```
			`OPENAI_API_KEY="API KEY"`
			```
Update README.md 2023-05-31 01:44:17 +00:00			`Replace "API KEY" with your actual OpenAI API key. Save this file as keys.py within the ecoute directory.`
Update README.md 2023-05-14 00:30:19 +00:00
Update README.md 2023-05-17 00:13:12 +00:00			`### 🎬 Running Ecoute`
Update README.md 2023-05-14 00:30:19 +00:00
			`Run the main script:`

			```
Update README.md 2023-05-13 01:18:27 +00:00			`python main.py`
Update README.md 2023-05-14 00:30:19 +00:00			```

added --api flag 2023-05-30 00:34:23 +00:00			`For a better and faster version, use:`

			```
			`python main.py --api`
			```

			`Upon initiation, Ecoute will begin transcribing your microphone input and speaker output in real-time, generating a suggested response based on the conversation. Please note that it might take a few seconds for the system to warm up before the transcription becomes real-time.`

			The --api flag significantly enhances transcription speed and accuracy, and it's expected to be the default option in future releases. However, keep in mind that using the Whisper API will consume more OpenAI credits than using the local model. This increased cost is attributed to the advanced features and capabilities that the Whisper API provides. Despite the additional cost, the considerable improvements in speed and transcription accuracy might make it a worthwhile investment for your use case.
Update README.md 2023-05-14 00:30:19 +00:00
Update README.md 2023-05-16 23:53:22 +00:00			`### ⚠️ Limitations`

Update README.md 2023-05-17 00:13:12 +00:00			`While Ecoute provides real-time transcription and response suggestions, there are several known limitations to its functionality that you should be aware of:`
Update README.md 2023-05-16 23:53:22 +00:00
			`Default Mic and Speaker: Ecoute is currently configured to listen only to the default microphone and speaker set in your system. It will not detect sound from other devices or systems. If you wish to use a different mic or speaker, you will need to set it as your default device in your system settings.`

Update README.md 2023-05-30 01:01:44 +00:00			`Whisper Model: If the --api flag is not used, we utilize the 'tiny' version of the Whisper ASR model, due to its low resource consumption and fast response times. However, this model may not be as accurate as the larger models in transcribing certain types of speech, including accents or uncommon words.`
Update README.md 2023-05-16 23:53:22 +00:00
Update README.md 2023-05-17 00:13:12 +00:00			`Language: The Whisper model used in Ecoute is set to English. As a result, it may not accurately transcribe non-English languages or dialects. We are actively working to add multi-language support to future versions of the program.`
Update README.md 2023-05-16 23:53:22 +00:00
Update README.md 2023-05-14 00:30:19 +00:00			`## 📖 License`

			`This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.`

			`## 🤝 Contributing`

Update README.md 2023-05-17 00:13:12 +00:00			`Contributions are welcome! Feel free to open issues or submit pull requests to improve Ecoute.`