Voice to Clipboard (vtcp)
vtcp is a command-line utility that captures audio from your microphone, transcribes it using Azure Cognitive Services Speech-to-Text, and copies the transcribed text to your clipboard.
Features
- Real-time speech recognition
- Copies transcribed text to clipboard
- Supports silence detection to automatically stop recording
- Plays a sound to indicate start and end of recording
- Uses Azure Cognitive Services for accurate speech-to-text conversion
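The silence-detection feature above can be illustrated with a minimal sketch. It assumes 16-bit little-endian mono PCM frames and a simple RMS energy threshold; the function names, the threshold of 500, and the three-frame window are hypothetical choices for illustration, not necessarily how vtcp or the Azure Speech SDK decides when to stop.

```python
import math
import struct
from typing import Iterable


def frame_rms(frame: bytes) -> float:
    """Root-mean-square amplitude of one frame of 16-bit little-endian mono PCM."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def frames_until_silence(
    frames: Iterable[bytes],
    threshold: float = 500.0,      # hypothetical energy threshold
    max_silent_frames: int = 3,    # hypothetical silence window
) -> int:
    """Count the frames consumed before recording should stop.

    Recording stops once `max_silent_frames` consecutive frames fall
    below `threshold`; any louder frame resets the silent run.
    """
    silent_run = 0
    consumed = 0
    for frame in frames:
        consumed += 1
        if frame_rms(frame) < threshold:
            silent_run += 1
            if silent_run >= max_silent_frames:
                break
        else:
            silent_run = 0
    return consumed
```

Requiring several consecutive quiet frames, rather than a single one, keeps short pauses between words from ending the recording early.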
Prerequisites
- Python 3.7 or higher
- Azure Cognitive Services account
- Azure Speech Services subscription key and region
Installation
- Clone the repository:

      git clone <repository_url>
      cd vtcp
- Create a virtual environment (recommended):

      python3 -m venv .venv
      source .venv/bin/activate  # On Linux/macOS
      .venv\Scripts\activate     # On Windows
- Install the dependencies:

      pip install -r requirements.txt
- Configure Azure credentials:
  - Create a .env file in the root directory of the project.
  - Add your Azure Speech Services subscription key and region to the .env file:

        AZURE_SPEECH_KEY=your_key_here
        AZURE_SPEECH_REGION=your_region_here

    Replace your_key_here and your_region_here with your actual Azure credentials. You can obtain these from the Azure portal.
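To show how the two settings in the .env file might be read at startup, here is a minimal sketch using only the standard library. The helpers `parse_env_file` and `load_azure_credentials` are hypothetical names introduced for illustration; the project itself may load the file differently (for example with a dotenv library). The sketch assumes simple KEY=value lines and lets real environment variables take precedence over the file.

```python
import os
from pathlib import Path
from typing import Dict, Tuple


def parse_env_file(path: str) -> Dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_azure_credentials(path: str = ".env") -> Tuple[str, str]:
    """Return (key, region); environment variables override the .env file."""
    file_values = parse_env_file(path) if Path(path).is_file() else {}
    key = os.environ.get("AZURE_SPEECH_KEY") or file_values.get("AZURE_SPEECH_KEY")
    region = os.environ.get("AZURE_SPEECH_REGION") or file_values.get("AZURE_SPEECH_REGION")
    if not key or not region:
        raise RuntimeError(
            "AZURE_SPEECH_KEY and AZURE_SPEECH_REGION must be set in the "
            "environment or in the .env file"
        )
    return key, region
```

Failing early with a clear message when either value is missing is usually easier to debug than an authentication error raised later by the speech service.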
Usage
Run the vtcp command in your terminal: