This is a Flask web app that allows you to translate audio and video files to an audio output in any languag of your choice using OpenAI's whisper, gpt-3.5-turbo and Eleven labs. It uses the pytube, moviepy, and pydub libraries to download and process the videos, and nltk for tokenizing the text.
6428e6f5e849b34fcd6d5b53_webm.1.mp4
Clone this repository and navigate to the project directory:
bash Copy code git clone https://github.com//.git cd Create a virtual environment and install the required packages:
''' Copy code python -m venv venv source venv/bin/activate # on Windows, use "venv\Scripts\activate" pip install -r requirements.txt '''
''' Run the app locally using Flask: arduino Copy code export FLASK_APP=app.py export FLASK_ENV=development flask run ''' Navigate to http://localhost:5000/ in your web browser to access the app.
Upload audio or video files or links. Transcribe audio or video and generate a text output. Chuck words of over 3000 tokens. Use the OpenAI API to translate. Use Elevenlabs API to convert text to audio.
Flask Flask-SocketIO Flask-Bootstrap PyTube moviepy pydub nltk OpenAI API key Elevenlabs API key
Contributions to this project are welcome. To contribute, please follow these steps: Fork this repository. Create a new branch: git checkout -b my-new-branch Make your changes and commit them: git commit -m "Add some feature" Push to the branch: git push origin my-new-branch Create a new pull request. Please include a clear description of your changes and their purpose.