-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Include OpenAI Whisper model #70
Comments
I've been using Whisper to subtitle and translate to English videos which I could not find any subtitles. The only issue I saw with it is that, while OpenAI's implementation generates subtitles, sometimes their timestamps are not great. But there is WhisperX and whisper-timestamped that improve that. To be fair, I got here because it showed up in GitHub's "Explore repositories" and I thought "What it does better than Whisper?" then I saw that's just an older project. At this point I don't see that's no longer a case of incorporating it in AutoSub: those projects (including OpenAI's implementation) already generate subtitles and seem to use a lot of tricks to improve performance. A lot faster in fact: faster-whisper speed is 54s/13min, 4 seconds per minute of audio, against AutoSub's 34 second per minute (40 minutes/70 minutes). Maybe AutoSub can be faster if the same hardware was used, but the readme makes it seem a quite slower. |
OpenAI just released probably the best model that there is for speech recognition right now.
It would be great to incorprate this into this project!
More info: https://openai.com/blog/whisper/
The text was updated successfully, but these errors were encountered: