Replies: 1 comment 1 reply
-
Sounds like you want to achieve the same thing as in the stream example? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Great work here, the accuracy is unbelievable! I would like to get realtime support, and will be following progress here on this intently. I am familiar with the Win32 wav APIs and want to use direct audio input streaming in real time. So basically I will call ::waveInOpen, ::waveInPrepareHeader, ::waveInAddBuffer, etc to load buffers continuously, and pass on to Whisper in realtime. Obviously I need to bypass all the wav file stuff, and hook in there somewhere. I then need to parse the text coming out of an output buffer. I did read some of the discussion here about the "chunking" issues with realtime streaming and am not up to speed enough to comment at this point. It appears in the code that there are issues with how the stream is pieced together (what if the chunk is in the middle of a word?) and also the processing algorithm that seems to be tuned to 30 second chunks. So perhaps I am "early to the party" here with regard to realtime streaming.
Beta Was this translation helpful? Give feedback.
All reactions