-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
not found output as speech to text #698
Comments
where did you get this code ? |
Thanks for reply let me explain from scratch for its replacement i use algorithm of EXAMPLE 46 of this link here is my code of that function def transcribe(fp):
version of pocketsphinx is : Version: 0.1.15 if i speak nothing than i got 0 as output from process_raw and if i say hello i got some specific value like 256 from process_raw() but blank string from decoder.hyp().hypstr |
Download latest (5prealpha) Pocketsphinx and Sphinxbase code from github or see similar code in jasper-dev |
i used https://github.com/cmusphinx/pocketsphinx/blob/master/swig/python/test/decoder_test.py in all three cases i got results as: hyp().hypstr gives blank string and model score along with confidence are giving some values what to did to get text from hyp().hypstr of wav file |
i need to know while creating decoder other than .dict and .bin file you also passed a file of hmm model which one file it is i downloaded https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/US%20English/cmusphinx-en-us-8khz-5.2.tar.gz/download from here i found 6-7 files inside it |
Make sure wav file is in 16Khz 16 bit mono format. Try this model |
thanks alot man its now working i was stucked from many days because of this |
yeah, its working well but i need model for indian english and hindi i think that was for american english i already developed application for speech to text in american english |
Did you search sourceforge for it ?? |
yeah, i found their pretrained indian english and hindi models but they are not working |
see above two even when you sent me 6th model from below link https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/US%20English/ |
while using hindi model i got below error here is my code from pocketsphinx.pocketsphinx import * MODELDIR = "/home/user/scribe/model/hindi/" config = Decoder.default_config() #stream = open(path.join(DATADIR, 'goforward.raw'), 'rb') in_speech_bf = False |
If not working then Train your own model. |
how? |
a good start point |
i used your provided transcribe function for speech to text and i replaced your provided _decoder with other objects provided by pocketsphinx here is my code
def transcribe(fp):
result = speech_rec.get_hyp()
exit(0)
and i got this as output 👍
You just said: <pocketsphinx.pocketsphinx.Hypothesis; proxy of <Swig Object of type 'Hypothesis *' at 0x7f074c6033f0> >
but expecting speech to text
someone please suggest what's wrong here
and if i use
decoder.hyp().hypstr
than nothing is printed as output
The text was updated successfully, but these errors were encountered: