Speech To Text Analyze

This repo compares the performance of tools that convert audio to text, reporting their accuracy, speed, and features.

Benchmark Test 2020

The accuracy of automated speech recognition (ASR) depends on the audio in many ways, and the effect is not small. Accuracy can vary widely depending on factors such as:

  • Does the speech follow proper grammar, or is the speaker improvising as they go? Prepared speeches will score better, i.e. have lower WER (word error rate), than unscripted speech (a minimal WER computation is sketched after this list).
  • What is the subject of the speech? Rare and obscure words or word combinations, e.g. personal or other proper names, will make life difficult for the natural language model (NLM).
  • Is there more than one speaker? Do they switch constantly, or even talk over one another?
  • Is there music in the background? This is very common in YouTube productions.
  • Is there background noise, and what type of noise is it?
  • Are parts of the speech audio unusually slow or fast?
  • Is there room reverb or echo in the recording?
  • Is the recording volume very low? Are there variations in the recording volume (e.g. a recorder placed at one end of a very long table)?
  • Is the recording quality poor, e.g. due to a codec or extreme archival compression?
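
WER itself is simple to compute: it is the word-level edit distance (substitutions + deletions + insertions) between a reference transcript and the recognizer's hypothesis, divided by the number of reference words. A minimal Python sketch, not taken from any particular tool (function and variable names are illustrative):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Word-level Levenshtein distance via dynamic programming.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution
    return d[len(ref)][len(hyp)] / len(ref)

print(wer("the quick brown fox", "the quick brown box"))  # 0.25 (one error in four reference words)
```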

So what are the results? Who has the best recognizer?

Again, "which recognizer is best" is not quite the right question, because the answer depends on the actual speech audio it is used on.

  • Every recognizer has improved. The biggest improvement in median WER was by Microsoft Speech to Text.
  • The best recognizer on our data set was Google Speech to Text - Enhanced (video), but the new Microsoft Speech to Text is a very close second.
  • Taking price into consideration, Microsoft might be declared the best buy.
  • Google Speech to Text - Standard, although somewhat improved, is still clearly the worst performer on this data set.
  • The single bad data point for Google Enhanced (video) is real. We ran repeated tests on the file and got the same result. The old Google Enhanced recognizer did not have problems with that file.

All Speech-to-Text Tools

  1. Google API ($) ✅ (an example call is sketched after this list)
  2. Amazon Transcribe ($) ✅
  3. Microsoft Azure Speech to Text ($) ✅
  4. Web Speech API (Free) ✅
  5. Dragon Professional Individual ($)
  6. Braina Pro ($)
  7. Speechnotes (Free)
  8. e-Speaking ($)
  9. Voice Finger ($)
  10. Apple Dictation (Free)
  11. Windows Dictation/Speech Recognition (Free)
  12. Dictation (Free)
  13. Speech Texter (Free)
  14. Dragon Anywhere for Mobile ($)
  15. Otter (Free)
  16. Verbit ($)
  17. Speechmatics ($)
  18. IBM Watson Speech to Text ($)
  19. Just Press Record (Free)
  20. Transcribe (Free)
  21. Voicegain ($)
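
As an illustration of calling one of the tools above (Google's API, item 1), here is a minimal sketch using the Google Cloud Speech-to-Text Python client. It assumes credentials are already configured (e.g. via GOOGLE_APPLICATION_CREDENTIALS) and uses a placeholder audio file name, not one from this repo's test set:

```python
from google.cloud import speech  # pip install google-cloud-speech

client = speech.SpeechClient()

# Placeholder file; a real benchmark run would loop over every audio file in the test set.
with open("sample.wav", "rb") as f:
    audio = speech.RecognitionAudio(content=f.read())

config = speech.RecognitionConfig(
    encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
    sample_rate_hertz=16000,
    language_code="en-US",
)

# Synchronous recognition works for short clips; longer files need long_running_recognize.
response = client.recognize(config=config, audio=audio)
for result in response.results:
    print(result.alternatives[0].transcript)
```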

Todos

  • Search for all speech-to-text tools
  • Review all source code
  • Read docs or articles on all speech-to-text technologies
  • Run some source code locally
  • Test some tools
  • Write a report for each tool
  • Compare all tools
  • Write up features and info for all tools

Source Links

https://www.folio3.ai/blog/best-free-speech-to-text-software/
https://www.softwaretestinghelp.com/best-dictation-software/
https://www.google.com/search?q=compare+speech+to+text+tools&rlz=1C5CHFA_enTR972TR973&oq=compare+speech+to+text+tools&aqs=chrome..69i57j69i60l2.10431j0j7&sourceid=chrome&ie=UTF-8
