Is there a better solution than google speech-api? [closed]

https://stackoverflow.com/questions/20389617

29-08-2022

Question

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.

We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.

Closed 7 years ago.


I'm working on Sbire, which uses the Google Speech API. There are some things I don't like about this API.

First, it does not work in streaming mode. It accepts small audio files, but there are no real pipes. Second, it's not accurate. Google does not record our voice to learn our accent. I think it's very important to have good recognition quality.

I'm looking for a free API or an open-source tool.

Solution

Actually, there is a streaming mode, but it requires an API key. Here is an example of how to use it: http://mikepultz.com/2013/07/google-speech-api-full-duplex-php-version/

If you want something that learns, you'll need to create a neural network to handle the data. The best open-source tool, however, is CMUSphinx, or PocketSphinx (a mini version of CMUSphinx). Here is a link to that tool: http://sourceforge.net/p/cmusphinx/code/HEAD/tree/trunk/

The best part of CMUSphinx, however, is that you can set a dictionary of keywords. Instead of outputting gibberish, you can have it output ONLY those commands and find the command closest to what it thinks you are saying. This makes speech recognition incredibly accurate, but it only works for a small set of words.
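To illustrate the "closest command" idea, here is a minimal Python sketch that snaps a raw recognition result onto a small, closed set of commands. It is only an illustration of the principle: it does not use CMUSphinx at all, and it is not how CMUSphinx restricts its output internally (there the restriction is applied during recognition, not afterwards). The command list, the function name `closest_command`, and the `cutoff` value are all made up for this example.

```python
import difflib

# Hypothetical command vocabulary; replace with your own commands.
COMMANDS = ["lights on", "lights off", "play music", "stop music"]


def closest_command(hypothesis, commands=COMMANDS, cutoff=0.6):
    """Return the command most similar to the raw hypothesis, or None.

    `hypothesis` stands in for whatever text a recognizer produced.
    `cutoff` (an assumed value) is the minimum similarity, between 0 and 1,
    required before we accept a match.
    """
    # Normalize case and collapse repeated whitespace before comparing.
    normalized = " ".join(hypothesis.lower().split())
    matches = difflib.get_close_matches(normalized, commands, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    for heard in ["stop musik", "  Play   MUSIC ", "xyzzy qqq"]:
        print(repr(heard), "->", closest_command(heard))
```

With a small vocabulary, a slightly misheard phrase such as "stop musik" still maps to a valid command, while input that resembles none of the commands is rejected instead of being passed on as gibberish. The trade-off is the one described above: this only works when the set of expected phrases is small.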

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow