سؤال

Hi I want to have a speech recognition api or sdk which recognises the speech spoken by the user and gives it's text form.

Detailed Description is as follows:

In my application I need to play an audio file and text of which is already there with me. When audio starts playing the word should be highlighted which is spoken(from the audio file).

So if I am able to get the word from api or sdk then it is possible to highlight it.

Apart from I googled a lot for api and I came across ceedvocalsdk but it's not available for free trial.

If someone can provide any idea other than this suiting to my requirement or api or sdk , I will be highly Thankful.

هل كانت مفيدة؟

المحلول

You can try

http://www.politepix.com/openears/

As for speed, it should be fast, you probably don't use it properly. As I understood you have text already and you need to build grammar from this text.

نصائح أخرى

You can take a look at https://github.com/KingOfBrian/VocalKit, but I have not tried it myself.

You can also try Nexiwave.com.

I think the function you are looking for is what we can TimeStamping: http://nexiwave.com/index.php/applications/for-transcription-companies

It basically take an audio and the text, we then put timestamp on each sentence and word.

Ben

مرخصة بموجب: CC-BY-SA مع الإسناد
لا تنتمي إلى StackOverflow
scroll top