Question

Though I was having some problems to make sphinx 4 work initially but now its working great. Since the grammar is finite whether it is JSGF or N-Gram, I am not able to make sphinx recognize every possible word or sentence. I want to build something like voice based google search. Since the search may have any possible word or combination of words. It is troublesome to add all the dictionary words in grammar. Kindly help.

Was it helpful?

Solution

No, it's not possible. Every speech recognizer including the one from Google uses limited vocabulary. Google uses large one of about 1.5 million words but still limited. So CMUSphinx. You can verify that by trying to recognize rare proper names, it's almost impossible even with Google.

The practical solution is to use large language model with large vocabulary.

It's a open research question to detect new words in an audio stream and add them recognizer with spoken or other type of feedback.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top