Algorithm for very simple voice/speech recognition

https://stackoverflow.com/questions/22621948

20-06-2023
|

Question

I'm writing a game for Google Glass, but unfortunately SpeechRecognizer API isn't available on the current builds on Google Glass GDK.

So I've been thinking about implementing an algorithm for a very simple voice recognition. Let's say I want to recognize only: "Yes" and "No".
Do you know any example code or any helpful resources to help me in implementing this ?
Is it so hard that I should drop the idea and go with big frameworks like CMUSphinx ?

What about recognizing: up, down, right, left or numbers from 1 to 10 ?

Solution

As I know, there often used transition to the frequency domain by fast Fourier transform (FFT) and it analyzing. Also need some dictionary of speeched words for frequency correlation.

Please see this links:

CMU Sphinx have java implementation.

David Wagner have a good article and matlab implementation.

P.S. Ohh, if you speak in russian, why you don't read this article - very simple, with java examples.

P.P.S. Honestly, I never use this framework, but if you have only a superficial knowledge about speech recognition, robust and easyest way is to use existing complete solutions like frameworks or libraries, otherwise you need spend time to possess the necessary knowledge threshold. In this case you can read this article.

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow