Can anyone point me in the right direction on how I would minimize ambient noise while recording someone speaking using the iPhone SDK Core Audio? I'm guessing a band-pass filter that eliminates any frequencies above and below the human vocal range might work. I have no idea how I would implement band filters on audio in the SDK though. The optimum solution would be one that eliminates the noise from the stream before it is written to memory/disk.

Was it helpful?


You would need to implement an adaptive filter and a voice activity detector. In periods where there is no speech you would collect ambient noise and use this to drive the adaptive filter.


Have a search for "spectral subtraction" which a technique of analyzing a sample of what you're calling background noise, and then subtracting it from the the regular signal. Yes, you will always get some artifacts. It's a tradeoff.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow