You need to create a custom MediaStreamSource implementation. This way you'll have complete control on what data is received from the network, and what data is fed to the OS-provided media codecs.
Unfortunately this task is hard. Here's the document outlining some technical challenges I've encountered (and solved): http://const.me/SkyFM/technical.aspx