If you study the history of a famous camera company Point Grey you will find a triclops - three stereo cameras set up horizontally and vertically in a triangular pattern that supposedly allow you to get better matching. This is because horizontally shifted cameras prefer vertical gradient and vertically shifted ones prefer horizontal gradient for matching.
Happened to be a waste of hardware since usually both directions of gradients are present in a correlation window. Another early mistake was to use color for stereo matching (looks like an attractive option but it adds more noise and variability than help). Point grey now has 'digiclops' with two cameras that are grey, not color.
If you see three cameras they are typically lined up horizontally to provide a choice for a wide and narrow baseline. A narrow one is good for close objects while the wide one would have a longer 'dead zone' (i.e. absence of stereo overlap) but distinguishes depth at longer distances.
Stereo cameras never pick up for another reason - absence of texture creates huge holes in disparity maps. Kinect happened to a winner here because it projects its own texture (though it cannot do this at sunlight).