The typical deployment I see is one or more enforcement points (PEP) talking to a load balancer that sits in front of multiple PDPs that are all equally configured.
That's true of any version of XACML.
PDPs rarely communicate together though you could imagine you'd have a PDP talking to another via a PIP connector.
--- EDIT --- Here's an architecture diagram