There is nothing doable from the OS side with a T5220 (UltraSPARC T2 based). The only way is to work at the userland side and better parallelize your workload.
Starting with the UltraSPARC T4 series, the critical thread feature allows to automatically assign a whole chip to a single thread and then boost performance in your use case.