Parallel Compacting Collector algorithm

Question 1

This is just a general description of an algorithm. Such descriptions vary in their level of detail. This one gives you most of the details, but still leaves some choices to the implementer.

Regarding your questions:

So no compaction happened on the "summary phase"? Was the previous phase's purpose only to find all free spaces? - Yes, that is correct. The summary phase gathers indexing data and determines everything necessary so that the compaction phase can then perform the copying. The description doesn't say how compaction is implemented, but the default way is simply to place every live object right next to the previous one. All empty space is removed, and once the compaction step has completed you have one contiguous chunk of memory containing all live objects. I see your confusion with the fourth part, but note that it is written in the future tense: 'will be compacted'. So not during summary, but later.
Does it mean [...] compacting of this region is not worth the space that could be recovered from such a region? - Yes, that is right. You essentially lose some space, but it is very common to trade off memory for execution speed. The exact density threshold is up to the implementation, but I would ballpark the used-to-total-memory ratio threshold at around 70-90%.
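To make the split between the two phases concrete, here is a minimal sketch in Java. It is not how any real VM implements it: the Obj record, the region size, the 0.75 density threshold and the rule "the dense prefix is the longest run of regions from the left that meet the threshold" are all assumptions made for illustration. The summary phase only computes destinations; the compaction phase does the copying.

```java
import java.util.Arrays;
import java.util.List;

final class CompactionSketch {
    // Hypothetical object record: id, position and size in heap words, and
    // the mark bit that a preceding marking phase would have set.
    static final class Obj {
        final int id, start, size;
        final boolean live;
        int dest = -1; // filled in by the summary phase
        Obj(int id, int start, int size, boolean live) {
            this.id = id; this.start = start; this.size = size; this.live = live;
        }
    }

    // Summary phase: nothing is copied. It counts live words per region,
    // picks a dense prefix, and assigns a destination to every live object.
    // objs must be sorted by address. Returns the new top of the live data.
    static int summarize(List<Obj> objs, int regionSize, int regionCount, double minDensity) {
        int[] liveWords = new int[regionCount];
        for (Obj o : objs) {
            if (!o.live) continue;
            for (int w = o.start; w < o.start + o.size; w++) liveWords[w / regionSize]++;
        }
        // Assumed rule: leading regions that are dense enough are left alone.
        int prefixRegions = 0;
        while (prefixRegions < regionCount
                && liveWords[prefixRegions] >= minDensity * regionSize) {
            prefixRegions++;
        }
        int prefixEnd = prefixRegions * regionSize;
        int next = prefixEnd; // next free word after the dense prefix
        for (Obj o : objs) {
            if (!o.live) continue;
            if (o.start < prefixEnd) {
                o.dest = o.start;                       // stays in place
                next = Math.max(next, o.start + o.size); // may straddle the boundary
            } else {
                o.dest = next;                           // placed right after the previous object
                next += o.size;
            }
        }
        return next;
    }

    // Compaction phase: copy each live object to the destination chosen above.
    // Destinations never exceed sources, so copying in address order is safe.
    static void compact(int[] heap, List<Obj> objs) {
        for (Obj o : objs) {
            if (o.live && o.dest != o.start) {
                System.arraycopy(heap, o.start, heap, o.dest, o.size);
            }
        }
    }

    // Toy heap of 16 words: every word of an object holds that object's id.
    static int[] layOut(List<Obj> objs, int heapWords) {
        int[] heap = new int[heapWords];
        for (Obj o : objs) Arrays.fill(heap, o.start, o.start + o.size, o.id);
        return heap;
    }

    static List<Obj> exampleObjects() {
        return Arrays.asList(
            new Obj(1, 0, 4, true),  new Obj(2, 4, 3, true),  new Obj(3, 7, 1, false),
            new Obj(4, 8, 1, true),  new Obj(5, 9, 3, false), new Obj(6, 12, 2, true),
            new Obj(7, 14, 2, false));
    }

    public static void main(String[] args) {
        List<Obj> objs = exampleObjects();
        int[] heap = layOut(objs, 16);
        int newTop = summarize(objs, 4, 4, 0.75);
        compact(heap, objs);
        System.out.println("new top = " + newTop + ", heap = " + Arrays.toString(heap));
    }
}
```

In this toy heap the first two regions are at least 75% full, so they form the dense prefix and the one dead word at address 7 is never reclaimed. That is the space you give up in exchange for not moving objects 1 and 2. Everything behind the prefix slides down, and the live data ends at word 11 instead of word 14.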

If you want to know all the dirty details, have a look at an open-source VM implementation, as suggested in the comments.

Question 2

If you really need a detailed understanding of how the collectors work, you can read the code. The reason you don't find many detailed pages on this is that the collectors are designed to take care of memory management for you, and if you start worrying about the details you have gone down the wrong road.

The best solution is to use a memory profiler and reduce your allocation rate. No amount of tuning or messing with your command line options (unless you have a mis-configured GC) will compare to reducing this allocation rate.
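As a small illustration of what "reducing the allocation rate" can look like in code, here is a sketch with two hypothetical methods that compute the same sum. The boxed version may allocate a new Long on most iterations (unless the JIT manages to optimise it away), while the primitive version allocates nothing in the loop. A memory profiler is what tells you where such hot spots actually are in your application; this only shows the kind of change that usually follows.

```java
final class AllocationRateSketch {
    // Allocation-heavy: 'total' is a boxed Long, so each += unboxes, adds,
    // and boxes the result again, typically creating a new object.
    static long sumBoxed(long n) {
        Long total = 0L;
        for (long i = 0; i < n; i++) {
            total += i;
        }
        return total;
    }

    // Same result with a primitive accumulator: no objects created in the loop.
    static long sumPrimitive(long n) {
        long total = 0L;
        for (long i = 0; i < n; i++) {
            total += i;
        }
        return total;
    }

    public static void main(String[] args) {
        System.out.println(sumBoxed(1000) + " " + sumPrimitive(1000));
    }
}
```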

However, to respond to your questions:

parallel mark-sweep-compact

There is no such thing. There is the Parallel Collector, which compacts, and the Concurrent Mark Sweep collector, which doesn't. There is also the G1 collector, which is not generational in the same way, i.e. it collects both young and old objects.
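If you are unsure which collectors your JVM is actually running, you can ask it at runtime through the standard java.lang.management API. The sketch below only lists what the running JVM reports; the class and method names are made up for the example, and the collector names you see depend on the JVM version and on which collector was selected on the command line.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.ArrayList;
import java.util.List;

final class ListCollectors {
    // Builds one line per collector registered in the running JVM, with
    // its name and the collection count and accumulated time so far.
    static List<String> describeCollectors() {
        List<String> lines = new ArrayList<>();
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            lines.add(gc.getName() + ": " + gc.getCollectionCount()
                    + " collections, " + gc.getCollectionTime() + " ms");
        }
        return lines;
    }

    public static void main(String[] args) {
        describeCollectors().forEach(System.out::println);
    }
}
```

There is typically one entry for the young generation collector and one for the old generation collector, so the output also shows that the old generation is handled by a separate collector from the young one.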

Can't believe that this information is not eligible for the users.

By design, developers don't need to know this much detail. Nor is it a good idea to over-tune your application, because this makes it very brittle to changes in the application or the JVM.

What I would like to know is where to get a full description/information about how different garbage collectors

Instead of saying "I would like to know everything there is to know" (there are 500+ options alone) without having to read the code, you should try to solve a specific problem and ask a specific question.

Every region will be compacted separately? I guess no. So maybe some kind of shifting will be here?

Only the tenured space is compacted. The young regions are copied repeatedly and never need compacting.

So no compaction happened on the "summary phase"? Was the previous phase's purpose only to find all free spaces?

The compaction phase does a best-effort copy to one end of the region without defragmenting it completely. This leaves one end with some objects (mostly large ones, I imagine) and the other end very dense.