Sorted Trie data structure

Question 1

If the words share many prefixes, a trie will be the most memory-efficient solution, although it is unfortunately still O(N) memory in the worst case. You'll need to enrich the standard trie node with additional information: a counter of how many times each word occurs.
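A minimal sketch of such a counting trie, assuming plain Python strings as words. The names `TrieNode`, `CountingTrie`, `add`, `count`, and `items` are made up for illustration and do not come from any library:

```python
class TrieNode:
    """One trie node: child links plus a counter for words ending here."""
    __slots__ = ("children", "count")

    def __init__(self):
        self.children = {}   # char -> TrieNode
        self.count = 0       # how many times a word ending at this node was added


class CountingTrie:
    def __init__(self):
        self.root = TrieNode()

    def add(self, word):
        """Insert a word, or bump its counter if it is already present."""
        node = self.root
        for ch in word:
            if ch not in node.children:
                node.children[ch] = TrieNode()
            node = node.children[ch]
        node.count += 1

    def count(self, word):
        """Return how many times the word was added (0 if never)."""
        node = self.root
        for ch in word:
            node = node.children.get(ch)
            if node is None:
                return 0
        return node.count

    def items(self):
        """Yield (word, count) pairs in alphabetical order."""
        def walk(node, prefix):
            if node.count:
                yield prefix, node.count
            for ch in sorted(node.children):
                yield from walk(node.children[ch], prefix + ch)
        yield from walk(self.root, "")


trie = CountingTrie()
for w in ["to", "tea", "to", "ten", "tea", "to"]:
    trie.add(w)
```

Here the shared prefixes "t" and "te" are stored once, and `trie.items()` walks the words alphabetically together with their counters.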

If you're looking for a solution that is optimal in the worst case, a multimap is a better choice:

O(1) average insert time, assuming a hash-based multimap (a trie does not give you this if the alphabet has many letters)
O(N) memory and running time

Still, you'll need to sort the words within each occurrence-count bucket. If many words share the same occurrence count, sorting becomes the dominant operation, and the trie approach then costs the same as the multimap approach.
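The bucket idea can be sketched roughly as follows. This is only an illustration: a `defaultdict(list)` keyed by count stands in for the multimap, and the function name `words_by_frequency` is made up:

```python
from collections import Counter, defaultdict


def words_by_frequency(words):
    """Return (word, count) pairs, most frequent first, ties in alphabetical order."""
    counts = Counter(words)            # word -> number of occurrences

    buckets = defaultdict(list)        # count -> words with that count (the "multimap")
    for word, n in counts.items():
        buckets[n].append(word)

    result = []
    for n in sorted(buckets, reverse=True):
        # Sorting inside a bucket is the step that dominates when many
        # words share the same occurrence count.
        for word in sorted(buckets[n]):
            result.append((word, n))
    return result


ranking = words_by_frequency(["b", "a", "b", "c", "a", "d"])
```

In this small example "a" and "b" land in the count-2 bucket and "c" and "d" in the count-1 bucket, so almost all of the work is the per-bucket sort.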

Question 2

The main property of a trie is that it merges the incoming data to save space, so if you want to use a property that is individual to each data unit, you cannot benefit from the trie's built-in properties. In other words: if you want to save space, use a trie, but to get the most frequent word you need some other algorithm on top of it (for example, traversing the trie once the data has been collected and building another table).

One possible candidate is a priority queue that uses the frequency of the word as the key.
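A rough sketch of that idea using Python's `heapq` module. To keep the example short, a `Counter` stands in for the table collected from the trie, and the function name `top_k` is made up for illustration:

```python
import heapq
from collections import Counter


def top_k(words, k):
    """Return the k most frequent words as (word, count), most frequent first."""
    counts = Counter(words)        # stand-in for counts collected from a trie

    heap = []                      # min-heap of (count, word), at most k entries
    for word, n in counts.items():
        if len(heap) < k:
            heapq.heappush(heap, (n, word))
        elif (n, word) > heap[0]:
            # The new entry beats the weakest one kept so far: swap it in.
            heapq.heapreplace(heap, (n, word))

    return [(word, n) for n, word in sorted(heap, reverse=True)]


best = top_k(["a", "b", "a", "c", "b", "a"], 2)
```

Because the heap never holds more than k entries, each update costs O(log k), so the full list of words never has to be sorted.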

Question 3

You can use a ternary trie. Insertion is expensive, but you can skip the sorting step when you are only interested in the 5 most frequent words.
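For reference, a minimal sketch of a ternary trie (ternary search tree) with per-word counters. The class and method names are made up for illustration, and empty words are simply ignored here:

```python
class TSTNode:
    """Ternary trie node: one character, three links, and a word counter."""
    __slots__ = ("ch", "lo", "eq", "hi", "count")

    def __init__(self, ch):
        self.ch = ch
        self.lo = self.eq = self.hi = None   # smaller / next char / larger
        self.count = 0                       # occurrences of the word ending here


class TernaryTrie:
    def __init__(self):
        self.root = None

    def add(self, word):
        if word:                             # assumption: skip empty words
            self.root = self._add(self.root, word, 0)

    def _add(self, node, word, i):
        ch = word[i]
        if node is None:
            node = TSTNode(ch)
        if ch < node.ch:
            node.lo = self._add(node.lo, word, i)
        elif ch > node.ch:
            node.hi = self._add(node.hi, word, i)
        elif i + 1 < len(word):
            node.eq = self._add(node.eq, word, i + 1)
        else:
            node.count += 1
        return node

    def items(self):
        """Yield (word, count) in alphabetical order via an in-order walk."""
        def walk(node, prefix):
            if node is None:
                return
            yield from walk(node.lo, prefix)
            word = prefix + node.ch
            if node.count:
                yield word, node.count
            yield from walk(node.eq, word)
            yield from walk(node.hi, prefix)
        yield from walk(self.root, "")


tst = TernaryTrie()
for w in ["to", "tea", "to", "ten", "a"]:
    tst.add(w)
```

The in-order walk in `items()` returns the words already in alphabetical order, which is where the saved sort comes from. This sketch only shows the structure and the counters.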