Question

I created a dynamic C++ library which depends on some 30 MB or more of data. Now I'm trying to figure out the best way to store that data.

The data is essentially one big array with over a million elements.

I want installing/uninstalling the library to be as simple as possible. The library can be referenced by other interface programs, such as a terminal program, an R program, etc. It only needs to support UNIX.

One idea I had was to hardcode the data into one big array and compile that file into the library, but that doesn't seem like the correct or efficient way to do things. Also, if the file grows beyond 1 GB, things get out of hand.

Another idea I had was to copy the file with the data to a predefined path and hardcode a reference to that path in the library. However, some users don't want to install everything to the default installation path.

Another idea I had was to let each interface provide the path to the data file, but that seems like a hassle for the interface, and why should the interface know where the library's data is?

Is there any well-known practice for such a case?


Solution

I don't think there is one "right" answer to this.

Storing the data in the library file itself is fine, as long as the data doesn't change more often than you wish to release a new library. You need that amount of storage one way or another anyway, so as long as the compiler doesn't do a terrible job of storing the data in the shared library, it's no worse than any other option, as far as I can see.
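For illustration, one common way to do this is to generate a C array from the binary blob at build time, e.g. with `xxd -i`, and link the generated file into the library. The file and symbol names below are placeholders:

```cpp
#include <cstddef>

// Generated once at build time with:
//   xxd -i blob.bin > blob_data.c
// which produces roughly:
//   unsigned char blob_bin[]   = { 0x12, 0x34, ... };
//   unsigned int  blob_bin_len = 31457280;

// In the library, declare the generated symbols and expose accessors:
extern "C" {
    extern unsigned char blob_bin[];
    extern unsigned int  blob_bin_len;
}

const unsigned char* library_data()      { return blob_bin; }
std::size_t          library_data_size() { return blob_bin_len; }
```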

Having a secondary file is only useful if you expect the data to change more often than you wish to release a new shared library. It adds the extra complication of opening and reading the secondary file, and the drawback that you then also need code to check that the file is present and correct, and to deal with it not being there.
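A minimal sketch of that extra handling, assuming the data lives in a flat binary file whose expected size is known (a cheap sanity check that the file is intact):

```cpp
#include <fstream>
#include <stdexcept>
#include <string>
#include <vector>

// Load the whole data file, failing loudly if it is missing or truncated.
std::vector<char> load_data_file(const std::string& path, std::size_t expected_size)
{
    std::ifstream in(path, std::ios::binary | std::ios::ate);
    if (!in)
        throw std::runtime_error("data file not found: " + path);

    const std::size_t size = static_cast<std::size_t>(in.tellg());
    if (size != expected_size)
        throw std::runtime_error("data file has unexpected size: " + path);

    std::vector<char> buffer(size);
    in.seekg(0);
    if (!in.read(buffer.data(), static_cast<std::streamsize>(size)))
        throw std::runtime_error("failed to read data file: " + path);
    return buffer;
}
```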

If you do have a secondary file, having SOME way to redefine the location would definitely be beneficial.
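One conventional UNIX approach is to compile in a default install path but let an environment variable override it at run time. The variable name and default path here are just examples:

```cpp
#include <cstdlib>
#include <string>

// Compiled-in default location, overridable via an environment
// variable (MYLIB_DATA_PATH is a hypothetical name).
std::string data_file_path()
{
    if (const char* override_path = std::getenv("MYLIB_DATA_PATH"))
        return override_path;
    return "/usr/local/share/mylib/data.bin";  // compiled-in default
}
```

This keeps the interfaces (terminal program, R, etc.) out of the picture entirely: they never need to know where the data lives, and users who install to a non-default prefix just set the variable.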

If the data is really large, you may want to use a compressed format. You can still store the compressed data in your shared library, and use a compression library to expand it from there. Or you can use a library that reads from an external file...
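As a sketch, zlib can inflate a compressed blob embedded the same way as above. The `blob_zlib` symbols are assumed to come from a build step, and the uncompressed size must be recorded at build time:

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>
#include <zlib.h>

extern "C" {
    extern unsigned char blob_zlib[];      // compressed blob, embedded at build time
    extern unsigned int  blob_zlib_len;
}

// Inflate the embedded blob into a heap buffer.
std::vector<unsigned char> decompress_embedded(std::size_t uncompressed_size)
{
    std::vector<unsigned char> out(uncompressed_size);
    uLongf dest_len = static_cast<uLongf>(uncompressed_size);
    if (uncompress(out.data(), &dest_len, blob_zlib, blob_zlib_len) != Z_OK)
        throw std::runtime_error("failed to decompress embedded data");
    out.resize(dest_len);
    return out;
}
```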

In the end, it really comes down to:

  1. How you are using the data - do you always need ALL of it, or do you just need some of it at times? If the latter, how do you know which bits?
  2. How often the data changes.
  3. Whether the data can be compressed, and if so, by what method.

I'm not sure there are any direct size limits on a shared library. If you need 1 GB of data, then you need 1 GB of space in memory either way, so it's not like you are saving memory [assuming you always need ALL the data and/or can't determine which parts you need].
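If, on the other hand, you only need parts of the data at a time, memory-mapping an external data file lets the OS page in just the portions you actually touch. A minimal POSIX sketch, assuming for illustration that the array holds doubles:

```cpp
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>
#include <cstddef>
#include <stdexcept>
#include <string>

// Map the data file read-only; pages are loaded lazily on first access,
// so untouched parts of a large file never consume physical memory.
const double* map_data_file(const std::string& path, std::size_t& count)
{
    int fd = open(path.c_str(), O_RDONLY);
    if (fd < 0)
        throw std::runtime_error("cannot open data file: " + path);

    struct stat st;
    if (fstat(fd, &st) != 0) {
        close(fd);
        throw std::runtime_error("fstat failed for: " + path);
    }

    void* addr = mmap(nullptr, static_cast<std::size_t>(st.st_size),
                      PROT_READ, MAP_PRIVATE, fd, 0);
    close(fd);  // the mapping stays valid after close
    if (addr == MAP_FAILED)
        throw std::runtime_error("mmap failed for: " + path);

    count = static_cast<std::size_t>(st.st_size) / sizeof(double);
    return static_cast<const double*>(addr);
}
```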

OTHER TIPS

You can use a separate data file and save the data in it in a compressed binary format, then distribute that file and the dll/lib together.
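A sketch of that approach with zlib's gzip file API, assuming the data file was written with gzip compression:

```cpp
#include <stdexcept>
#include <string>
#include <vector>
#include <zlib.h>

// Read a gzip-compressed data file, decompressing on the fly.
std::vector<char> read_gz_data(const std::string& path)
{
    gzFile gz = gzopen(path.c_str(), "rb");
    if (!gz)
        throw std::runtime_error("cannot open compressed data file: " + path);

    std::vector<char> data;
    char chunk[1 << 16];
    int n;
    while ((n = gzread(gz, chunk, sizeof chunk)) > 0)
        data.insert(data.end(), chunk, chunk + n);
    gzclose(gz);

    if (n < 0)
        throw std::runtime_error("error while decompressing: " + path);
    return data;
}
```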

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow