Binary files printing and desired precision

https://stackoverflow.com/questions/2581662

24-09-2019
|

Question

I'm printing a variable say z1 which is a 1-D array containing floating point numbers to a text file so that I can import into Matlab or GNUPlot for plotting. I've heard that binary files (.dat) are smaller than .txt files. The definition that I currently use for printing to a .txt file is:

void create_out_file(const char *file_name, const long double *z1, size_t z_size){
FILE *out;
size_t i;
 if((out = _fsopen(file_name, "w+", _SH_DENYWR)) == NULL){
 fprintf(stderr, "***> Open error on output file %s", file_name);
 exit(-1);
 }
for(i = 0; i < z_size; i++)
fprintf(out, "%.16Le\n", z1[i]);
fclose(out);
}

I have three questions:

Are binary files really more compact than text files?;
If yes, I would like to know how to modify the above code so that I can print the values of the array z1 to a binary file. I've read that fprintf has to be replaced with fwrite. My output file say dodo.dat should contain the values of array z1 with one floating number per line.
I have %.16Le up in my code but I think that %.15Le is right as I have 15 precision digits with long double. I have put a dot (.) in the width position as I believe that this allows expansion to an arbitrary field to hold the desired number. Am I right? As an example with %.16Le, I can have an output like 1.0047914240730432e-002 which gives me 16 precision digits and the width of the field has the right width to display the number correctly. Is placing a dot (.) in the width position instead of a width value a good practice?

Thanks a lot...

UPDATE Is changing to:

for(i = 0; i < z_size; i++)
fwrite(&z1, sizeof(long double), 1, out);

ok in addition to the change to "wb+" ? I can't read the binary file in Matlab.

Solution

yes, binary files are more compact, but you lose portability and there are various other potential problems too, so unless your data files are problematically huge, or slow to export/import, it's a good idea to stick with text if you can (you can always compress them for storage, e.g. with zip)
open you file with "wb" instead of "w" and use fwrite() - you no longer have "lines" in your file - it will just be a stream of (binary) floating point values
you may be getting confused between double and long double - a long double can be up to 16 bytes in size and have a precision of up to around 32 digits (however this is implementation-dependent - long double can commonly be 10, 12 or 16 bytes). A double is usually 8 bytes and has a precision of around 16 digits.

MATLAB may not be able to cope with long double (as it is not well standardized) so you probably just want to write doubles to your data file, e.g.

for (i = 0; i < z_size; i++)
{
    double z = (double)z1[i];
    fwrite(&z, sizeof(double), 1, out);
}

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow