Question

I'm trying to read some data from a file to fill the following data structures:

typedef uint64_t t_feat; 

typedef pair<vector<t_feat>, int> t_featfam;

Each log file contains several such families, so I want to save them all in a vector. The log files have a very simple format:

line = "-": start a new family

line = "#": family ends here

line = 64bit unsigned integer (as string): add this value to the family

line = "!": mark the following integer as important (exactly one is marked like this in each family); the marking is done by setting the second value of the family to the index of the important element

There are no mistakes in the files, so every ! is followed by an integer, all families start and end properly, and there are no additional spaces or anything (the only exception is a possible empty line at the end of the file).
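For concreteness, a (hypothetical) single family in this format would look like:

```
-
12345
!
67890
11111
#
```

Here the ! marks 67890, so the family's second value becomes 1 (the index of 67890 in the vector).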

Right now I'm using the following code:

void read_data_from_file(const string &fname, vector<t_featfam> &data)
{
    ifstream f;
    f.open(fname, ios::in);
    while (!f.eof())
    {
        string currentline;
        getline(f, currentline);
        if (currentline == "" || currentline == "#")
            continue;
        else if (currentline == "-")
            data.push_back(t_featfam());
        else if (currentline == "!")
            data.back().second = data.back().first.size();
        else
        {
            istringstream iss(currentline);
            t_feat value;
            iss >> value;
            data.back().first.push_back(value);
        }
    }
}

This works, but it feels horribly inefficient, and it probably is... If it were just numbers, I would certainly use only fstreams, but as it is, I'm not sure how to do that properly. Can anyone point me in the right direction? This should be possible somehow. I'm using Visual Studio and don't mind VS-specific solutions, but I don't want to include Boost.

edit2: here is a version that really works, using Steve's code and improved with the ideas from luk32... 4 times faster than the code above...

void read_data_from_file(const string &fname, vector<t_featfam> &data)
{
    ifstream f;
    f.open(fname, ios::in);
    char* currentline = new char [30];
    while (!f.eof())
    {
        f.getline(currentline, 30);
        switch (currentline[0])
        {
        case '\0':
        case '#':
            break;
        case '-':
            data.push_back(t_featfam());
            break;
        case '!':
            data.back().second = data.back().first.size();
            break;
        default:
            data.back().first.push_back(stoull(currentline));
            break;
        }
    }
    delete[] currentline; // must be delete[], since it was allocated with new[]
}

Solution

I would probably do something along the lines of the following:

  • move currentline outside the loop - prevents reallocs every time around the loop
  • use a switch statement on the first char of currentline so we jump instead of multiple if/else statements
  • use std::stoull instead of a stringstream to convert currentline to uint64_t

Here's the function (not tested to see if it compiles, just wrote it up)

void read_data_from_file(const string &fname, vector<t_featfam> &data)
{
    ifstream f;
    f.open(fname, ios::in);
    string currentline;
    while (!f.eof())
    {
        getline(f, currentline);
        switch (currentline.c_str()[0])
        {
            case '\0':
            case '#':
                break;
            case '-':
                data.push_back(t_featfam());
                break;
            case '!':
                data.back().second = data.back().first.size();
                break;
            default:
                data.back().first.push_back(std::stoull(currentline));
                break;
        }
    }
}

OTHER TIPS

Most of the time is lost in memory allocations. You have one allocation when you call getline() and another when you construct the istringstream. Each allocation costs roughly 250 cycles on my system, so you can save roughly 500 cycles per line that you read.

You can eliminate the allocations altogether if you use mmap() to map the entire file into your address space. Once you have everything in a single large array of char, you can relatively easily parse it without any need to copy lines out of that large array.
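A minimal sketch of that mmap() approach, assuming a POSIX system (on Windows the equivalent would be CreateFileMapping/MapViewOfFile) and the typedefs from the question; error handling is kept to a minimum:

```cpp
#include <cstdint>
#include <cstdlib>
#include <string>
#include <utility>
#include <vector>
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

typedef uint64_t t_feat;
typedef std::pair<std::vector<t_feat>, int> t_featfam;

void read_data_mmap(const std::string &fname, std::vector<t_featfam> &data)
{
    int fd = open(fname.c_str(), O_RDONLY);
    if (fd < 0) return;
    struct stat st;
    fstat(fd, &st);
    size_t len = (size_t)st.st_size;
    const char *buf = (const char *)mmap(nullptr, len, PROT_READ, MAP_PRIVATE, fd, 0);
    close(fd);
    if (buf == MAP_FAILED) return;

    const char *p = buf, *end = buf + len;
    while (p < end)
    {
        switch (*p)
        {
        case '\n':                 // empty line
            ++p;
            continue;
        case '#':                  // family ends here
            break;
        case '-':                  // start a new family
            data.push_back(t_featfam());
            break;
        case '!':                  // next value is the important one
            data.back().second = (int)data.back().first.size();
            break;
        default:                   // a number: parse it in place, no copy
        {
            // strtoull stops at the newline; this relies on the format's
            // guarantee that a non-digit follows every number
            char *num_end;
            data.back().first.push_back(std::strtoull(p, &num_end, 10));
            p = num_end;
        }
        }
        while (p < end && *p != '\n') ++p;  // skip past the rest of the line
        ++p;
    }
    munmap((void *)buf, len);
}
```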

How long does it take to process the entire file? If speed is really an issue here, and since you should have plenty of memory if you're running this program on a PC, you could use the equivalent of seek-to-end, tell, seek-to-beginning to get the size of the log file, allocate that much memory, then read the entire file into one large buffer. Then use memchr() to scan for every occurrence of "-" to determine the number of pairs, optionally create an array of pointers (pre-allocated based on the maximum number of pairs for a given file size), and do a one-time resize of the vector of pairs (or a one-time new if using a pointer to a vector of pairs). Then parse the buffer a second time to fill in the pairs via index or iterator instead of push_back(). Although this method scans the file buffer twice, it may compensate by avoiding the internal dynamic resizes that a series of push_back() calls would cause.
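The steps above can be sketched roughly as follows (same typedefs as in the question; note that counting '-' bytes with memchr() only works because the format guarantees '-' appears solely as the family marker, the values being unsigned):

```cpp
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <fstream>
#include <string>
#include <utility>
#include <vector>

typedef uint64_t t_feat;
typedef std::pair<std::vector<t_feat>, int> t_featfam;

void read_data_buffered(const std::string &fname, std::vector<t_featfam> &data)
{
    std::ifstream f(fname, std::ios::binary);
    f.seekg(0, std::ios::end);
    std::streamsize size = f.tellg();          // seek to end + tell = file size
    f.seekg(0, std::ios::beg);
    if (!f || size <= 0) return;

    std::vector<char> buf((size_t)size + 1);   // one big read, +1 for terminator
    f.read(buf.data(), size);
    buf[(size_t)size] = '\0';                  // so strtoull cannot run off the end

    // first pass: count '-' markers with memchr to size the vector once
    size_t families = 0;
    for (const char *q = buf.data(), *e = buf.data() + size;
         (q = (const char *)std::memchr(q, '-', (size_t)(e - q))) != nullptr; ++q)
        ++families;
    data.reserve(data.size() + families);

    // second pass: parse the buffer in place
    const char *p = buf.data(), *end = buf.data() + size;
    while (p < end)
    {
        switch (*p)
        {
        case '\n': ++p; continue;              // empty line
        case '#': break;                       // family ends here
        case '-': data.push_back(t_featfam()); break;
        case '!': data.back().second = (int)data.back().first.size(); break;
        default:                               // a number
        {
            char *num_end;
            data.back().first.push_back(std::strtoull(p, &num_end, 10));
            p = num_end;
        }
        }
        while (p < end && *p != '\n') ++p;     // skip past the rest of the line
        ++p;
    }
}
```

This sketch uses reserve() on the outer vector only; going further, as suggested above, one could also pre-size each family's inner vector from the counts gathered in the first pass.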

Another option would be to put the pair count at the start or end of the log file, which would eliminate the first scan used to get the number of pairs. If you have an idea of the maximum file size, you could simply allocate enough memory to handle the largest expected log file, which would eliminate having to determine the file size before allocating.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow