Multiple read() operations on the same file

Question 1

while (read(fd, &tmp, sizeof(tmp)) == sizeof(tmp))
{
    ...got another one...
}

It is conventional to use FILE *fp; and int fd; (so the name for a file descriptor is fd and not fp).

The read() function returns the number of bytes it read. If there's no more data, it returns 0. For disk files and the like, it will return the requested number of bytes (except at the very end when there might not be that many bytes left to read) or 0 when there's no data left to read (or -1 if there's an error on the device rather than just no more data). For terminals (and sockets, and pipes), it will read as many bytes as are available rather than wait for the requested size (so each read could return a different size). The code shown only reads full-size structures and baulks if it gets a short read, EOF or an error.

The code by ensc in his answer covers all practical circumstances, but isn't the way I'd write the equivalent loop. I'd use:

struct foo tmp;
ssize_t nbytes;

while ((nbytes = read(fd, &tmp, sizeof(tmp))) != 0)
{
    if ((size_t)nbytes = sizeof(tmp))
        process(&tmp);
    else if (nbytes < 0 && errno == EINTR)
        continue;
    else if (nbytes > 0)
        err_syserr("Short read of %zu bytes when %zu expected on fd %d\n",
                   nbytes, sizeof(tmp), fd);
    else
        err_syserr("Read failure on fd %d\n", fd);
}

The two normal cases — a full length record is read OK and EOF is detected — are handled at the top of the loop; the esoteric cases are handled further down the loop. My err_syserr() function is printf()-like and reports the error given by its arguments, and also the error associated with errno if it is non-zero, and then exits. You can use any equivalent mechanism. I might or might not put the file descriptor number in the error message; it depends on who is going to see the errors. If I knew the file name, I'd certainly include that in the message in preference to the file descriptor.

I don't see any difficulty handling the nbytes == -1 && errno == EINTR case, contrary to comments by @ensc.

Question 2

read returns the number of bytes read. If you perform a read, and the return value is less than the number of bytes you requested, then you know it reached EOF during that read. If it exactly equals the requested number of bytes, then either the file has not reached EOF, or it did, and there are exactly 0 bytes left in the file, in which case the next call to read() will return 0.

while(read(fd, &tmp, sizeof(tmp)) > 0) {
    ...
}

Question 3

Ignoring error conditions, I think this is the basic idea:

while (read(fp, &tmp, sizeof(struct foo))==sizeof(struct foo))
    new_node(tmp);

Question 4

for (;;) {
    struct foo tmp;
    ssize_t l = read(fd, &tmp, sizeof tmp);

    if (l < 0 && errno == EINTR) {
        continue;
    } else if (l < 0) {
        perror("read()");
        abort();
    } else if (l == 0) {
        break;   /* eof condition */
    } else if ((size_t)(l) != sizeof tmp) {
        abort(); /* something odd happened */
    } else {
        handle(&tmp);
    }
}

EDIT:

In my projects I use a generic

bool read_all(int fd, void *dst_, size_t len, bool *is_err)
{
        unsigned char *dst = dst_;

        *is_err = false;

        while (len > 0) {
                ssize_t l = read(fd, dst, len);

                if (l > 0) {
                        dst += l;
                        len -= l;
                } else if (l == 0) {
                        com_err("read_all", 0, "read(): EOF");
                        *is_err = (void *)dst != dst_;
                        break;
                } else if (errno == EINTR) {
                        continue;
                } else {
                        com_err("read_all", errno, "read()");
                        *is_err = true;
                        break;
                }
        }

        return len == 0;
}

function. Because I prefer the approach to say how much elements are to be read, an EOF is handled as an error here. But it would be trivial to add another bool *err argument to the function which is set in the non-EOF error case. You can use above as

while (read_all(fd, &tmp, sizeof tmp, &is_err))
    new_node(&tmp);