Question

I was thinking of ways to keep my laptop HDD backed up safely while still being able to put the backup into service quickly if needed. My plan is the following: I buy a 2.5" HDD of the same size plus a USB-to-SATA cable and clone the internal drive to it. When disaster strikes, I just swap the HDD in the laptop for the clone and I am good to go again. However, I would like to avoid writing the full 500 GB every time I back up, especially since I know that a fair part of the drive (roughly 80 GB) is rarely written to. This is where the following md5sum/dd script comes to the rescue, I hope:

#!/bin/bash
# Compare source and destination in chunks of $count blocks of $block each,
# and rewrite only the chunks whose checksums differ.
block="1M"
end=50000      # number of chunks to check
count=10       # blocks per chunk (10 x 1M = 10M per chunk)

input="/dev/sda"

output="/dev/sdb"
#output="/path/to/imagefile"   # alternative target: an image file

function md5compute()
{
    # Checksum $count blocks of $1, starting at block offset $2.
    dd if="$1" skip="$2" bs="$block" count="$count" 2>/dev/null | md5sum - | awk '{ print $1 }'
}

for ((i=0; i<=end; i++))       # note: {0..$end} does not expand with a variable
do
    start=$((i * count))
    md5source=$(md5compute "$input" "$start")
    md5destination=$(md5compute "$output" "$start")
    if [ "$md5source" != "$md5destination" ]
    then
        # Copy only this chunk; bs must match the checksum block size.
        dd if="$input" of="$output" bs="$block" skip="$start" seek="$start" count="$count" conv=sync,noerror,notrunc
    fi
done

Now, the question part:

A) By running this, would I miss some part of the disk? Do you see any flaws?

B) Would I save some time compared to the full 500 GB read/write?

C) Obviously I potentially write less to the target disk. Will I improve the lifetime of that disk?

D) I was thinking of setting count to 1 and increasing the block size instead. Is this a good or a bad idea?

E) Would this same script work with an image file as output?

I am not very fluent in programming, so there is probably plenty of room for improvement. Any tips?

Thank you all...


Solution

Point by point answer:

  1. By running this, would I miss some part of the disk?

    • No.
  2. Do you see any flaws?

    • When the drives differ, this approach implies a double read at the source and a full read before the write at the destination, which mostly increases the backup time.
    • There is a small probability of the MD5 sums matching even though the chunks differ. That probability can be reduced by using SHA-1, SHA-256 or another stronger checksum algorithm, at the cost of more resources on both ends (see the birthday problem on Wikipedia). A sketch of such a change is shown after this list.
  3. Would I save some time compared to the full 500 GB read/write?

    • If both drives are already identical, yes, because reading is generally quicker than writing. (Depending on the processor, the checksum computation can also be significant on very weak CPUs.)
  4. Obviously I potentially write less to the target disk. Will I improve the lifetime of that disk?

    • In this case, yes: writing only the differences runs a lot faster and really improves the disk's lifetime.
    • When the disks differ a lot, you end up rewriting most of the disk anyway, which is not efficient!
  5. I was thinking of setting count to 1 and increasing the block size instead. Is this a good or a bad idea?

    • Overall I find this a bad idea. Why reinvent the wheel?
  6. Would this same script work with an image file as output?

    • Yes.
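
For reference, here is a minimal sketch (not part of the original script) of the question's checksum helper switched from md5sum to sha256sum; block and count are the variables already defined at the top of the script:

# Hedged sketch: same chunk checksum as md5compute, but using SHA-256
# to make an accidental collision far less likely, at some extra CPU cost.
function sha256compute()
{
    dd if="$1" skip="$2" bs="$block" count="$count" 2>/dev/null | sha256sum - | awk '{ print $1 }'
}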

Functional answer

For jobs like this, you may use rsync! With this tool you can (see the example after the list):

  • Compress data during transfer
  • Copy over the network
  • Tunnel through SSH (or not)
  • Transfer (and write) only modified blocks
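
A minimal sketch of such an rsync call, assuming a file-level backup of a mounted filesystem rather than a raw device clone; the source path and destination host below are placeholders, not from the original answer:

# -a        archive mode: preserve permissions, ownership, timestamps, symlinks
# -z        compress data during transfer
# -e ssh    tunnel the transfer through SSH
# --delete  remove files on the destination that no longer exist on the source
rsync -az --delete -e ssh /home/ user@backuphost:/backups/laptop-home/

Over a network link, rsync's delta algorithm sends only the changed parts of each file, which is what the last point above refers to.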

Using ssh, dd and sha1sum

Here is the kind of command I run sometimes:

ssh $USER@$SOURCE "dd if=$SRCPATH/$SRCDEV |tee >(sha1sum >/dev/stderr);sleep 1" |
    tee >(sha1sum >/dev/tty) | dd of=$LOCALPATH/$LOCALDEV

This does a full read on the source host and computes a sha1sum of the stream before sending it to the local host (the destination), then computes a second sha1sum on the received stream before writing it to the local device, so the two sums can be compared to verify the transfer.

This may output something like:

2998920+0 records in
2998920+0 records out
1535447040 bytes (1.4 GiB) copied, 81.42039 s, 18.3 MB/s
d61c645ab2c561eb10eb31f12fbd6a7e6f42bf11  -
d61c645ab2c561eb10eb31f12fbd6a7e6f42bf11  -
2998920+0 records in
2998920+0 records out
1535447040 bytes (1.4 GiB) copied, 81.42039 s, 18.3 MB/s
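
A minimal sketch of a variation (not from the original answer) that writes both checksums to files so the comparison can be scripted rather than read off the terminal; the uppercase variables are the same placeholders as above, and /tmp/remote.sha1 and /tmp/local.sha1 are hypothetical file names:

# Same pipeline, but each sha1sum goes to a file instead of the terminal.
ssh $USER@$SOURCE "dd if=$SRCPATH/$SRCDEV |tee >(sha1sum >/tmp/remote.sha1);sleep 1" |
    tee >(sha1sum >/tmp/local.sha1) | dd of=$LOCALPATH/$LOCALDEV
sleep 1   # give the local sha1sum process a moment to finish writing its file
scp $USER@$SOURCE:/tmp/remote.sha1 /tmp/remote.sha1
# Compare only the hash fields and report whether the copy arrived intact.
if [ "$(cut -d' ' -f1 /tmp/local.sha1)" = "$(cut -d' ' -f1 /tmp/remote.sha1)" ]
then
    echo "transfer verified"
else
    echo "checksum mismatch" >&2
fi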