Skip to content
This repository has been archived by the owner on Jun 13, 2024. It is now read-only.

MD5 checksum computing taking an unexpectedly long time #15

Open
colemickens opened this issue Apr 1, 2016 · 1 comment
Open

MD5 checksum computing taking an unexpectedly long time #15

colemickens opened this issue Apr 1, 2016 · 1 comment

Comments

@colemickens
Copy link
Contributor

I have created a VHD that is ~500GB but it's less than 1GB on disk currently.

$ du -hs disk.vhd
943M disk.vhd

$ azure-vhd-utils-for-go inspect footer --path disk.vhd
[...]
PhysicalSize      : 536879692800 bytes
VirtualSize       : 536879692800 bytes
[...]

This is what I'm seeing right now as it's computing the MD5 Checksum...

Computing MD5 Checksum..
Completed:  10% RemainingTime: 00h:14m:34s Throughput: 4197 MB/sec
536879MB / 4197MB/s = ~127 s

I'm not really sure what's going on with this. Is the throughput value wrong? Is my math wrong?

@anuchandy
Copy link
Member

Looks like the throughput calculation is wrong.

The MD5 calculation is usually slow as we can have only one goroutine to read the vhd file. Also we cannot write to MD5 writer ("crypto/md5") from multiple routine.

In case of upload, we use multiple routines for uploading the pages though there is only one routine reading the vhd file.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants