Improving on Gear Hashing with FastCDC

from blog Joshleeb, | ↗ original
In the last post we took a look at Gear Hashing for Content-Defined Chunking (CDC), and outlined some problems we noticed when analyzing the distribution of generated chunks, namely the lack of uniformity in the chunk sizes, and that a large portion of chunks were much smaller than our desired chunk size. In this post, we’ll go through the...