[LUNI] File duplicate finder contest!

Gene Jannece gene at sevatech.com
Mon Dec 18 12:18:32 CST 2006


It looks like there are just under 500,000 files in the data set.
Martin Maney wrote:
> On Sat, Dec 16, 2006 at 07:22:33PM -0600, Gene Jannece wrote:
>   
> Probably more important to know how *many* files there are in the data
> set.  A ballpark figure for average path length and directory size
> (count of entries, not bytes), too, since you imply that some tools
> have run out of memory processing this collection, and these are the
> parameters that will affect the space used during the early stages of
> processing (or know that you can't do the first pass the easy way, and
> revise it to accommodate that).
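
For anyone who wants to collect the figures asked about above (file count, average path length, entries per directory), here is a minimal sketch in Python. It is only an illustration, not something used in this thread: the function name tree_stats and the keys of the returned dict are made up for the example, and it assumes the whole tree can be walked in one pass with os.walk.

```python
import os


def tree_stats(root):
    """Walk `root` once and summarise file and directory counts.

    Hypothetical helper for illustration; only running totals are kept,
    so the walk itself does not hold every path in memory.
    """
    files = 0            # regular entries reported as files by os.walk
    dirs = 0             # directories visited, including root
    total_path_len = 0   # sum of full path lengths, in characters
    total_entries = 0    # sum of entry counts over all directories
    max_entries = 0      # largest single directory, by entry count

    for dirpath, dirnames, filenames in os.walk(root):
        entries = len(dirnames) + len(filenames)
        dirs += 1
        total_entries += entries
        max_entries = max(max_entries, entries)
        for name in filenames:
            files += 1
            total_path_len += len(os.path.join(dirpath, name))

    return {
        "files": files,
        "dirs": dirs,
        "avg_path_len": total_path_len / files if files else 0.0,
        "avg_dir_entries": total_entries / dirs if dirs else 0.0,
        "max_dir_entries": max_entries,
    }


if __name__ == "__main__":
    import sys
    for key, value in tree_stats(sys.argv[1] if len(sys.argv) > 1 else ".").items():
        print(key, value)
```

Multiplying "files" by "avg_path_len" gives a rough lower bound on the space needed just to hold every path at once, before any per-object overhead, which is the sort of ballpark the quoted message is after.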


