Tag: deduplication
-
Deleting about 24,000 files takes about 5 minutes
$ wc -l zz*txt 24028 zz-discard-paths.txt 78968 zz-keep-paths.txt 102996 total
-
Deduplication, continued.
OK, so what started out as a bash script grew into a rather finicky Perl script. I used a bunch of parallel hashes, judging things by combinations duplicate names, identical file sizes and actually scoring the path name and taking the highest score. I ended up no using the file hashes, because I decided that…
-
Sorting through thousands of photos
Backups are great. Having terabytes of of space for them is now completely necessary. Filling up those terabytes is …frustrating. My photo collection is probably much like many photo enthusiast’s — well into the hundreds of thousands of pictures. But why is it so? I make thumbnails (that’s 2x pictures), I keep a low-res and…