File System Thots

A brief experiment in calculating a histogram of file sizes:

$ find -type f -size -128000c -printf "%k\n" \
| sort -n \
| perl -ne 'BEGIN{ %h=(); } 
{chomp $_; $h{$_}++;} 
END { foreach my $k (sort {$h{$a} <=> $h{$b}} keys(%h)) { 
      print "$k $h{$k}\n"; }}'
137 3
145 3
121 3
129 5
113 7
25 10
105 14
97 21
89 29
81 35
73 38
65 60
57 92
49 165
1 221
41 317
33 781
9 4220

When NOT to Use a SAN?

The vast number of filesystems, distributed file systems, and network replication schemes available for Linux all come with learning curves and caveats. When does one actually want to avoid merely picking out an average SAN solution and go with a Linux oriented distributed storage solution like Gluster or NFS on DRBD?