Yes, you can use find to look for non-executable files of the right size and then use file to check for ASCII. Something like:
find . -type f -size 1033c ! -executable -exec file {} + | grep ASCII
The question, however, isn't as simple as it sounds. 'Human readable' is a horribly vague term. Presumably, you mean text. OK, but what kind of text? Latin-character ASCII only? Full Unicode? For example, consider these four files:
$ cat file1
abcde
$ cat file2
αβγδε
$ cat file3
abcde
αβγδε
$ cat file4
#!/bin/sh
echo foo
These are all text and human readable. Now, let's see what file makes of them:
$ file *
file1: ASCII text
file2: UTF-8 Unicode text
file3: UTF-8 Unicode text
file4: POSIX shell script, ASCII text executable
So, the find command above will only find file1 (for the sake of this example, let's imagine those files had 1033 characters). You could expand the find to look for the string text:
find . -type f -size 1033c ! -executable -exec file {} + | grep -w text
With the -w option, grep will only print lines where text is found as a stand-alone word. That should be pretty close to what you want, but I can't guarantee that there is no other file type whose description might also include the string text.
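If your version of file supports it, filtering on MIME types is less fragile than grepping the human-readable description; a sketch, assuming GNU file with the --mime-type option:

```shell
# Every textual type reports a MIME type with a text/ prefix
# (text/plain, text/x-shellscript, ...), so one match covers them all.
find . -type f -size 1033c ! -executable -exec file --mime-type {} + | grep ': text/'
```

Note this would also report file2 and file3 from the example above (text/plain), which may or may not be what you want.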
There's no built-in tar option for this, but you can filter its output. For example, using the Python humanize module:
#!/usr/bin/env python3
import fileinput
import humanize  # third-party: pip install humanize

for line in fileinput.input():
    # tar tvf columns: permissions owner/group size date time filename;
    # maxsplit=5 keeps filenames containing spaces intact
    perm, owner, size, date, time, filename = line.rstrip('\n').split(None, 5)
    print('{0} {1} {2:>9} {3} {4} {5}'.format(
        perm, owner, humanize.naturalsize(int(size), gnu=True), date, time, filename))
Save this as e.g. humantvf, make it executable with chmod +x humantvf, then:
tar tvf ... | ./humantvf
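If installing humanize isn't an option, here is a dependency-free sketch of the same filter; the naturalsize helper below is hand-rolled for illustration, not part of any library:

```python
import sys

def naturalsize(n):
    """Minimal GNU-style size formatter (hand-rolled, not a library function)."""
    n = float(n)
    for unit in ('', 'K', 'M', 'G', 'T'):
        if abs(n) < 1024:
            # bare byte counts stay integral; scaled values get one decimal
            return '%d' % n if not unit else '%.1f%s' % (n, unit)
        n /= 1024
    return '%.1fP' % n

def human_tvf(lines):
    """Reformat `tar tvf` lines with a human-readable size column."""
    for line in lines:
        perm, owner, size, date, time, name = line.rstrip('\n').split(None, 5)
        yield '{0} {1} {2:>9} {3} {4} {5}'.format(
            perm, owner, naturalsize(size), date, time, name)

# Usage as a filter:
#   for out in human_tvf(sys.stdin):
#       print(out)
```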
Since you're already using GNU tools, see numfmt from GNU coreutils. There are also alternatives with ksh93, or with the ls builtin (which you can also get as a standalone utility in the ast-open package).
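The numfmt invocation itself wasn't shown above; a minimal sketch of the idea, assuming the size is the third whitespace-delimited field of tar tvf output (archive.tar is a placeholder name):

```shell
# GNU numfmt can rewrite a single field in place:
# --field=3 selects the size column, --to=iec renders it as K/M/G...
tar tvf archive.tar | numfmt --field=3 --to=iec
```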