Why Can Spaces Between Options and Parameters Be Omitted?

options

For example:

xargs -n 1

is the same as

xargs -n1

But if you look at the man page, the option is listed as -n max-args, which means the space is supposed to be preserved. There is nothing about the abbreviated form -nmax-args.

This also happens with many other Linux utilities.

What is this called in Linux? Do all utilities support the abbreviated form (but never document it in the man page)?

Best Answer

When you write the command line parsing bit of your code, you specify what options take arguments and which ones do not. For example, in a shell script accepting an -h option (for help for example) and an -a option that should take an argument, you do

opt_h=0     # default value
opt_a=""

while getopts 'a:h' opt; do
    case $opt in
        h)  opt_h=1 ;;
        a)  opt_a="$OPTARG" ;;
    esac
done

echo "h: $opt_h"
echo "a: $opt_a"

The a:h bit says "I'm expecting to parse two options, -a and -h, and -a should take an argument" (it's the : after a that tells the parser that -a takes a argument).

Therefore, there is never any ambiguity in where an option ends, where its value starts, and where another one starts after that.

Running it:

$ bash test.sh -h -a hello
h: 1
a: hello

$ bash test.sh -h -ahello
h: 1
a: hello

$ bash test.sh -hahello
h: 1
a: hello

This is why you most of the time shouldn't write your own command line parser to parse options.

There is only one case in this example that is tricky. The parsing usually stops at the first non-option, so when you have stuff on the command line that looks like options:

$ bash test.sh -a hello -world
test.sh: illegal option -- w
test.sh: illegal option -- o
test.sh: illegal option -- r
test.sh: illegal option -- l
test.sh: illegal option -- d
h: 0
a: hello

The following solves that:

$ bash test.sh -a hello -- -world
h: 0
a: hello

The -- signals an end of command line options, and the -world bit is left for the program to do whatever it wants with (it's in one of the positional variables).

That is, by the way, how you remove a file that has a dash in the start of its file name with rm.

EDIT:

Utilities written in C call getopt() (declared in unistd.h) which works pretty much the same way. In fact, for all we know, the bash function getopts may be implemented using a call to the C library function getopt(). Perl, Python and other languages have similar command line parsing libraries, and it is most likely that they perform their parsing in similar ways.

Some of these getopt and getopt-like library routines also handle "long" options. These are usually preceded by double-dash (--), and long options that takes arguments often does so after an equal sign, for example the --block-size=SIZE option of [some implementations of] the du utility (which also allows for -B SIZE to specify the same thing).

The reason manuals are often written to show a space in between the short options and their arguments is probably for readability.

EDIT: Really old tools, such as the dd and tar utilities, have options without dashes in front of them. This is purely for historical reasons and for maintaining compatibility with software that relies on them to work in exactly that way. The tar utility has gained the ability to take options with dashes in more recent times. The BSD manual for tar calls the old-style options for "bundled flags".

Related Solutions

Linux – “userdel –root?” supposed to do

Here's the version from my English manpage:

 -R, --root CHROOT_DIR
     Apply changes in the CHROOT_DIR directory and use the
     configuration files from the CHROOT_DIR directory.

In other words, instead of editing /etc/passwd and friends, you're editing CHROOT_DIR/etc/passwd.

For example, you might boot a live CD, mount the hard drive as /mnt, and then use -R /mnt to edit its users.

Shell – Where is the `–` (double dash) argument documented

This is a POSIX requirement for all utilities, see POSIX chapter 12.02, Guideline 10 for more information:

The first -- argument that is not an option-argument should be accepted as a delimiter indicating the end of options. Any following arguments should be treated as operands, even if they begin with the '-' character.

POSIX recommends all utilities to follow these guidelines.

There are a few exceptions like echo (read at OPTIONS). And special builtins that do not follow the guidelines (like break, dot, exec, etc.) :

Some of the special built-ins are described as conforming to XBD Utility Syntax Guidelines. For those that are not, the requirement in Utility Description Defaults that "--" be recognized as a first argument to be discarded does not apply and a conforming application shall not use that argument.

The intent is to document all commands that do not follow the guidelines in their POSIX man page, from POSIX chapter 12.02 third paragraph:

Some of the standard utilities do not conform to all of these guidelines; in those cases, the OPTIONS sections describe the deviations.

As the cat POSIX man page documents no deviations in the OPTIONS section, it is expected that it accept -- as a valid argument.

There still may be (faulty) implementations that fail to follow the guideline.

In particular, most GNU core utilities follow this guideline

Best Answer

Related Solutions

Linux – “userdel –root?” supposed to do

Shell – Where is the `–` (double dash) argument documented

Related Question