Linux sort and cut multiple columns

cutsort

I have the following file named "info":

White:73:Mars:1543:Manuel
Green:17:Jupiter:1968:Sebastian
Blue:24:Venus:1970:Anna
Red:35:Neptune:1122:Javier
Yellow:135:Earth:1234:Raymond

I need to use cut and sort, to show only the columns with planest and names, sorted. This means I have to be left with:

Earth:Anna
Jupiter:Javier
Mars:Manuel
Neptune:Raymond
Venus:Sebastian

I tried using
cut -d: -f3,5 info | sort -t: -k1,1 -k2,2
but it only sorted the first column and not the second.

I also tried
cut -d: -f3,5 info | sort -t: -k1,1 -k2,2 | sort -t: -k2,2
but this only sorted the second column.

Any and all help is appreciated

Best Answer

Sorting columns Individually:

paste -d: <(cut -d: -f3 info | sort) <(cut -d: -f5 info | sort)
Earth:Anna
Jupiter:Javier
Mars:Manuel
Neptune:Raymond
Venus:Sebastian

Related Solutions

Unix sort by multiple columns

No, -k1,2 says to sort on the portion of the line that starts at the beginning of the first field and ends at the end of the second field.

To sort on the first field and then on the second, it's:

sort -k1,1 -k2,2

Choose columns with sort and cut in a csv with a comma delimiter ‘,’ ignoring data on quotes with comma “text,text”

CSV is a structured document format. As such, simple text manipulation tools like cut (or sort, sed, or awk, unless the data is simple) are inadequate for processing CSV files safely and conveniently (because fields may contain embedded delimiters and newlines). Instead, it would be best if you were using a CSV-aware processing tool such as Miller (mlr).

The following Miller command parses the file as a header-less CSV file, sorting it numerically ascending by its 12th field:

mlr --csv -N sort -n 12 file

If you have headers in your CSV data, drop the -N option and use the header name in place of 12, e.g.,

mlr --cvs sort -n pvalue file

To extract column 12,

mlr --csv -N cut -f 12 file

To sort and cut, and also only get the 10 first results,

mlr --csv -N sort -n 12 then cut -f 12 then head -n 10 file

Again, drop the -N and use the field names if you have headers in the input.

With the csvkit toolkit, you could use csvsort to get the same result like so:

csvsort -H -c 12 file | tail -n +2

(the tail command removes the headers that csvsort generates), or, with headers in the input,

csvsort -c pvalue file

Extracting individual fields with csvcut:

csvcut -H -c 12 file

Combined with csvsort:

csvsort -H -c 12 file | csvcut -c 12 | head -n +2

Or, with headers,

csvsort -c pvalue file | csvcut -c pvalue

There is no csvhead command, so limiting the resutl to 10 records will have to be doen some other way, possibly through mlr --csv head -n 10.

Best Answer

Related Solutions

Unix sort by multiple columns

Choose columns with sort and cut in a csv with a comma delimiter ‘,’ ignoring data on quotes with comma “text,text”

Related Question