Split column after nth character

awk sed

I'm trying to split the second column in the file below after its first 4 characters.

file.txt

>1A    THISISATEST
>1B    THATISATEST

desired output:

>1A    THIS    ISATEST
>1B    THAT    ISATEST

After searching and adapting what I found, I tried this sed command: sed 's/(.{4})(.{7}).*/\2 \3/' file.txt. However, I can't get it to work. Am I missing something? An awk suggestion would also be helpful. Please explain your suggestions, as I'm still learning awk and sed.

Best Answer

Here is a solution with awk. It stores the first four characters of the 2nd column and the rest of that column in two variables, then prints them after the first column.

$ awk '{s=substr($2,1,4); g=substr($2,5); print $1,s,g}' file.txt
>1A THIS ISATEST
>1B THAT ISATEST
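
As for the sed attempt in the question, it has three problems. Without -E (or backslash-escaped \( \) and \{ \}), the parentheses and braces are matched as literal characters. \3 refers to a third group that the pattern never defines. And the pattern is not tied to the second column, so it would start matching at the beginning of the line. Below is a minimal sketch of one way to fix it, assuming the columns are separated by spaces or tabs and that your sed accepts -E (GNU and BSD sed do). The function name split_col2 is made up for illustration.

```shell
# Hypothetical helper: insert four spaces after the first 4 characters of column 2.
# Group 1 = the first column plus the whitespace after it.
# Group 2 = the next 4 characters, i.e. the start of the second column.
split_col2() {
    sed -E 's/^([^[:space:]]+[[:space:]]+)(.{4})/\1\2    /' "$@"
}

# Feed the sample lines from the question to the helper on stdin.
printf '>1A    THISISATEST\n>1B    THATISATEST\n' | split_col2
```

On the sample input this should print >1A    THIS    ISATEST and >1B    THAT    ISATEST, which keeps the original spacing between the first two columns. The awk version above joins the fields with a single space instead.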

Related Solutions

Remove Lines with Field Value Less Than or Equal to 3 Using sed or awk

You almost got it.

 awk '(NR>1) && ($8 > 2 ) ' foo > bar

where

NR is the record number (that is, the line number)
$8 is the eighth field
&& is logical AND
foo is the original file, left unchanged
bar is the resulting file
the implicit default action is to print the current input line

Note that the header is stripped when going from foo to bar. To keep it:

 awk '(NR==1) || ($8 > 2 ) ' foo > bar

where

|| is logical OR
the input line is printed if NR==1 or if $8 > 2
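
To see the difference between the two commands, here is a small sketch on made-up data. The sample function and its contents are assumptions for illustration, not the asker's file.

```shell
# Hypothetical sample: a header line plus three data rows with 8 fields each.
sample() {
    printf '%s\n' \
        'c1 c2 c3 c4 c5 c6 c7 c8' \
        'a b c d e f g 1' \
        'a b c d e f g 3' \
        'a b c d e f g 5'
}

# Drops the header (NR>1 fails on line 1) and every row whose 8th field is 2 or less.
sample | awk '(NR>1) && ($8 > 2)'

# Keeps the header (NR==1) and applies the $8 > 2 test to the remaining rows.
sample | awk '(NR==1) || ($8 > 2)'
```

The first command should print only the rows ending in 3 and 5. The second prints the header followed by those same two rows.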

Update #1

To specify a range

( ($8 >= -4) && ($8 <= 4) )                  keeps lines whose 8th field is between -4 and 4
(NR == 1) || ( ($8 >= -4) && ($8 <= 4) )     same, but also keeps the header
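
If the bounds change often, they can be passed in with -v instead of being hard-coded. This is only a sketch: the function name filter_range, the variable names lo and hi, and the sample rows are all made up.

```shell
# Hypothetical helper: keep the header plus rows whose 8th field lies in [lo, hi].
# lo and hi are set with -v, so the awk program text never needs editing.
filter_range() {
    awk -v lo="$1" -v hi="$2" '(NR == 1) || (($8 >= lo) && ($8 <= hi))'
}

# Made-up input: a header and four data rows, filtered to the range -4..4.
printf '%s\n' \
    'c1 c2 c3 c4 c5 c6 c7 c8' \
    'a b c d e f g -7' \
    'a b c d e f g -4' \
    'a b c d e f g 0' \
    'a b c d e f g 9' | filter_range -4 4
```

Both bounds are inclusive, so the rows ending in -4 and 0 are kept along with the header, while -7 and 9 are dropped.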

Unix: replace one entire column in one file with a single value from another file

First extract the field you want from File 2:

value="$(awk -F, 'NR==1{print $3;exit}' file2)"

Then plug it into the replacement code for File 1:

awk '{$11 = v} 1' v="$value" file1
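
Here are the two steps run end to end on made-up data. The file names follow the answer, but the contents and the temporary directory are assumptions for illustration.

```shell
# Hypothetical sample data in a scratch directory.
tmp="$(mktemp -d)"
printf 'x,y,NEW\nignored,ignored,ignored\n' > "$tmp/file2"
printf 'a b c d e f g h i j OLD l\n1 2 3 4 5 6 7 8 9 10 OLD 12\n' > "$tmp/file1"

# Step 1: take the third comma-separated field from the first line of file2.
value="$(awk -F, 'NR==1{print $3;exit}' "$tmp/file2")"

# Step 2: overwrite field 11 on every line of file1; the trailing 1 prints each line.
result="$(awk '{$11 = v} 1' v="$value" "$tmp/file1")"
printf '%s\n' "$result"

rm -rf "$tmp"
```

With this input, OLD in the 11th column becomes NEW on both lines. Keep in mind that assigning to a field makes awk rebuild the line using the output field separator (a single space by default), so any original runs of spaces or tabs in file1 are not preserved.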
