Text Processing – Print Line After nth Occurrence of a Match

awkregular expressionsedtext processing

I am looking to display the line 4598 in the following file. Effectively I want to display the line AFTER the nth occurrence of a match. In this case, the line after the 3rd occurrence of <Car>. How do I go about this?

<Car>
10456
</Car>
<Car>
70192
</Car>
<Car>
4598
</Car>

Best Answer

awk -v n=3 '/<Car>/ && !--n {getline; print; exit}'

Or:

awk '/<Car>/ && ++n == 3 {getline; print; exit}'

To pass the search pattern as a variable:

var='<car>'
PATTERN="$var" awk -v n=3 '
  $0 ~ ENVIRON["PATTERN"] && ++n == 3 {getline; print; exit}'

Here using ENVIRON instead of -v as -v expands backslash-escape sequences and backslashes are often found in regular expressions (so would need to be doubled with -v).

GNU awk 4.2 or above lets you assign variables as strong typed regexps. As long as its POSIX mode is not enabled (for instance via the $POSIXLY_CORRECT environment variable, you can do:

# GNU awk 4.2 or above only, when not in POSIX mode
gawk -v n=3 -v pattern="@/$var/" '
  $0 ~ pattern && ++n == 3 {getline; print; exit}'

Related Solutions

Print several lines after nth occurence in bash

You can use grep and tail to achieve this:

$ n=3
$ k=2
$ grep -m "$n" -A "$k" 'Draft' input.txt | tail -n "$k"
important line 1  
important line 2  
$

The -m "$n" option to grep specifies to stop after the nth match, and -A "$k" tells grep to output k lines from after each match. We then pipe this to tail -b "$k" to output only those k lines.

Freebsd – BSD sed: Replace only the Nth occurrence of a pattern

With any POSIX sed:

$ sed -e'/hello/{' -e:1 -e'$!N;s/hello/world/2;t2' -eb1 -e\} -e:2 -en\;b2 <file
hello world hello
hello hello hello

After the first match /hello/, we run into a loop.
Inside loop :1, we read each Next line to the pattern space, doing substitute command for 2nd occurrence only. We test if the substitution success or not. If yes, we run into loop :2, else repeat the loop with b1.
Inside loop :2, we just print remain lines till the end of file.

Note that this approach will store all things between two hello in pattern space. It will be a problem with huge files, when the first and the second are far from each other.

Best Answer

Related Solutions

Print several lines after nth occurence in bash

Freebsd – BSD sed: Replace only the Nth occurrence of a pattern

Related Question