Bash – Ignoring characters at start and end of words

bashshell

I'm trying to pass a flatdb with some names through a script in order to ignore certain things fields and check others. Currently this is as far as I've gotten.

What would I use in my then statement in order to ignore the comment and space lines?

EDIT: Sample of file being read

Bob,Brown,Smith,39
#Alex,Jay,Jones,83

Justin,Michael,White,18
 James,Paul,Weis,54

EDIT: Then I am separating each name

IFS_SAVE=${IFS}
IFS=,

while read value1 value2 value3 value4
do
 echo "First name ${value1}"
 echo "Middle name ${value2}"
 echo "Last name ${value3}"
 echo "Age ${value4}"
done < ${1}

Then I'm trying to ignore blank lines in the file

while read value1 value2 value3 value4
do
if [ -z ${value1} ]
 then
   echo "blank line exists"
 fi
done < ${1}

Then I'm trying ignore commented lines in file

while read value1 value2 value3 value4
do
if [ "${value1:0:1}" = "#" ]
 then
    echo "comment exists"
 fi
done < ${1}

From here down I have been unable to get this to actually work

I'm trying to check if fields start or end with space

while read value1 value2 value3 value4
do
if [[ ${value1} = *[[:space:]]* ]]
then
    echo "space exist"
fi
done < ${1}

EDIT: Ideal output

First name Bob
Middle name Brown
Last name Smith
Age 39
First name Justin
Middle name Michael
Last name White
Age 39
 James,Paul,Wise,54 space exist

Best Answer

Use a text processing tool to process text, not hundreds of invocations of inappropriate tools such as read or echo in a loop.

Here, the obvious one to process tabular data is awk:

awk -F, -v OFS='\n' '
  ! /^[#[:space:]]/ {
    print "First name "  $1, \
          "Middle name " $2, \
          "Last name "   $3, \
          "Age "         $4
  }' < you-file

The ! /^[#[:space:]]/ condition matches on lines that don't (!) start (^) with # or whitespace character. You could also do /^[^#[:space:]]/ to match on lines that start with a character other than # or whitespace which in effect would also skip empty lines or add && NF == 4 to skip the lines that don't have exactly 4 fields.

Or to pass the lines with leading or trailing whitespace along untouched as in your expected output:

awk -F, -v OFS='\n' '
  /^[[:space:]]/ || /[[:space:]]$/ {print; next}

  /^[^#]/ && NF == 4 {
    print "First name "  $1, \
          "Middle name " $2, \
          "Last name "   $3, \
          "Age "         $4
  }' < you-file

(Here, we're assuming a POSIX compliant awk; with mawk, replace [:space:] with the hardcoded list of whitespace characters you expect to find at the start as mawk doesn't support POSIX character classes).

Also beware that the carriage return character (aka CR / \r / ^M) is considered a [:space:] character. If the input file was a text file coming from Microsoft OSes, where the line delimiter is CRLF instead of just LF in Unix, then each line would end in a whitespace character. You'd want to run the file through dos2unix first to convert it to Unix format.

Related Solutions

Shell – Why is echo ignoring the quote characters

Your shell is interpreting the quotes, both ' and ", before they even get to echo. I generally just put double quotes around my argument to echo even if they're unnecessary; for example:

$ echo "Hello world"
Hello world

So in your first example, if you want to include literal quote marks in your output, they either need to be escaped:

$ echo \'Hello world\'
'Hello world'

Or they need to be used within a quoted argument already (but it can't be the same kind of quote, or you'll need to escape it anyway):

$ echo "'Hello world'"
'Hello world'

$ echo '"Hello world"'
"Hello world"

In your second example, you're running a command substitution in the middle of the string:

grep  $ARG  /var/tmp/setfile  | awk {print $2}

Things that start with $ are also handled specially by the shell -- it treats them as variables and replaces them with their values. Since most likely neither of those variables are set in your shell, it actually just runs

grep /var/tmp/setfile | awk {print}

Since grep only sees one argument, it assumes that argument is the pattern you're searching for, and that the place it should read data from is stdin, so it blocks waiting for input. That's why your second command appears to just hang.

This won't happen if you single-quote the argument (which is why your first example nearly worked), so this is one way to get the output you want:

echo \'' echo PARAM=`  grep  $ARG  /var/tmp/setfile  | awk '{print $2}' `    '\'

You can also double-quote it, but then you'll need to escape the $s so the shell doesn't resolve them as variables, and the backticks so the shell doesn't run the command substitution right away:

echo "' echo PARAM=\`  grep  \$ARG  /var/tmp/setfile  | awk '{print \$2}' \`    '"

Shell – Replace between patterns that start and end in different lines

With GNU sed:

sed '/binaries=(/,/)/{
       //!d
       /)/e uname
     }'

Replace uname with your command.

Best Answer

Related Solutions

Shell – Why is echo ignoring the quote characters

Shell – Replace between patterns that start and end in different lines

Related Question