Not sure if this will work in awk (works in vi, etc.) but if you prefix the control character you're looking for with the ctrl-v sequence, that may work. In some cases the presence of ctrl-v tells the command that the next character is a control character. Have a go anyway.
Maybe you use some older version of awk and it doesn't recognise special characters. But, gawk does. I use GNU version 3.0.3 of awk (GNU, thanks again!) and it works (awk '$0 ~ /\026/ { print "Here is!" }' inputfile).
Hi, I am sending along a utility I wrote that should show one way to check for control characters. The key is in the 'ordinal' array in the BEGIN section. (I think I based the idea on Arnold Robbins' book 'Effective AWK Programming' -- see section 15.5 Translating Between Characters and Numbers).
Purpose:
This awk utility (visib.awk) can be used to make invisible characters (except spaces) visible. (It looks at characters in the ascii range 1 to 31 and 127 to 255). This is particularly useful for situations where a file contains hidden control characters, such as newlines and carriage returns. It can also be used to replace all instances of selected characters in the ranges mentioned above with another character, or null.
There are a few syntax options to use this script:
1)
To view invisible control characters, there are 2 approaches:
1 a)
% visib.awk filename
This syntax will print any hidden characters as numbers enclosed in angle brackets.
Example: visib.awk <myfile.txt>
Eg: <13> represents a carriage return (ie: ascii character #13).
1 b)
% visib.awk vischar=<display character(s)> <filename>
This syntax will print any hidden characters as the display character(s) that you provide.
Example: visib.awk vischar=@ myfile.txt
In this example, invisible characters will be displayed as an at sign ('@').
Note: For some display characters, such as the space (" " and pound sign ("#", you will need to enclose the argument in single quotes and add a backslash before the character.
2)
To replace all instances of selected characters with another character, there are 3 choices of syntax:
2 a)
% visib.awk oldchar=<old_char_num> newchar=new_char_num filename
This syntax will replace one selected control character (represented by old_char_num) with another selected control character (represented by new_char_num).
Notes:
Both arguments are numerical values representing ascii values.
You will want to redirect the output of this command to another file, which you can then use to overwrite the original, if that is what you want.
Example: To replace a carriage return (ascii 13) with a tab (ascii 9) you could do:
2 b)
% visib.awk oldchar=<old_char_num> newchar=space <filename>
This syntax is used to replace a selected control character (represented by old_char_num) with a space.
Notes:
The first argument (old_char_num) must be a numeric value representing an ascii value.
You will want to redirect the output of this command to another file, which you can then use to overwrite the original, if that is what you want.
2 c)
% visib.awk oldchar=<old_char_num> newchar=null <filename>
This syntax is used to replace a selected control character (represented by old_char_num) with a null.
Notes:
The first argument (old_char_num) must be a numeric value representing an ascii value.
You will want to redirect the output of this command to another file, which you can then use to overwrite the original, if that is what you want.
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.