Cross posted from the Perl forum in case anyone has any ideas here.
I have a tab delimited file, 16 fields, 1,000,000 records.
I want to scan the file and report all lines which have control characters in them. When I say control characters, I basically mean all chars except [a-z][A-Z][0-9] and all the other punctuation characters available via a standard keyboard, i.e. (`¬!"£$%^&*-=_+[]{};'#~,./<>?\|) etc. Probably want to ignore carriage returns also.
Hope this makes sense. I'm happy to use any tool available but I'm best with awk, sed or perl. I figure all is needs is the right regular expression.
Any advice much appreciated.
Greg.
I have a tab delimited file, 16 fields, 1,000,000 records.
I want to scan the file and report all lines which have control characters in them. When I say control characters, I basically mean all chars except [a-z][A-Z][0-9] and all the other punctuation characters available via a standard keyboard, i.e. (`¬!"£$%^&*-=_+[]{};'#~,./<>?\|) etc. Probably want to ignore carriage returns also.
Hope this makes sense. I'm happy to use any tool available but I'm best with awk, sed or perl. I figure all is needs is the right regular expression.
Any advice much appreciated.
Greg.