I have systems that generate log-files daily and it's full of useless data. I wrote a python script that weeds out what I need. Based on user input, it goes through files, examines the file line by line (readline()) and if it matches certain criteria, the given line is then written into a separate text file.
However, there appears to be a certain type of character that crashes the program.
The error message I get is
UnicodeDecodeError: 'utf16' codec can't decode bytes in position 70-71: unexpected end of data
When I open the file with Notepad ++, I see the attached.
When I use the readline() with Try:, then the program doesn't crash, it actually bypasses some of these errors, but most of the time it hangs.
Currently the only way I can use this program is to run it, see where it crashes, manually remove the pictured characters, and run the program again, and when it crashes, manually... etc. etc, etc. Very time-consuming...
Any idea?
However, there appears to be a certain type of character that crashes the program.
The error message I get is
UnicodeDecodeError: 'utf16' codec can't decode bytes in position 70-71: unexpected end of data
When I open the file with Notepad ++, I see the attached.
When I use the readline() with Try:, then the program doesn't crash, it actually bypasses some of these errors, but most of the time it hangs.
Currently the only way I can use this program is to run it, see where it crashes, manually remove the pictured characters, and run the program again, and when it crashes, manually... etc. etc, etc. Very time-consuming...
Any idea?