Edward1984
Programmer
Hello everyone.
Can you advise me please on this project I'm trying to do.
I've got about 45Gigs of text logs. I've done some search and extracted regular expressions for certain text sentences related to errors that I'm mostly interested in. Now I'd like to be able to do some of the following:
1. be able to predict a possibility of occurrence of some sentences in their relation to other sentences (eg: occurrence of error string 1 is likely with probability P to be located with error string 2 in the range of N lines).
2. at least to be able to cluster roughly error strings by their occurrence together with some range of lines.
Could you advise me please what tools and methods to use best? Thank you in advance!
Can you advise me please on this project I'm trying to do.
I've got about 45Gigs of text logs. I've done some search and extracted regular expressions for certain text sentences related to errors that I'm mostly interested in. Now I'd like to be able to do some of the following:
1. be able to predict a possibility of occurrence of some sentences in their relation to other sentences (eg: occurrence of error string 1 is likely with probability P to be located with error string 2 in the range of N lines).
2. at least to be able to cluster roughly error strings by their occurrence together with some range of lines.
Could you advise me please what tools and methods to use best? Thank you in advance!