K,
Thank you for the post. I had a look after posting and though I was not able to scroll and see the whole text, I was
able to view some of the code and when I copied and pasted the data in notepad, it was complete. Now, I had wordwrapped
the whole thing and removed unwanted part of the text. Hope this is much better read than the previous post.
My apologies for the inconvenience caused to all.
I use CR XI and the data source is a UNIX log file that has about 200,000 lines. Sample data is as follows:
I inserted a DUMMY blank line after every line in the code shown here to properly identify the original text fields
(since due to the number of characters, the lines were wrapped in the Preview screen - I added carriage return to be able to view the code in the screen). In the source file, they are not
there.
Out of these lines, all I am interested is in parsing the file for the text <errorcode> and use that line and the two
previous lines concatenated in the original order. For example, the first occurrence of this will become as follows
(The log file somehow seems to insert carriage return where it should not be there).
I have CR text driver defined to read lines that are 2,000 characters long in FIXED format while reading this file.
For want of any other ideas, I started as follows:
@concat:
if left({error35_log.Title},9) = "<!DOCTYPE" then
previous({error35_log.Title}) & {error35_log.Title} & next({error35_log.Title})
From here, I use another formulao to identify the lines with errorcode in them (the lines that I really want for use in
the report)
@errorcode_data:
if instr({@concat},"<ErrorCode>",1) <> 0
then
{@concat}
else
"NODATA" //NODATA is used to suppress the unwanted lines in Details section
After this step, I have different formulas to extract the information I need to report:
@Date
left({@errorcode_data},10)
@errorcode_string:
((extractstring({@errorcode_data},"<ErrorCode>","</ErrorCode>")))
@TXN_string:
(extractstring({@errorcode_data},"<TxNumber>","</TxNumber>"))
I tried using tonumber(((extractstring({@errorcode_data},"<ErrorCode>","</ErrorCode>"))) in the above formula and I get
error 'non-numeric values' but when I use tonumber(@errorcode_string) in a separate formula, it works fine. Any ideas?
I am sure there are more straightforward methods to achieve the above and I can use any ideas / help in doing that.
I thought Record selection formula would be the best but I am not sure if I can combine the three lines in one while
doing that. That will make the report work faster as I normally only have about 1% of the total lines have the error
data.
Also, I want to group the errorcode and TXNumber field, but I do not get an option to use the formula in a group (only
my original field is available for grouping or summary). I would like to know how I can achieve that.
I would appreciate any help / guidance I can get on this.
Thank you very much to all of you for your support.
TR
Thank you for the post. I had a look after posting and though I was not able to scroll and see the whole text, I was
able to view some of the code and when I copied and pasted the data in notepad, it was complete. Now, I had wordwrapped
the whole thing and removed unwanted part of the text. Hope this is much better read than the previous post.
My apologies for the inconvenience caused to all.
I use CR XI and the data source is a UNIX log file that has about 200,000 lines. Sample data is as follows:
Code:
2007.01.09 13:41:30 Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetUserRequest SYSTEM
"GetUserRequest.dtd"><GetUserRequest><CommonRequest CustomerSubType="FSM" CustomerType="FSM"><TxNumber>802</TxNumber>
2007.01.09 13:41:30 Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetUserRequest SYSTEM
"GetUserRequest.dtd"><GetUserRequest><CommonRequest CustomerSubType="FSM" CustomerType="FSM"><TxNumber>802</TxNumber>
2007.01.09 13:41:32 Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetCountryProhibitsRequest
SYSTEM"GetCountryProhibitsRequest.dtd"><GetCountryProhibitsRequest><CommonRequest CustomerSubType="WEBSITE"
2007.01.09 13:41:34 BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE EditCommodityCountryReply SYSTEM "EditCommodityCountryReply.dtd">
<EditCommodityCountryReply><CommonReply><TxNumber>963</TxNumber><TxUUID>00g00kafac</TxUUID></CommonReply><Error>
<ErrorCode>77003</ErrorCode><ErrorMsg/></Error></EditCommodityCountryReply>
2007.01.09 13:41:59 Invoking method TradeXpress.CustomerProfile.CustomerProfileEJB_jruilg_ELOImpl.retrieveCommodityRecords
2007.01.09 13:41:59 weblogic OBJECTBROKER.PROCESSMESSAGE TX= 851 Invoke time= 7
2007.01.09 13:42:00 BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE DetermineDocsForCommodityReply SYSTEM "DetermineDocsForCommodityReply.dtd">
<DetermineDocsForCommodityReply><CommonReply><TxNumber>920</TxNumber><TxUUID>g0g00o4f17</TxUUID></CommonReply>
<Error><ErrorCode>55002</ErrorCode><ErrorMsg>[854]</ErrorMsg></Error></DetermineDocsForCommodityReply>
2007.01.09 13:42:00 OBJECTBROKER.PROCESSMESSAGE Invoking method TradeXpress.DocDetermination.DocDeterminationEJB_jisdhc_ELOImpl.determineDocsforCommodity
I inserted a DUMMY blank line after every line in the code shown here to properly identify the original text fields
(since due to the number of characters, the lines were wrapped in the Preview screen - I added carriage return to be able to view the code in the screen). In the source file, they are not
there.
Out of these lines, all I am interested is in parsing the file for the text <errorcode> and use that line and the two
previous lines concatenated in the original order. For example, the first occurrence of this will become as follows
(The log file somehow seems to insert carriage return where it should not be there).
Code:
2007.01.09 13:41:34 BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?><!DOCTYPE
EditCommodityCountryReply SYSTEM "EditCommodityCountryReply.dtd"><EditCommodityCountryReply><CommonReply><TxNumber>
963</TxNumber><TxUUID>00g00kafac</TxUUID></CommonReply><Error><ErrorCode>77003</ErrorCode><ErrorMsg/></Error>
</EditCommodityCountryReply>
I have CR text driver defined to read lines that are 2,000 characters long in FIXED format while reading this file.
For want of any other ideas, I started as follows:
@concat:
if left({error35_log.Title},9) = "<!DOCTYPE" then
previous({error35_log.Title}) & {error35_log.Title} & next({error35_log.Title})
From here, I use another formulao to identify the lines with errorcode in them (the lines that I really want for use in
the report)
@errorcode_data:
if instr({@concat},"<ErrorCode>",1) <> 0
then
{@concat}
else
"NODATA" //NODATA is used to suppress the unwanted lines in Details section
After this step, I have different formulas to extract the information I need to report:
@Date
left({@errorcode_data},10)
@errorcode_string:
((extractstring({@errorcode_data},"<ErrorCode>","</ErrorCode>")))
@TXN_string:
(extractstring({@errorcode_data},"<TxNumber>","</TxNumber>"))
I tried using tonumber(((extractstring({@errorcode_data},"<ErrorCode>","</ErrorCode>"))) in the above formula and I get
error 'non-numeric values' but when I use tonumber(@errorcode_string) in a separate formula, it works fine. Any ideas?
I am sure there are more straightforward methods to achieve the above and I can use any ideas / help in doing that.
I thought Record selection formula would be the best but I am not sure if I can combine the three lines in one while
doing that. That will make the report work faster as I normally only have about 1% of the total lines have the error
data.
Also, I want to group the errorcode and TXNumber field, but I do not get an option to use the formula in a group (only
my original field is available for grouping or summary). I would like to know how I can achieve that.
I would appreciate any help / guidance I can get on this.
Thank you very much to all of you for your support.
TR