Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Text file issues

Status
Not open for further replies.

thiagarr

Technical User
Oct 20, 2006
86
US
I use CR XI and the data source is a UNIX log file that has about 200,000 lines. Sample data is as follows:

Code:
2007.01.09  13:41:30 [ExecuteThread: '44' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetUserRequest SYSTEM "GetUserRequest.dtd"><GetUserRequest><CommonRequest CustomerSubType="FSM" CustomerType="FSM"><TxNumber>802</TxNumber><TxUUID>000g0h37
69</TxUUID><CusReferenceID>0999</CusReferenceID></
CommonRequest><RequestData><UUID>000g0h3769</UUID></
RequestData></GetUserRequest>

2007.01.09  13:41:30 [ExecuteThread: '42' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetUserRequest SYSTEM "GetUserRequest.dtd"><GetUserRequest><CommonRequest CustomerSubType="FSM" CustomerType="FSM"><TxNumber>802</TxNumber><TxUUID>00g00469
cc</TxUUID><CusReferenceID>0999</CusReferenceID></
CommonRequest><RequestData><UUID>00g00469cc</UUID></
RequestData></GetUserRequest>

2007.01.09  13:41:32 [ExecuteThread: '46' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetCountryProhibitsRequest SYSTEM "GetCountryProhibitsRequest.dtd"><GetCountryProhibitsRequest>
<CommonRequest CustomerSubType="WEBSITE" CustomerType="GTM"><TxNumber>832</TxNumber><TxUUID>0gggg015
dc</TxUUID><CusReferenceID>0999</CusReferenceID></
CommonRequest><RequestData><ExportCountryCode>US</
ExportCountryCode><ImportCountryCode>BS</ImportCountryCode>
<ShipDate>20070109</ShipDate></RequestData></
GetCountryProhibitsRequest>

2007.01.09  13:41:34 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?>

<!DOCTYPE EditCommodityCountryReply SYSTEM "EditCommodityCountryReply.dtd">

<EditCommodityCountryReply><CommonReply><TxNumber>963</
TxNumber><TxUUID>00g00kafac</TxUUID></CommonReply><Error>
<ErrorCode>77003</ErrorCode><ErrorMsg/></Error></
EditCommodityCountryReply>

2007.01.09  13:41:59 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic OBJECTBROKER.PROCESSMESSAGE  Invoking method TradeXpress.CustomerProfile.CustomerProfileEJB_jruilg_
ELOImpl.retrieveCommodityRecords

2007.01.09  13:41:59 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic OBJECTBROKER.PROCESSMESSAGE  TX= 851 Invoke time= 7

2007.01.09  13:42:00 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?>

<!DOCTYPE DetermineDocsForCommodityReply SYSTEM "DetermineDocsForCommodityReply.dtd">

<DetermineDocsForCommodityReply><CommonReply><TxNumber>
920</TxNumber><TxUUID>g0g00o4f17</TxUUID></CommonReply>
<Error><ErrorCode>55002</ErrorCode><ErrorMsg>[854]</
ErrorMsg></Error></DetermineDocsForCommodityReply>

2007.01.09  13:42:00 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic OBJECTBROKER.PROCESSMESSAGE  Invoking method TradeXpress.DocDetermination.DocDeterminationEJB_jisdhc_
ELOImpl.determineDocsforCommodity

2007.01.09  13:42:01 [ExecuteThread: '46' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  Client request = <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE GetUserRequest SYSTEM "GetUserRequest.dtd"><GetUserRequest><CommonRequest CustomerSubType="FSM" CustomerType="FSM"><TxNumber>802</TxNumber><TxUUID>000g01e4
00</TxUUID><CusReferenceID>0999</CusReferenceID></
CommonRequest><RequestData><UUID>000g01e400</UUID></
RequestData></GetUserRequest>

I inserted a DUMMY blank line after every line in the code shown here to properly identify the original text fields (since due to the number of characters, the lines were wrapped in the Preview screen). In the source file, they are not there.

Out of these lines, all I am interested is in parsing the file for the text <errorcode> and use that line and the two previous lines concatenated in the original order. For example, the first occurrence of this will become as follows. (The log file somehow seems to insert carriage return where it should not be there).

Code:
2007.01.09  13:41:34 [ExecuteThread: '31' for queue: 'weblogic.kernel.Default'] [5] weblogic APIEJB.PROCESSMESSAGE(String,int)  BACKEND response sending to Servlet = <?xml version="1.0" encoding="UTF-8"?><!DOCTYPE EditCommodityCountryReply SYSTEM "EditCommodityCountryReply.dtd"><EditCommodityCountryReply>
<CommonReply><TxNumber>963</TxNumber><TxUUID>00g00kafac
</TxUUID></CommonReply><Error><ErrorCode>77003</ErrorCode>
<ErrorMsg/></Error></EditCommodityCountryReply>

I have CR text driver defined to read lines that are 2,000 characters long in FIXED format while reading this file.

For want of any other ideas, I started as follows:

@concat:

if left({error35_log.Title},9) = "<!DOCTYPE" then
previous({error35_log.Title}) & {error35_log.Title} & next({error35_log.Title})

From here, I use another formulao to identify the lines with errorcode in them (the lines that I really want for use in the report)

@errorcode_data:
if instr({@concat},"<ErrorCode>",1) <> 0
then
{@concat}
else
"NODATA" //NODATA is used to suppress the unwanted lines in Details section

After this step, I have different formulas to extract the information I need to report:

@Date
left({@errorcode_data},10)

@errorcode_string:
((extractstring({@errorcode_data},"<ErrorCode>",
"</ErrorCode>")))

@TXN_string:
(extractstring({@errorcode_data},"<TxNumber>","</TxNumber>")
)

I tried using tonumber(((extractstring({@errorcode_data},"<ErrorCode>",
"</ErrorCode>"))) in the above formula and I get error 'non-numeric values'
but when I use tonumber(@errorcode_string) in a separate formula, it works fine. Any ideas?

I am sure there are more straightforward methods to achieve the above and I can use any ideas / help in doing that.

I thought Record selection formula would be the best but I am not sure if I can combine the three lines in one while doing that. That will make the report work faster as I normally only have about 1% of the total lines have the error data.

Also, I want to group the errorcode and TXNumber field, but I do not get an option to use the formula in a group (only my original field is available for grouping or summary). I would like to know how I can achieve that.

I would appreciate any help / guidance I can get on this.
Thank you very much to all of you for your support.

TR
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top