Hi all,
we're experiencing a strange issue with the indexing of mht files on our CS10 installation (CS SP2 Update 13).
Some of our mht files are not indexed by CS, and we get the following messages in the index log file:
1401385870871:IEWorkerThread-0:4:Performing (L)AddOrReplace on Object Id [DataId=1618733&Version=1]:
1401385870872:IEWorkerThread-0:5:Doing add DataId=1618733&Version=1:
1401385870879:IEWorkerThread-0:5:Raw data size for DataId=1618733&Version=1 was 2880 chars.:
1401385870879:IEWorkerThread-0:5:Metadata set took 3 ms.:
1401385870880:IEWorkerThread-0:5:Mimetype of DataId=1618733&Version=1 is text/html:
1401385870951:IEWorkerThread-0:0:Accumulator; Discarding content of object DataId=1618733&Version=1 as it fails the accumulator's good object heuristic 1 nWords 54182 nLengthOfWords 721513 text/html:
1401385870951:IEWorkerThread-0:4:Performed (L)AddOrReplace on Object Id [DataId=1618733&Version=1]:
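For reference, the figures in the Accumulator line work out to an average word length of roughly 13.3 characters. Below is a minimal sketch of that arithmetic, assuming that nLengthOfWords is the summed length of all counted words (this is our reading of the log line, not documented behaviour). The function name average_word_length is hypothetical and not part of CS.

```python
import re

# The Accumulator line copied from the index log above.
LOG_LINE = (
    "1401385870951:IEWorkerThread-0:0:Accumulator; Discarding content of object "
    "DataId=1618733&Version=1 as it fails the accumulator's good object heuristic 1 "
    "nWords 54182 nLengthOfWords 721513 text/html:"
)


def average_word_length(line):
    """Return nLengthOfWords / nWords for an Accumulator log line, or None.

    Assumption: nLengthOfWords is the total length of all counted words.
    """
    match = re.search(r"nWords (\d+) nLengthOfWords (\d+)", line)
    if match is None:
        return None
    n_words = int(match.group(1))
    total_length = int(match.group(2))
    if n_words == 0:
        return None
    return total_length / n_words


if __name__ == "__main__":
    print(f"average word length: {average_word_length(LOG_LINE):.2f}")
```

Running the same calculation on the log line of a correctly indexed mht file would show how far apart the two cases are.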
We've read about the bad object heuristic logic and we're aware of how it works. The strange thing is that if we open the same file with Word 2010 and save it without any changes, it is indexed correctly when we upload it back to CS.
The original files were probably edited with Office 2007. On the OT community we found article 17394445 on this issue, but it applies only to CS 9.7.1. Please note that most of our mht files are indexed correctly.
We've also verified that on a previous 9.7.1 installation the same file is correctly indexed.
Do you have any suggestions on that?
Thanks
Regards