Content Server 10 - Indexing Problem with MHT files

Status
Not open for further replies.

megapix

Technical User
Jan 22, 2013
18
IT
Hi all,
We're experiencing a strange issue with the indexing of MHT files on our CS10 installation (CS SP2 Update 13).
Some of our MHT files are not indexed by CS, and we get the following messages in the index log file.

1401385870871:IEWorkerThread-0:4:performing (L)AddOrReplace on Object Id [DataId=1618733&Version=1]:
1401385870872:IEWorkerThread-0:5:Doing add DataId=1618733&Version=1:
1401385870879:IEWorkerThread-0:5:Raw data size for DataId=1618733&Version=1 was 2880 chars.:
1401385870879:IEWorkerThread-0:5:Metadata set took 3 ms.:
1401385870880:IEWorkerThread-0:5:Mimetype of DataId=1618733&Version=1 is text/html:
1401385870951:IEWorkerThread-0:0:Accumulator; Discarding content of object DataId=1618733&Version=1 as it fails the accumulator's good object heuristic 1 nWords 54182 nLengthOfWords 721513 text/html:
1401385870951:IEWorkerThread-0:4:performed (L)AddOrReplace on Object Id [DataId=1618733&Version=1]:
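
To find every object affected in a larger log, something like the following could help. It is only a rough sketch: it assumes the discard message always has exactly the shape shown in the excerpt above, and the names `DISCARD_RE` and `find_discarded` are made up for illustration, not part of Content Server. It pulls out the DataId, the word count and the total word length, and derives the average word length, which is one number the heuristic plausibly looks at.

```python
import re

# Matches the accumulator discard message as it appears in the excerpt above.
# The exact wording is an assumption based on that single sample line.
DISCARD_RE = re.compile(
    r"^(?P<ts>\d+):(?P<thread>[^:]+):\d+:"
    r"Accumulator; Discarding content of object "
    r"DataId=(?P<data_id>\d+)&Version=(?P<version>\d+) .*?"
    r"heuristic (?P<rule>\d+) nWords (?P<n_words>\d+) "
    r"nLengthOfWords (?P<len_words>\d+) (?P<mime>\S+?):?$"
)


def find_discarded(lines):
    """Yield one dict per object whose content the accumulator discarded."""
    for line in lines:
        m = DISCARD_RE.match(line.strip())
        if not m:
            continue  # not a discard message
        n_words = int(m["n_words"])
        len_words = int(m["len_words"])
        yield {
            "data_id": int(m["data_id"]),
            "version": int(m["version"]),
            "heuristic": int(m["rule"]),
            "n_words": n_words,
            "len_words": len_words,
            "mime": m["mime"],
            # Average characters per word, derived from the two logged counters.
            "avg_word_len": len_words / n_words if n_words else 0.0,
        }


# Two lines copied from the log excerpt above, used as sample input.
SAMPLE = [
    "1401385870880:IEWorkerThread-0:5:Mimetype of DataId=1618733&Version=1 is text/html:",
    "1401385870951:IEWorkerThread-0:0:Accumulator; Discarding content of object "
    "DataId=1618733&Version=1 as it fails the accumulator's good object heuristic 1 "
    "nWords 54182 nLengthOfWords 721513 text/html:",
]

if __name__ == "__main__":
    for hit in find_discarded(SAMPLE):
        print(hit["data_id"], hit["mime"], round(hit["avg_word_len"], 2))
```

For a real log, pass an open file object instead of `SAMPLE`. For the file above, the average works out to roughly 13 characters per word.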

We've read about the bad object heuristic logic and we're aware of how it works. The strange thing is that if we open the same file with Word 2010 and save it without any changes, it is indexed correctly when we upload it back to CS.
The original files were probably edited with Office 2007. On the OT community we found article 17394445 on this issue, but it applies only to CS 9.7.1. Please note that most of our MHT files are indexed correctly.
We've also verified that the same file is indexed correctly on a previous 9.7.1 installation.

Do you have any suggestions on that?
Thanks
Regards
 