Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations TouchToneTommy on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Information on PDF Index Format?

Status
Not open for further replies.

rfedyk

Programmer
Feb 15, 2002
29
AU
PDF files that have been indexed create a .PDX file which is just the starting point in the indexing info chain. The real info is held in the .DDD and .DID files that are saved in the "parts" subdirectory.

I have a project which would work much better if I could read those files directly. Does anyone have any information on the format of those files?

Thanks
Roger
 
Did a quick google, and this popped up - don't know if it helps.

The .ddd files contain token data (usernames, filenames, object handles, etc. - data that does not need stemming during search). The .did file contains stream data (data that needs stemming during search).

Ahhhhh, I see you have a machine that goes Bing!
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top