I'm planning on storing/indexing millions of text documents and allowing them to be keyword searched and retrieved via a web page. I'm trying to figure out the best way to handle this. I know it's a broad question, but does anyone have any suggestions regarding the best way to do this?
I'm considering three different options:
1. storing flat files which I'll search based on XML schema
2. storing in a SQL Server database using full-text indexes
3. storing in an Oracle database using their XML functionality
I'm not well versed in any of these methods, so there will obviously be some learning needed on my part. But, I want to try to do this in the most efficient manner possible. I'm also expecting that there may be a completely different way that I haven't listed, that would be a better implementation. Thanks for any help or opinions that you guys can provide.
I'm considering three different options:
1. storing flat files which I'll search based on XML schema
2. storing in a SQL Server database using full-text indexes
3. storing in an Oracle database using their XML functionality
I'm not well versed in any of these methods, so there will obviously be some learning needed on my part. But, I want to try to do this in the most efficient manner possible. I'm also expecting that there may be a completely different way that I haven't listed, that would be a better implementation. Thanks for any help or opinions that you guys can provide.