Hi everyone,
The Background
I have designed a database schema with a fact table and several dimension tables with the view to create a set of reports and cubes from this schema. I am trying to work out a strategy to populate the dimension tables using DTS packages but am pretty new to datawarehousing concepts in general and do not feel confident that what I am doing is along the right lines
What I am currently doing
I am storing dimension values in a text file. The text file will include the dimension values as well as a key. This text file is then loaded into the dimension table.
If members are added/removed then the text file is modified and the dimension fully reloaded using standard bulk copy task. The key I use for my dimensions is a surrogate key.
Question
Is there anything wrong with this approach (the values in my dimensions will very rarely change) or is there any reason that I should use a proper incremental approach? ie detect a change in the text file and then make this change to the dimension table rather than reload the results every time?
The problem is that I have been reading up on literature on slow changing dimensions and am consequently managing to get myself very confused!
Any advice would be much appreciated.
Nassy
The Background
I have designed a database schema with a fact table and several dimension tables with the view to create a set of reports and cubes from this schema. I am trying to work out a strategy to populate the dimension tables using DTS packages but am pretty new to datawarehousing concepts in general and do not feel confident that what I am doing is along the right lines
What I am currently doing
I am storing dimension values in a text file. The text file will include the dimension values as well as a key. This text file is then loaded into the dimension table.
If members are added/removed then the text file is modified and the dimension fully reloaded using standard bulk copy task. The key I use for my dimensions is a surrogate key.
Question
Is there anything wrong with this approach (the values in my dimensions will very rarely change) or is there any reason that I should use a proper incremental approach? ie detect a change in the text file and then make this change to the dimension table rather than reload the results every time?
The problem is that I have been reading up on literature on slow changing dimensions and am consequently managing to get myself very confused!
Any advice would be much appreciated.
Nassy