I am designing ETL and really confused about the whole staging architecture. Please provide me whatever links you can, I am ready to read anything. My questions are as follows.
(1)
Staging area between the source and the data Mart (backroom as they call it), is it supposed to be a persistent staging area or a non persistent staging area or does it depend on the requirement of the businees. Does Kimball say anything about it? I did lot of reading but could not get it clearly. Except I guess he recommends persisting confirmed dimensions in the staging.
If it depends on the requirements what are the cases where you will persist structures in the staging (meaning tables will not be wiped out with each load), apart from confirmed dimensions.
If I decide to persist my source information with a slightly different ER model than source (still 3NF)in staging area, will I be deviating too much from Kimball Or in Kimball's terminology this is called an ODS.
People who are doing Kimball stype of datawarehousing, do they ususally have
(1) Source -> ODS -> Data Mart (OR)
(2) Source -> Trasient Staging -> Data Mart (OR)
(3) Source -> Persistent staging -> Data Mart.
(1)
Staging area between the source and the data Mart (backroom as they call it), is it supposed to be a persistent staging area or a non persistent staging area or does it depend on the requirement of the businees. Does Kimball say anything about it? I did lot of reading but could not get it clearly. Except I guess he recommends persisting confirmed dimensions in the staging.
If it depends on the requirements what are the cases where you will persist structures in the staging (meaning tables will not be wiped out with each load), apart from confirmed dimensions.
If I decide to persist my source information with a slightly different ER model than source (still 3NF)in staging area, will I be deviating too much from Kimball Or in Kimball's terminology this is called an ODS.
People who are doing Kimball stype of datawarehousing, do they ususally have
(1) Source -> ODS -> Data Mart (OR)
(2) Source -> Trasient Staging -> Data Mart (OR)
(3) Source -> Persistent staging -> Data Mart.