The only difference between the two is the level of normalization of your dimension table(s); think of star schema as of denormalized snowflake schema.
Here is an example:
Employee dimension is linked to Employee_Earnings fact table by Empl_id.
Let's say each Employee has atributes like Dept, Manager, Division, Region. You can decide to list all these attributes in each Employee record in the Employee Dimension table - than you'll get many repetitive data in this table. And this is perfectly fine with Star schema concept.
Someone else may decide to separate repetitive Dept, Division, Region info into a second dimension table called Dept and provide unique keys for all combinations. Then he would replace Dept info in the Employee table with a reference to the Dept table by Dept keys. Thus, he created two dimension tables linked by Dept key while only Employee is linked to the fact table. He just normilized a Star schema into a Snowflake schema.
Clearly, Snowflake schema is a step back towards to highly normalized OLTP architecture, and away from DW OLAP denormalized architecture.
Often designers end up using a combination snowflake-star approach in which the fields are selectively normalized into seperate tables and at the same time some of these fields are also kept in the lower-level dimension tables (denormed).
Much of the design is dependent on update requirements, query tool requirements, and space requirements. There usually is no totally black & white Star or Snowflake.
Hi,
this is in regard tO the INFORMATICA.
can anybody tell me what is the tool that is used to check
the repeating data in the sources.
immediate response will be highly appreciated.
thanx
rrdw
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.