Does anyone have any experience with these, such as Netezza, Datallegro, Teradata, Greenplum, etc... We're starting a data warehousing project to consolidate our data, normalize it, and setup a standard data model.
Right now, we draw data from multiple source systems and stick the feeds(transformed by Informatica) into a data mart(we have an extensive amount of these) and have a different data model for each data mart.
The problem with this is that that same source system could be feeding multiple data marts but being transformed differently. This makes it difficult for us to figure out which is the "Truth" I guess i would call it. Then multiply that by the 200-300 terabytes of data we have (yes terabytes), it makes it very difficult to properly use our analytical tools to their fullest potential.
I've uploaded a mock up of how the data flows.
Right now, we draw data from multiple source systems and stick the feeds(transformed by Informatica) into a data mart(we have an extensive amount of these) and have a different data model for each data mart.
The problem with this is that that same source system could be feeding multiple data marts but being transformed differently. This makes it difficult for us to figure out which is the "Truth" I guess i would call it. Then multiply that by the 200-300 terabytes of data we have (yes terabytes), it makes it very difficult to properly use our analytical tools to their fullest potential.
I've uploaded a mock up of how the data flows.