DataStiletto
Programmer
We have just acquired a company that is in the same industry as us but offer a different product and service mix. They are located in New England. We know that we will share some customers and products because we have a large New England presence ourself.
I designed our warehouse from the onset to be capable of handling acquisitions by including the source system as part of the key. What I need to do for now is load all the customers and not look for duplicate because they will have different payment and credit profiles than what we share.
(We will dedupe then later)
What I need to do though is to combine the common products we sell to determine where we can obtain better pricing and profit margins. We have set up a data group to look at each item in our inventory and determine if the new acquisition has the same item.
My question is I intend to create a mapping table that will accept the source system id and product id and return a surrogate key for every item in both us and the new acquisition (18,000 items in total to compare). The mapping table will return the same surrogate key if the items are considered the same regardless of the source system.
My question is what to do if the data group misses an item and there is no surrogate key in the mapping table or an new item is entered and the data group doesn't update the mapping key. I was thinking of creating a surrogate default key of not found if this should occur.
I do not want to stop the flow of data into the dimension tables or fact tables because the data group missed a mapping.
Any ideas would be helpfull.
I designed our warehouse from the onset to be capable of handling acquisitions by including the source system as part of the key. What I need to do for now is load all the customers and not look for duplicate because they will have different payment and credit profiles than what we share.
(We will dedupe then later)
What I need to do though is to combine the common products we sell to determine where we can obtain better pricing and profit margins. We have set up a data group to look at each item in our inventory and determine if the new acquisition has the same item.
My question is I intend to create a mapping table that will accept the source system id and product id and return a surrogate key for every item in both us and the new acquisition (18,000 items in total to compare). The mapping table will return the same surrogate key if the items are considered the same regardless of the source system.
My question is what to do if the data group misses an item and there is no surrogate key in the mapping table or an new item is entered and the data group doesn't update the mapping key. I was thinking of creating a surrogate default key of not found if this should occur.
I do not want to stop the flow of data into the dimension tables or fact tables because the data group missed a mapping.
Any ideas would be helpfull.