My company maintains 3 large tables containing customer addresses. These tables are fed weekly batch data by thousands of partner companies. None of the 3 tables stores history: 1 of them is a Type 1 update (changed values are overwritten in place), and the other 2 are full table refreshes.
Now, I'd like to apply address cleansing routines to these 3 tables without changing the table structures.
Here are the problems I see. For the Type 1 table ("T1"), the legacy load program updates a row in T1 whenever it differs from the source file. Say John Smith's address in T1 today is "15 Maple", and my cleansing routine decides that is wrong and revises it (still in T1) to "15 Maple Rd". The source file is not a delta but a snapshot of the partner's whole database, so on the next load "15 Maple" comes across again for John Smith. Because the two values differ, the load overwrites the correctly cleansed "15 Maple Rd" and flips it back to "15 Maple". My routine would then waste processing power cleaning the same address over and over.
It's pretty much the same for the other two tables (T2 and T3), which are fully refreshed: every refresh replaces whatever the cleansing routine had fixed.
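To make the T1 flip-flop concrete, here is a rough Python sketch of what I think happens on each cycle. Everything in it is a stand-in I made up for illustration: the dicts play the role of T1 and the snapshot file, `legacy_type1_load` only approximates what the legacy load program does, and `cleanse` is a toy rule, not my real cleansing routine.

```python
def cleanse(address: str) -> str:
    """Toy cleansing rule (assumption): add 'Rd' when the street type is missing."""
    return address if address.endswith(" Rd") else address + " Rd"


def legacy_type1_load(table: dict, snapshot: dict) -> int:
    """Approximation of the legacy load: overwrite every row that differs
    from the snapshot. Returns the number of rows overwritten."""
    overwritten = 0
    for customer, source_address in snapshot.items():
        if table.get(customer) != source_address:
            table[customer] = source_address
            overwritten += 1
    return overwritten


def cleanse_table(table: dict) -> int:
    """Run the cleansing rule over every row. Returns the number of rows changed."""
    cleaned = 0
    for customer, address in list(table.items()):
        fixed = cleanse(address)
        if fixed != address:
            table[customer] = fixed
            cleaned += 1
    return cleaned


# Hypothetical data: T1 and the partner's full snapshot both start uncleansed.
t1 = {"John Smith": "15 Maple"}
snapshot = {"John Smith": "15 Maple"}

history = []
for cycle in range(3):
    overwritten = legacy_type1_load(t1, snapshot)  # load runs first
    cleaned = cleanse_table(t1)                    # then cleansing runs
    history.append((overwritten, cleaned, t1["John Smith"]))

for cycle, (overwritten, cleaned, address) in enumerate(history):
    print(f"cycle {cycle}: overwritten={overwritten} cleaned={cleaned} address={address!r}")
```

After the first cycle, every load overwrites the cleansed row and every cleansing pass has to fix it again, even though the source value never changed.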
Any help would be greatly appreciated!