All Forums Database
06 Jan 2014
Data corruption in SCD 2 dim table

We have a SCD 2 table PARTY_ID_HIST. It has a surrogate key named Party_sk based on the source system primary key party_id_num. Due to some code issue the table has been populated repeating values and values pre-fixed with 0 for column party_id_num. Due to this there are new surrogate key values generated for column party_sk.
Party_sk Party_id_num
1             12345           
2             12345
3             12345
4             012345
5             012345
I have been told to remove duplicate entries. There are two challenges:
1. I dont to delete duplicate entries while maintaining historical record of SCD 2. That means if there are new surrogate keys generated due to change in some other attributes, those records should not be removed.
2. What to do with wrong Party_sk column which have been populated in other tables?

Raja_KT 1246 posts Joined 07/09
06 Jan 2014

Hi Singh,
First thing is job security. Take back up always.
You have not told about the SCD 2 , whether it is flag, version, start date end date or a combination of these. Sample data ??????
Once you have a backup, you can do whatever you want.

Raja K Thaw
My wiki:
Street Children suffer not by their fault. We can help them if we want.

08 Jan 2014

Anyone who has performed data reconciliation on SCD 2 as mentioned in the title post?

You must sign in to leave a comment.