Question

I am trying to understand pure type 6 SCD implementation from WIKI which says mainly three points

  1. Having a Type 2 surrogate key for each time slice can cause problems if the dimension is subject to change.
  2. A pure Type 6 implementation does not use this, but uses a Surrogate Key for each master data item (e.g. each unique supplier has a single surrogate key).
  3. This avoids any changes in the master data having an impact on the existing transaction data.

However I am unable to visualize these problems clearly.

Was it helpful?

Solution

The point of "type 6" or "hybrid" processing is that you track changes by adding a new row for each new version but then you update some of the attributes on previous versions to reflect the current state of data. That way the data can be filtered and rolled-up across all versions for reporting purposes.

There's another explanation here: http://www.kimballgroup.com/wp-content/uploads/2012/05/DT15CombiningSCD.pdf

Example:

Type 6 Example

http://www.kimballgroup.com/2013/02/05/design-tip-152-slowly-changing-dimension-types-0-4-5-6-7/

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top