A walk back through the joins, hashing, Teradata, key selection, and arguments for and against sequence numbers. While this post is specifically related to Teradata, there are some generic statements here that apply to all MPP solutions.
Data Vault 2.0 has been in the lab for over 3 years now. It is time to evolve, to offer some pain relief to those of you struggling with joins, performance and partitioning on the physical level. DV2.0 is a specification that solves a few of the issues that are brought on by the strict […]
People come to me all the time complaining about the NUMBER OF JOINS in the Data Vault. I thought I might take a crack at answering this question/complaint, and provide some solid mathematical proof behind the fundamental design of the Data Vault model. I welcome your feedback (I hope you take the time to comment).