it has come to my attention that the “industry” believes (incorrectly) that getting data out of a data vault is difficult, if not impossible. this is so far off the truth it isn’t even funny. individuals can find the right information in the book in chapter 6, and in depth discussions in the data vault […]
a walk back through the joins, hashing, teradata, key selection, and arguments for and against sequence numbers. while this post is specifically related to teradata, there are some generic statements here that apply to all mpp solutions.
data vault 2.0 has been in the lab for over 3 years now. it is time to evolve, to offer some pain relief to those of you struggling with joins, performance and partitioning on the physical level. dv2.0 is a specification that solves a few of the issues that are brought on by the strict […]
people come to me all the time complaining about the number of joins in the data vault. i thought i might take a crack at answering this question/complaint, and provide some solid mathematical proof behind the fundamental design of the data vault model. i welcome your feedback (i hope you take the time to comment).