I’ve begun researching Data Vault models on Hadoop solutions, including HadoopDB and Hive. Recently I came across a number of articles that describe in detail how Hive and HadoopDB work on top of Hadoop. I had to take a minute to write this article to explain my viewpoints on using the Data […]
Tag Archives: data vault model
#datavault #hadoop – Additional Information
Second look at technology stacks in the Hadoop, MapReduce arena, and beginning to consider the technology to implement for Data Vault model testing.
Column Based Data Vaults #datavault
Column based data stores have been around for a long time. This blog will talk about what you need to do to make the Data Vault successful on a column based data store. It’s quite simple really, and in the end – some parts of physical data modeling don’t matter in a column based data […]
Mathematics of Joins, Denormalization, Rows Per Block and IO
People come to me all the time complaining about the NUMBER OF JOINS in the Data Vault. I thought I might take a crack at answering this question/complaint, and provide some solid mathematical proof behind the fundamental design of the Data Vault model. I welcome your feedback (I hope you take the time to comment).
Hardcore Table Comparisons: Dimensional and Data Vault
There are a lot of comments and questions out there about the Data Vault model, particularly from those who claim to know it and understand it. Yet some of them have neither a) engaged me, nor b) gone to take the certification class. Reasons for the DV model are buried in the methodology – a […]
Clash of the Titans: Post on Kimball Forum
The Data Vault Model is NOT the be-all-end-all solution to data warehousing; it IS a single evolutionary step forward, as it is a Hybrid design. If you don’t feel pain in your current implementation, then you may not be a candidate for the Data Vault model and methodology.
#dvseminar – Data Vault Seminar Smashing Success
#dvseminar was fantastic. As an informal way of gathering information, if you’d really like to attend next year, please leave a comment at the end of this post with your name and email. We can put you on the notification list for next year! This post covers some highlights (from my perspective); I’d love to […]
Code Generation for Data Vault, not as easy as you think!
A short discussion about the inputs you should be using to generate ETL / ELT and Data Vault Models. If your current vendor isn’t there yet, that’s ok; just ask them when they will be, and why they haven’t gotten there yet.
Opportunity, DV Consulting, US-Wash DC
I have a potential opportunity for consultants, or a team of implementation consultants interested in working for one of my prospective clients. The gig is just now being specified, could be several months long. It’s in the US, Washington DC area. I need 3 to 4 CERTIFIED Data Vault consultants who may be interested, or […]
How do YOU define Staging Out?
There is a lot of talk these days about what Staging Out means, what Business Data Vault means, even what the term Data Mart means. In this entry I want to hear from you. Please enter a comment on this post, and tell me what you think “staging out” really means within the context of […]