welcome to my thoughts on data vault 2.0 in 2015
in this entry i’ll start with a year in review (2014), and then move on to discuss the new pieces i’m bringing to market in 2015. i am super excited to announce what is coming this year, including (at long last) my dv2 boot camp & private certification course on-line!! it will be available very soon.
year in review!
2014 brought lots of changes, and lots of advancements for me. i’ve been super busy all over the world (right now, the market is heating up all across asia, new zealand, australia, and south america). data vault 2.0 brought with it the use of big data, nosql techniques, and in my upcoming conference: wwdvc (world wide data vault consortium), we will be discussing (from a customer case-study perspective) just what dv2 has been doing for customers.
the conference focuses on customer case studies this year (with the exception of the keynote presentation by claudia imhoff).
what am i bringing to the table?
my business partner sanjay & i will be discussing the impact of nosql and big data on data vault 2.0. i will be talking about different types of methods to model, why dv2 changes the landscape, and what big data is doing (in general) to the data modeling world. sanjay will be diving in to pros & cons of specific nosql technology, discussing the storage mechanisms, and modeling concerns that we have.
2014 brought me new partnerships…
in 2014, i partnered with the following companies, and work with them closely to help them provide authorized dv2 boot camp and private certifications. you should talk with me, or my partners for information about local data vault 2.0 certified resources.
- analytics8 – in australia, new zealand, and asia pacific regions
- doerffler & partner – based in germany, they handle all of europe today
i am currently in discussions with additional companies in different parts of the world to align more partnerships, to help serve you better.
speaking of which, i also partnered with a software vendor in 2014:
- analytixds.com – us & global, providing mapping manager / code automation for data vault 2.0 projects
their mapping manager product is the only licensed product authorized to generate authentic data vault 2.0 loading code and template designs.
what customers are currently under way with data vault 2.0?
well, i can’t disclose all the customers, but here are a list of a few companies who have been engaging with me directly (along with my business partners: doerffler & partner, and analytics8) to bring dv2 to the market:
- allianz global, australia
- commonwealth bank, australia
- qsuper, australia
- intact financial, canada
- department of defense, usa
- ultimate software, usa
- gmac financial, usa
- wells fargo bank, usa
- bank of hawaii, usa
and the list goes on!! the best thing for you and your company to do, is engage me, or one of my authorized training partners for one of our three packages.
- kick start package – 3 days boot camp + dv2 certification + 7 days on-site building/implementation
- advanced accelerator package – 1 to 3 days training (negotiable) + 7 to 9 days on-site building/implementation (goes beyond the basics levels offered in the kick start)
- program assessment package – 2 weeks on-site (generally), full program assessment, including dv2 model, methodology, team communication, requirements processes, delivery processes, agile principles in practice and more.
what about dv2 and agility?
data vault 2.0 is a huge boost to productivity, scalability, and methodology. in fact, i am pleased to announce a partnership with scott ambler (father of the agile manifesto). scott has backed the agile components of dv2 for a few years now. with the advent of the dv2 agile methodology approach, he is squarely behind the principles of what we teach (agile for data warehousing and enterprise analytics). you can see more about the agile side of data warehousing by checking out kent graziano’s brand new 1 hour course.
what’s happening to datavaultalliance.com?
datavaultalliance.com web site is going under construction shortly. or should i say, re-construction. we have moved all our training course content to our new professional learning management system (lms).
our original site: http://datavaultalliance.com will be changed, and rebuilt soon to reflect the needs of the new platform, and to interact with the content delivery on the back-end. bear with us as we take the next few months to re-build the site and re-launch it.
in another post (shortly) i will explain how to use the new platform, and offer a few videos / sneak peeks if you will, that walk you through *what you will see* if you purchase a course!!
this is simply the way the platform works today. in the future (after the rebuild of datavaultalliance.com is complete, that might change).
more on the new platform later… for now, back to discussions of what comes next.
dv2 – what comes next?
well, there will be a lot of discussion on this topic at the wwdvc conference coming up, but i’ll give you a preview of what is to come this year. these are my opinionated predictions:
- big data will finally “buckle down” and have to be run like all other projects, with agility, governance, and security – not to mention master data, metadata, and auditability controls. just because we are working with (or going to work with) big data, doesn’t mean we can simply “let go” of all the best practices in the analytics and data management industry!! in reality, big data is just that – lots and lots of data in disparate formats.we still will need to address structuring, restructuring, aggregation, coalescing, interpretation, standardization just to name a few (all business rules) in order to turn data into information.
- “data marts” will be re-named: “information marts” (in accordance with the above statements). data mart is truly a misnomer for what we provide to business. but this is an industry prediction rather than a dv2 prediction.
- nosql will finally be recognized as just another infrastructural platform. hey!! look at that! it can and should co-exist naturally with my long-living relational database engines that i have invested so much in over the years. nosql is not “ready” yet to take over all the functionality that traditional rdbms stores provide. that day may come, but i predict not until q4-2015 will it be ready for sandboxes in corporations. it will take a lot longer than that to address the security and accessibility issues, along with the ad-hoc access issues, and the latency and consistency issues – before a nosql platform can actually be turned in to a serious production edw + analytics platform.
- the implementers in the edw/bi space will need to learn data mining techniques, including mathematical models, predictive models, neural models, and so on… “r” and sas and other mining tools will become extremely important in parsing through, weeding out the data in the big data /nosql platforms. these mining engines are the data profiling engines of tomorrow.
- dv2 – will continue to offer agility training, in fact, increasing it’s strength and reliance on agile best practices while maintaining their applicability to enterprise data warehousing programs.
- dv2 modeling standards and rules will adapt for specific capabilities on nosql platforms based on storage needs
- business keys, ontologies and metadata will become more important than ever – and will move to the fore-front as a means to assist in the translation of business requirements to implemented solutions. particularly as the implementers move in to managed self-service bi
- there will be a rise of managed self-service bi in the corporate world. better tools are needed in this space, there are lots of niches not being met today.
internet-of-things… um really? we don’t need to discuss this, or we shouldn’t – unless we are the manufacturers of these devices. in truth, even for the manufacturers, it’s all about the data collection. the iot data is no different than today’s machine generated data sets. the good news? it will flow in real-time directly to hadoop or other nosql style machines. the other good news? you should already understand how to deal with machine generated data sets, and incorporate it in to your edw / bi and analytics projects.
the bad news? well, if there is any… it will be learning how to separate the “mundane” generated data from the “interesting” data coming from the machine generated data sets in the iot.
there will be more announcements on dv2 coming soon, particularly as the conference nears. if you have not yet gotten your room or purchased your entry ticket, you should do this, as the seats do fill up fast!! go to: http://wwdvc.com to register.
hope to see you soon!