Locking in a Data Vault

| September 25, 2017

So, I’m playing a little with words here. I’m certainly not advocating locking anybody or anything in a Data Vault. I want to share how you can lock in success as you design and deliver your new Data Vault. I assume you have your business people fully on board as discussed in this recent blog. If not, I advise you to go back and do that first. This blogpost is aimed to specifically assist your development team.

Most of us are challenged by change. And developers are little different. They are typically very comfortable with a set of design approaches and tools learned in the past and it routinely frames their perspective on how to tackle the future. Combining the comfort of old ways with the tight timeframes and pressures of today’s business requests seldom leads to taking time to explore new options. As a result, it is easy for teams to be weighed down by outdated, limiting approaches to data infrastructure.

What we’ve learned with the evolution of the Data Vault methodology and data warehouse automation (DWA) over the past decade is that some areas within the data warehouse development process are broken. Dan Linstedt and the other contributors to the Data Vault model in the early 2000’s recognized early on that the traditional data models were not able to meet the quality and agility goals of a data warehouse serving a modern data-focused business. I have provided some of this background in this recent white paper.

The Data Vault is constructed from some very carefully defined primitives, such as hubs, links and satellite tables, that must be defined and populated in specific ways to work as intended. If developers use old approaches or, worse still, make up new ones themselves, disaster will follow.

In Data Vault 2.0, Linstedt has provided a methodology to drive best practice in the design of the data model and in the development of the function that populates it. Methodologies are great: I rely on a wonderful methodology for manually raising my computer screen to the ideal height as I write this post. But, within development teams, such behavior will lead to inconsistent approaches to development; result in delays in future maintenance as other developers struggle to understand different coding styles; and ultimately will lead to a skills loss for your organization when your cleverest developer dies in a freak coding accident.

WhereScape® Data Vault Express addresses these issues by encoding the templates of the Data Vault components, and employing best practices in population processes and development methods within an automated, metadata-driven design and development environment. Starting in initial design collaboration between IT and business people, design choices are encoded in metadata to auto-generate the code and scripts responsible for defining Data Vault tables and populating them with the correct data, ensuring design consistency and completeness, and coding conformity to a single set of standards. Traceability is enforced and maintenance eased. Additionally, as your developers work, all is documented automatically—a task few enjoy or have the time to complete.

Locking in the Data Vault is all about maintaining consistency, ensuring complete documentation, and auto-generating best-practice model and code assets across design and development. As I discuss in this white paper Meeting the Six Data Vault Challenges and within this recent recorded webcast, data warehouse automation is the logical foundation. And while change is hard, development teams will benefit greatly from an openness to doing it differently.

Coming soon, some thoughts on Living in a Data Vault.

You can find the other blog posts in this series here:


Dr. Barry Devlin is among the foremost authorities on business insight and one of the founders of data warehousing, having published the first architectural paper on the topic in 1988. Barry is founder and principal of 9sight Consulting. A regular blogger, writer and commentator on information and its use, Barry is based in Cape Town, South Africa and operates worldwide.

A Webinar Recap: Exploring Data Automation Levels with Kent Graziano

Our most recent webinar, "The Future of Data Warehousing: Understanding Automation Levels," hosted by Patrick O'Halloran, Solutions Architect, and esteemed guest speaker Kent Graziano dove into the transformative world of data warehouse automation. They discussed its...

Overcoming Challenges with AI Hallucinations

Conversing with your digital assistant on your smartphone, using facial recognition for security, traveling in autonomous vehicles, or browsing recommended products based on your search history - there is no denying AI is embedded in many aspects of our lives. AI has...

Navigating Data Governance with WhereScape 3D

Properly managing and organizing data allows businesses to not only understand crucial patterns and trends, but also to leverage that data in strategic ways that grow revenue over time. Data drives decision-making and paves the way for innovation when used properly....

Deep Dive into WhereScape RED: Features and Benefits

Transforming a business’s various databases and files into actionable insights and reports is crucial, but incredibly time-consuming with traditional tools. Fortunately, with data warehouse automation tools like WhereScape RED, organizations can take advantage of a...

ETL vs ELT: What are the Differences?

In data management, the debate between ETL and ELT strategies is at the forefront for organizations aiming to refine their approach to handling vast amounts of data. Each method, ETL vs ELT, offers a unique pathway for transferring raw data into a warehouse, where it...

How to Hire and Retain Data Warehouse Developers

The projected data warehouse developer job growth rate is 21% from 2018-2028, with about 284,100 new jobs for data warehouse developers projected over the next decade, according to Zippia. This surge in demand for data warehouse talent is being felt across businesses...

8 Reasons to Make the Switch to ELT Automation

Extraction, loading, and transformation (ELT) processes have been in existence for almost 30 years. It has been a programming skill set mandatory for those responsible for the creation of analytical environments and their maintenance because ELT automation works....

Related Content

Overcoming Challenges with AI Hallucinations

Overcoming Challenges with AI Hallucinations

Conversing with your digital assistant on your smartphone, using facial recognition for security, traveling in autonomous vehicles, or browsing recommended products based on your search history - there is no denying AI is embedded in many aspects of our lives. AI has...

Overcoming Challenges with AI Hallucinations

Overcoming Challenges with AI Hallucinations

Conversing with your digital assistant on your smartphone, using facial recognition for security, traveling in autonomous vehicles, or browsing recommended products based on your search history - there is no denying AI is embedded in many aspects of our lives. AI has...

Navigating Data Governance with WhereScape 3D

Navigating Data Governance with WhereScape 3D

Properly managing and organizing data allows businesses to not only understand crucial patterns and trends, but also to leverage that data in strategic ways that grow revenue over time. Data drives decision-making and paves the way for innovation when used properly....