Select Page

Introducing the Data Vault Alliance

| May 20, 2019
Introducing the Data Vault Alliance

Today, at the WorldWide Data Vault Consortium (WWDVC) event in Stowe, Vermont, I launched the Data Vault Alliance, a new global community which seeks to unite Data Vault experts, vendors and practitioners and share best practices for Data Vault 2.0 with organizations worldwide. One of the primary reasons that I founded the Alliance was to provide IT practitioners with the right education, tools and resources to be successful with Data Vault 2.0 and reap its benefits for the organizations they serve as quickly as possible.

But what is a Data Vault, and why should organizations consider it? My colleague Michael Olschimke, CEO at Scalefree, discussed this in a recent webinar with WhereScape and here’s a quick explanation he shared:

A Quick Introduction

At a basic level, a Data Vault consists of three key categories of information:

  • Hubs – unique lists of business keys
  • Links – unique lists of relationships
  • Satellites – descriptive data over time

The hub sits at the heart of the methodology and is then connected via links to other hubs or satellite information. This satellite information is where all the “color” of useful data is held – including historical tracking of values over time. Examples of satellite information could include customer data, location data or individual information streams from different business units.

 
Together, combinations of these categories form the “network” of a Data Vault, a way of connecting together bits of information in a flexible, repeatable way that can enable a consistent development stream. At its core, using Data Vault 2.0 methodology helps businesses fuse together many different data streams from varying sources, in such a way as can deliver actionable, useable information for end users.

How Does a Data Vault Process Information?

The usual workflow for a Data Vault environment follows four stages:

  • Data is loaded from the source into either a relational data system or a data lake
  • Data is then broken down into individual “lego brick” components, and then built in a more targeted manner using simple ETL driven by metadata
  • One information is regrouped, enterprise business rules can be applied to turn these individual data fragments into useful information
  • Lastly, an overarching schema is applied – whether that is a star schema or a snowflake schema or something else entirely, this create the basis to overlay a dashboard tool ready to present back insights

At the end of this process, a fully formed Data Vault provides a solution that is almost “self-service business intelligence”, as well as a raw data stream where power users can create and write back solutions to their own user area, without affecting the core IT data warehouse. The question is, how do we get there?

Data vault automation can play a critical role here. As this workflow remains a constant repeatable process in the Data Vault, it is perfect for applying automaton to help organizations realize the benefits, faster. WhereScape® Data Vault Express™ offers exactly this capability – allowing businesses to achieve scalability and data consistency, as well as reaping the benefits of Data Vault 2.0 sooner.

For those wishing to learn more about Data Vault 2.0, and deepen their expertise in Data Vault 2.0 modeling, methodology and architecture, the Data Vault Alliance can provide you with access to world-class training, professional development certifications, organizational assessment tools, directories of authorized Data Vault 2.0 vendors and mentoring opportunities. You can view this video to learn more about the Data Vault Alliance. I encourage you to take a look at this new online community today.

And for those of you attending the WWDVC, I hope the knowledge and experience our presenters share at this week’s event provide you with many practical ideas to take back and implement within your organizations.  It’s an exciting time for the Data Vault community, and if you aren’t yet applying Data Vault 2.0, now is the perfect time for you to learn more and evaluate if it is right for your organization.

Shaping the Future of Higher Ed Data: WhereScape at EDUCAUSE 2025

October 27–30, 2025 | Nashville, TN | Booth #116 The EDUCAUSE Annual Conference is where higher education’s brightest minds come together to explore how technology can transform learning, streamline operations, and drive student success. This year, WhereScape is proud...

Data Foundation Guide: What It Is, Key Components and Benefits

A data foundation is a roadmap for how data from a variety of sources will be compiled, cleaned, governed, stored, and used. A strong data foundation ensures organizations get high-quality, consistent, usable, and accessible data to inform operational improvements and...

Data Automation: What It Is, Benefits, and Tools

What Is Data Automation? How It Works, Benefits, and How to Choose the Best Platform Data automation has quickly become one of the most important strategies for organizations that rely on data-driven decision-making.  By reducing the amount of manual work...

New in 3D 9.0.6: The ‘Repo Workflow’ Release

For modern data teams, the bottleneck isn’t just modeling - it comes down to how fast you can collaborate, standardize and move changes across environments. In developing WhereScape 3D 9.0.6, we focused on turning the repository itself into a first-class workflow...

Automating Data Vault 2.0 on Microsoft Fabric with WhereScape

Enterprises choosing Microsoft Fabric want scale, governance, and agility. Data Vault 2.0 (DV2) delivers those outcomes at the modeling level: Agility: add sources fast, without refactoring the core model. Auditability: every change is tracked; nothing is thrown away....

Unlocking ROI in Microsoft Fabric with WhereScape Automation

When organizations first evaluate Microsoft Fabric, the promise is clear: unified data, simplified architecture, and faster insights. But the real questions come down to ROI: How quickly can your team deliver governed analytics on Fabric? How much manual effort is...

The Fabric Complexity Challenge: Why Automation is Key

Microsoft Fabric is an undeniably powerful platform. By bringing together OneLake, Fabric Data Warehouse, Data Factory, Power BI and Purview, it creates a unified analytics ecosystem for modern enterprises. But as many teams quickly discover, power often comes with...

Related Content

Data Foundation Guide: What It Is, Key Components and Benefits

Data Foundation Guide: What It Is, Key Components and Benefits

A data foundation is a roadmap for how data from a variety of sources will be compiled, cleaned, governed, stored, and used. A strong data foundation ensures organizations get high-quality, consistent, usable, and accessible data to inform operational improvements and...