What is a Cloud Data Warehouse?

| May 12, 2020

What is a Cloud Data Warehouse?

A cloud data warehouse is a database service hosted online by a public cloud company. It has the functionality of an on-premises database but is managed by a third party, can be accessed remotely and its memory and compute power can be shrunk or grown instantly.

Traditional Vs. Cloud Data Warehouse Differences

A traditional data warehouse is an architecture for organising, storing and accessing ordered data, hosted in a data centre on premises owned by the organisation whose data is stored within it. It is of finite size and power and is owned by that organisation.

cloud data warehouse is a flexible volume of storage and compute power, which is part of a much bigger public cloud data centre and is accessed and managed online. Storage and compute power is merely rented. Its physical location is largely irrelevant apart from for countries and/or industries whose regulations dictate their data must be stored in the same country.

Benefits of Cloud Data Warehouse

The benefits of a Cloud Data Warehouse can be summarised in five main points:

1. Access
Rather than having only physical access to databases in data centres, cloud data warehouses can be accessed remotely from anywhere. As well as being convenient for staff who live near the data centre, who can now troubleshoot from home or anywhere out of hours if needed, this access means companies can hire staff based anywhere, which opens up talent pools that were previously unavailable. Cloud data warehousing is self-service and so its provision does not depend on the availability of specialist staff.

2. Cost
Data centres are expensive to buy and maintain. Property to store them in needs to be properly cooled, insured and expertly staffed, and the databases themselves come at a huge cost. Cloud data warehousing allows the same service to be enjoyed, but you only pay for the computing and storage power you need, when you need it. Now with elastic cloud services such as Snowflake, compute and storage can be bought separately, in different amounts. So you now really only have to pay for what you are using, and you can instantly close or downsize capabilities you no not need.

3. Performance
Cloud service providers compete to offer the use of the most performant hardware for a fraction of the coast that would be incurred to reproduce such power on-premises.
Upgrades are performed automatically, so you always have the latest capabilities and do not experience downtime in upgrading to the latest ‘version’. Some on-premises databases offer faster performance, but not at the cost and availability of the ‘Infrastructure-as-a-service’ that Cloud providers offer.

4. Scalability
Opening a Cloud data warehouse is as simple as opening an account with a provider such as Microsoft Azure, AWS Redshift, Google BigQuery and Snowflake. The account can be grown and shrunk, or even closed instantly. Users are aware of the costs involved before they change the amount of compute or storage they rent. This scalability has led to the coining of the phrase ‘Elastic Cloud’.

5. Agility
Hosting data in a Cloud data warehouse means you can switch providers if and when it suits changes in business strategy. Staying database-agnostic means you have the agility to upsize, downsize or switch completely. Metadata-driven automation software like WhereScape allows you to lift and shift entire data infrastructures on and off Cloud data warehouse if desired, and allows different teams within the same company to work with the database and hybrid cloud structure that best suits their needs, as seen at Legal & General.

Choosing a Cloud Data Warehouse Solution

A cost analysis is vital in estimating how much money a Cloud Data Warehouse would save the business. Different Cloud providers have different pricing structures that need bearing in mind. More established providers such as Amazon and Microsoft rent nodes and clusters, so your company uses a defined section of the server. This makes pricing predictable and constant, but sometimes maintenance to your particular node is needed.

Snowflake and Google offer a ‘serverless’ system, which means the cluster locations and numbers are not defined and so are irrelevant. Instead, the customer is charged for exactly the amount of compute or processing power it consumes. However, in bigger companies it is often difficult to predict the number of users and size of a process before it occurs. It is possible for queries to be much bigger was assumed and so cost much more than was expected. Each cloud provider has its own suite of supporting tools for functions such as data management, visualisation and predictive analytics, so these needs should be factored when deciding on which provider to use.

Whichever Cloud provider or modelling style you choose, you can always get your data warehouse solution online and usable faster with Data Automation. For more information on how WhereScape can help, please click here.

WhereScape Announces the Release of RED 10.0.0.0

WhereScape is pleased to announce the general availability of WhereScape RED 10.0.0.0. This release is the culmination of man-years of effort. It confirms WhereScape’s commitment to continuing to develop new technologies and tools and its commitment to delivering the...

Effective AI through Data Modeling

As we journey deeper into the digital age, the importance of data modeling within the broader landscape of artificial intelligence (AI) has become more pronounced than ever. The success of AI-driven initiatives is tightly woven with the quality and structure of the...

Is Data Vault 2.0 Still Relevant?

TL;DR  Yes. Data Vault 2.0 Data Vault 2.0 is a database modeling method published in 2013. It was designed to overcome many of the shortcomings of data warehouses created using relational modeling (3NF) or star schemas (dimensional modeling). Speci fically, it...

Data Vault 2.0 Resources

Data Vault Revisited: A Six-Year Journey into the Secure Data Repository In 2017, Dr. Barry Devlin provided valuable insights about Data Vaults, a concept that sparked interest among businesses and IT professionals. Data Vaults were envisioned as secure repositories...

Understanding Data Vault 2.0

How to Avoid Pitfalls During Data Vault 2.0 Implementation Implementing a data vault as your Data Modeling approach has many advantages, such as flexibility, scalability, and efficiency. But along with that, one must be aware of the challenges that come along with...

Navigating the AI Landscape

The Pivotal Role of Data Modeling In the rapidly evolving digital age, artificial intelligence (AI) has emerged as a game-changer, deeply impacting the business landscape. Its ability to automate operations, refine decision-making processes, and significantly enhance...

Information Management Maturity

Unlocking Your Business Potential: Understanding and Enhancing Information Management Maturity In a recent report by Gartner, they emphasize the crucial role of information in the current business environment, stating, "Through 2025, organizations that are data-driven...

Data Warehousing Best Practices

In modern times, organizations are daily generating huge volumes of data. Appreciating the significance of data, companies are storing data from different departments which can be analyzed to gather insights to help the organization in better decision-making. This...

Related Content

WhereScape Announces the Release of RED 10.0.0.0

WhereScape Announces the Release of RED 10.0.0.0

WhereScape is pleased to announce the general availability of WhereScape RED 10.0.0.0. This release is the culmination of man-years of effort. It confirms WhereScape’s commitment to continuing to develop new technologies and tools and its commitment to delivering the...

WhereScape Announces the Release of RED 10.0.0.0

WhereScape Announces the Release of RED 10.0.0.0

WhereScape is pleased to announce the general availability of WhereScape RED 10.0.0.0. This release is the culmination of man-years of effort. It confirms WhereScape’s commitment to continuing to develop new technologies and tools and its commitment to delivering the...

Effective AI through Data Modeling

Effective AI through Data Modeling

As we journey deeper into the digital age, the importance of data modeling within the broader landscape of artificial intelligence (AI) has become more pronounced than ever. The success of AI-driven initiatives is tightly woven with the quality and structure of the...