Skip to main content

Tiered storage in Aiven for ClickHouse® Limited availabilty

Discover the tiered storage capability in Aiven for ClickHouse®. Learn how it works and explore its use cases. Check out why you might need it and what benefits you get using it.

Overview

The tiered storage feature introduces a method of organizing and storing data in two tiers for improved efficiency and cost optimization. The data is automatically moved to an appropriate tier based on your database's local disk usage. On top of this default data allocation mechanism, you can control the tier your data is stored in using custom data retention periods.

The tiered storage in Aiven for ClickHouse consists of the following two layers:

  • SSD - the first tier: Fast storage device with limited capacity, better suited for fresh and frequently queried data, relatively costly to use
  • Object storage - the second tier: Affordable storage device with unlimited capability, better suited for historical and more rarely queried data, relatively slower

Why use it

By enabling and properly configuring the tiered storage feature in Aiven for ClickHouse, you can use storage resources efficiently and, therefore, significantly reduce storage costs of your Aiven for ClickHouse instance.

How it works

After you enable the tiered storage feature, Aiven for ClickHouse by default stores data on SSD until it reaches 80% of its capacity. After exceeding this size-based threshold, data is stored in object storage.

Optionally, you can configure the time-based threshold for your storage. Based on the time-based threshold, the data is moved from your SSD to object storage after a specified time period.

note

Backups are taken for data that resides both on SSD and in object storage.

Typical use case

In your Aiven for ClickHouse service, there is a significant amount of data that is there for a while and is hardly ever accessed. It's stored on SSD and, thus, high-priced. You decide to enable tiered storage for your service to make your data storage more efficient and reduce the costs. For that purpose, you contact the sales team at sales@aiven.io to have it enabled on your project, and you enable the feature on tables you want to optimize. You configure the time-based threshold to control how your data is stored between the two layers.

Limitations

  • When enabled, the tiered storage feature cannot be deactivated.

    tip

    As a workaround, you can create a new table (without enabling tiered storage on it) and copy the data from the original table (with the tiered storage feature enabled) to the new table. As soon as the data is copied to the new table, you can remove the original table.

  • With the tiered storage feature enabled, it's not possible to connect to an external existing object storage or cloud storage bucket.

What's next