SSD capacity management

Understand the key terms of WEKA system capacity management and the formula for calculating the net data storage capacity.

Raw capacity

Raw capacity is the total capacity on all the SSDs assigned to a WEKA system cluster. For example, 10 SSDs of one terabyte each have a total raw capacity of 10 terabytes. This is the total capacity available for the WEKA system. This will change automatically if more servers or SSDs are added.

Net capacity

Net capacity is the space for user data on the SSDs in a configured WEKA system. It is based on the raw capacity minus the WEKA filesystem overheads for redundancy protection and other needs. This will change automatically if more servers or SSDs are added.

Stripe width

The stripe width is the number of blocks with a common protection set, ranging from 3 to 16. The WEKA system has distributed any-to-any protection. Consequently, in a system with a stripe width of 8, many groups of 8 data units spread on various servers protect each other (rather than a group of 8 servers forming a protection group). The stripe width is set during the cluster formation and cannot be changed. Stripe width choice impacts performance and space.

If not configured, the stripe width is set automatically to: #Failure Domains - Protection Level -1.

Protection level

Protection Level refers to the number of extra protection blocks added to each data stripe in your storage system. These blocks help protect your data against hardware failures. The protection levels available are:

Protection level 2: Can survive 2 concurrent disk or server failures.
Protection level 4: Can survive 4 concurrent disk failures or 2 concurrent server failures.

A higher protection level means better data durability and availability but requires more storage space and can affect performance.

Key points:

Durability:
- Higher protection levels offer better data protection.
- Level 4 is more durable than level 2.
Availability:
- Ensures system availability during hardware failures.
- Level 4 maintains availability through more extensive failures compared to level 2.
Space and performance:
- Higher protection levels use more storage space.
- They can also slow down the system due to additional processing.
Configuration:
- The protection level is set during cluster formation and cannot be changed later.
- If not configured, the system defaults to protection level 2.

Failure domains (optional)

A failure domain is a group of WEKA servers that can fail concurrently due to a single root cause, such as a power circuit or network switch failure.

A cluster can be configured with explicit or implicit failure domains:

In a cluster with explicit failure domains, each group of blocks that protect each other is spread on different failure domains.
In a cluster with implicit failure domains, the group of blocks is spread on different servers, and each server is a failure domain. Additional failure domains can be added, and new servers can be added to any existing or new failure domain.

This documentation relates to a homogeneous WEKA system deployment. That is, the same number of servers per failure domain (if any) and the same SSD capacity per server. For information about heterogeneous WEKA system configurations, contact the Customer Success Team.

Hot spare

A hot spare is the number of failure domains that the system can lose, undergo a complete rebuild of data, and still maintain the same net capacity. All failure domains are constantly participating in storing the data, and the hot spare capacity is evenly spread within all failure domains.

The higher the hot spare count, the more hardware is required to obtain the same net capacity. On the other hand, the higher the hot spare count, the more relaxed the IT maintenance schedule for replacements. The hot spare is defined during cluster formation and can be reconfigured anytime.

Note: If not configured, the hot spare is automatically set to 1.

WEKA filesystem overhead

After deducting the protection and hot spare capacity, only 90% of the remaining capacity can be used as net user capacity, with the other 10% of capacity reserved for the WEKA filesystems. This is a fixed formula that cannot be configured.

Provisioned capacity

The provisioned capacity is the total capacity assigned to filesystems. This includes both SSD and object store capacity.

Available capacity

The available capacity is the total capacity used to allocate new filesystems, net capacity minus provisioned capacity.

Deductions from raw capacity to obtain net storage capacity

The net capacity of the WEKA system is obtained after the following three deductions performed during configuration:

The level of protection required is the storage capacity dedicated to system protection.
The hot spare(s) is the storage capacity set aside for redundancy and to allow for rebuilding following a component failure.
WEKA filesystem overhead to improve overall performance.

SSD net storage capacity calculation

Examples:

Scenario 1: A homogeneous system of 10 servers, each with one terabyte of Raw SSD Capacity, one hot spare, and a protection scheme of 6+2.

SSD Net Capacity = 10 TB * (10-1) / 10 * 6/(6+2) * 0.9 = 6.075 TB

Scenario 2: A homogeneous system of 20 servers, each with one terabyte of Raw SSD Capacity, two hot spares, and a protection scheme of 16+2.

SSD Net Capacity = 20 TB * (20-2) / 20 * 16/(16+2) * 0.9 = 14.4 TB

PreviousOptimize redundancy in WEKA deployments NextFilesystems, object stores, and filesystem groups

Last updated 1 year ago