Deployment Types

This page describes the types of cluster deployments in AWS, which depend on the instance types being used and their configuration.

Prerequisites for Deployment

  • Check that your AWS account limits allow for the deployment of your selected configuration (it is possible to check your limits under the Limits tab in the EC2 console).

  • Deploying a Weka cluster in AWS requires at least 6 EC2 instances with SSD/NVMe drives (a.k.a instance store), and potentially additional instances that may connect as clients.

  • Weka must have access to instance metadata

    • Only IMDSv1 is supported if using the Instance Metadata service.

Note: It is possible to set client hosts with IMDSv2, but, they would not benefit from seamless cloud configuration and should be manually managed similarly to Adding Clients in bare-metal installations.

  • When deploying in AWS not using the CloudFormation template, or when additional capabilities are added after deployment (e.g., tiering), it is required to provide permissions to several AWS APIs, as described in IAM Role Created in Template.

  • Ensure you have enough available IP addresses in the selected subnet, as each core allocated to Weka required an IP address.

Depending on the instance types being used and how they’re configured, there are two deployment types:

Client Backend Deployment

In a client backend deployment, two different types of instances are launched:

  • Backend Instances: Instances that contribute their drives and all possible CPU and network resources.

  • Client Instances: Instances that connect to the cluster created by the backend instances and run an application using one or more shared filesystems.

In client backend deployments, it is possible to add or remove clients according to the resources required by the application at any given moment.

Backend instances can be added to increase the cluster capacity or performance. They can also be removed, provided that they are deactivated to safely allow for data migration.

Note: Stopping or terminating backend instances causes a loss of all data of the instance store. Refer to Amazon EC2 Instance Store for more information.

Converged Deployment

Converged deployments are more generic deployments in which every instance is configured to contribute resources of some kind — drives, CPUs, and/or network interfaces - to the cluster.

The deployment of a converged cluster is typically selected in the following cases:

  • When using very small applications that require a high-performance filesystem but do not require many resources themselves, in which case they can use resources in the same instances storing the data.

  • When cloud-bursting an application to AWS, in which case you seek to utilize as many resources as possible for the application but also seek to provide as many resources as possible to the Weka system cluster, in order to achieve maximum performance.

Last updated