# Weka installation on AWS

If you already have an AWS account and are familiar with basic AWS concepts and services, you can skip this section.

To install a Weka system in AWS, you need to [create an AWS account](https://aws.amazon.com/account/).

Make sure you are familiar with the following concepts and services that are used as part of the Weka system deployment:

* [IAM](https://docs.aws.amazon.com/IAM/latest/UserGuide/introduction.html) - Identity and access management
* [VPCs](https://docs.aws.amazon.com/vpc/latest/userguide/what-is-amazon-vpc.html), [subnets](https://docs.aws.amazon.com/vpc/latest/userguide/VPC_Subnets.html), and [security groups](https://docs.aws.amazon.com/vpc/latest/userguide/VPC_SecurityGroups.html)
* [EC2](https://aws.amazon.com/documentation/ec2/) instances and [SSH keys](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-key-pairs.html)
* [S3](https://docs.aws.amazon.com/AmazonS3/latest/dev/Introduction.html) - Object store (to be used for tiering data)
* [CloudFormation](https://aws.amazon.com/documentation/cloudformation/)

During the deployment of the WEKA system, the EC2 instances require internet access to download the WEKA software. For this reason, deploy the Weka system using one of the following deployment types in AWS:

* **Public subnet:** Use a public subnet within your VPC with an internet gateway, and allow public IP addresses for your instances.
* **Private subnet with NAT Gateway:** Create a private subnet with a route to a NAT gateway with an elastic IP in the public subnet.
* **Private subnet using Weka VPC endpoint:** Requires the creation of a [prerequisites stack](#prerequisites-cloudformation-stack) (once per VPC) that creates the necessary resources.
* **Private subnet using custom proxy:** Requires the creation of a [prerequisites stack](#prerequisites-cloudformation-stack) (once per VPC) that creates the necessary resources.

The following diagrams illustrate the components of the *public subnet* and *private subnet with NAT gateway* deployment types in AWS:

![AWS subnet options for WEKA deployment](/files/07NyhUmRYosF67FVZBM9)
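To check which deployment type an existing subnet corresponds to, you can inspect the default route of its route table. The following is a minimal sketch, not an official WEKA tool: the function name `classify_egress` and the returned labels are hypothetical, and the input is assumed to be a single route table dictionary in the shape returned by the EC2 `DescribeRouteTables` API. A default route to an internet gateway is only one part of a public subnet deployment; the instances also need public IP addresses.

```python
from typing import Any, Dict


def classify_egress(route_table: Dict[str, Any]) -> str:
    """Classify a route table by the target of its default IPv4 route.

    The labels returned here are illustrative names, not AWS or WEKA terms.
    """
    for route in route_table.get("Routes", []):
        if route.get("DestinationCidrBlock") != "0.0.0.0/0":
            continue  # only the default route decides internet egress
        if route.get("GatewayId", "").startswith("igw-"):
            return "public"  # default route to an internet gateway
        if route.get("NatGatewayId"):
            return "private-nat"  # default route to a NAT gateway
    # No default route to an IGW or NAT gateway: a VPC endpoint or proxy is needed.
    return "private-no-default-route"


# Example route tables (all IDs are made-up placeholders).
public_rt = {"Routes": [
    {"DestinationCidrBlock": "10.0.0.0/16", "GatewayId": "local"},
    {"DestinationCidrBlock": "0.0.0.0/0", "GatewayId": "igw-0123456789abcdef0"},
]}
nat_rt = {"Routes": [
    {"DestinationCidrBlock": "10.0.0.0/16", "GatewayId": "local"},
    {"DestinationCidrBlock": "0.0.0.0/0", "NatGatewayId": "nat-0123456789abcdef0"},
]}
isolated_rt = {"Routes": [
    {"DestinationCidrBlock": "10.0.0.0/16", "GatewayId": "local"},
]}

for name, rt in [("public", public_rt), ("nat", nat_rt), ("isolated", isolated_rt)]:
    print(name, "->", classify_egress(rt))
```

A subnet classified as `private-no-default-route` matches the last two deployment types in the list above, which require the prerequisites stack.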

## Update the number of vCPU limits in EC2

By default, the AWS vCPU limit for an account is too low to install a WEKA system. Use the Limits Calculator for your region, available from the AWS EC2 dashboard, to work out the number of vCPUs you need and request an increase.

**Procedure**

1. On the AWS EC2 dashboard, select the **Limits** option from the left menu.

![EC2 Limits location](/files/N0zWzYqisWhC1DvHKURd)

2. In the Limits Calculator, do the following:

* In **Current Limit**, set the number of vCPUs you currently have for the region.
* In **vCPUs needed**, set the number of vCPUs required for your specific deployment.

3. Select the **Request on-demand limit increase** link to get more vCPUs.

{% hint style="info" %}
**Note:** A vCPU limit increase is not immediate. AWS can take from minutes to days to evaluate and approve your request.
{% endhint %}

The following example shows the required vCPUs for a six-node cluster with two clients of type i3en.2xlarge, which is the smallest instance type for a WEKA system deployment.

![Limits Calculator](/files/mbo06HqA71C99jt40Xg0)
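The arithmetic behind this example can be sketched as follows. This is an illustrative calculation, not part of the WEKA tooling: the function names are hypothetical, the sketch assumes that the two clients use the same i3en.2xlarge instance type as the six backends, and the current limit of 32 vCPUs is a made-up value. An i3en.2xlarge instance has 8 vCPUs.

```python
# vCPUs per instance type; i3en.2xlarge has 8 vCPUs.
VCPUS_PER_INSTANCE = {"i3en.2xlarge": 8}


def required_vcpus(instance_type: str, backends: int, clients: int) -> int:
    """Total vCPUs, assuming backends and clients share one instance type."""
    return VCPUS_PER_INSTANCE[instance_type] * (backends + clients)


def additional_vcpus_needed(required: int, current_limit: int) -> int:
    """vCPUs to request on top of the current limit (never negative)."""
    return max(0, required - current_limit)


needed = required_vcpus("i3en.2xlarge", backends=6, clients=2)
print(needed)  # 64

# Hypothetical current limit of 32 vCPUs for the region.
print(additional_vcpus_needed(needed, current_limit=32))  # 32
```

Enter the resulting values in the **vCPUs needed** and **Current Limit** fields of the Limits Calculator.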

## Best practices after the installation on AWS

### Backup and recovery

#### Resiliency

The Weka system is a distributed cluster protected against the failure of 2 or 4 failure domains, and it provides fast rebuild times, as described in the [Weka system overview](/4.0/overview/about.md#weka-functionality-features) section.

#### Instance failure

If an instance failure occurs, the Weka system [rebuilds](/4.0/overview/about.md#distributed-network-scheme) the data. [Add a new instance to the cluster](/4.0/usage/expanding-and-shrinking-cluster-resources/stages-in-adding-a-backend-host.md) to restore the compute and storage capacity lost with the failed instance.

#### Upload snapshots to S3

It is advisable to use periodic (incremental) snapshots to back up the data and protect it from multiple EC2 instance failures.

The recovery point objective (RPO) is determined by the cadence at which the snapshots are taken and uploaded to S3. The RPO varies with the type of data, regulations, and company policies, but it is advisable to upload snapshots ([Snap-To-Object](/4.0/fs/snap-to-obj.md#about-snap-to-object)) of the critical filesystems at least daily.
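As a rough illustration of how the cadence translates into an RPO, the sketch below estimates the worst-case data-loss window. It is a simplification under stated assumptions: only fully uploaded snapshots are considered recoverable, each upload starts as soon as its snapshot is taken, and the 2-hour upload duration is a hypothetical value. The function name `worst_case_rpo` is not a WEKA term.

```python
from datetime import timedelta


def worst_case_rpo(snapshot_interval: timedelta, upload_duration: timedelta) -> timedelta:
    """Worst-case age of the newest fully uploaded snapshot.

    If a failure occurs just before an upload completes, the last usable
    snapshot was taken one full interval plus one upload duration earlier.
    """
    return snapshot_interval + upload_duration


# Daily snapshots with a hypothetical 2-hour upload time.
rpo = worst_case_rpo(timedelta(hours=24), timedelta(hours=2))
print(rpo)  # 1 day, 2:00:00
```

Shortening the snapshot interval or the upload time reduces this window accordingly.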

If a failure requires recovery from a backup, spin up a cluster using the [Self-Service Portal](/4.0/install/aws/self-service-portal.md) or [CloudFormation](/4.0/install/aws/cloudformation.md), and create filesystems from those snapshots. You do not need to wait for the data to reach the EC2 volumes because it is instantly accessible through S3. The recovery time objective (RTO) for this operation mainly depends on the time it takes to deploy the CloudFormation stack and is typically less than 30 minutes.

#### Cross AZ failure

See [Protecting Data Against AWS Availability Zone Failures](/4.0/fs/snap-to-obj.md#protecting-data-against-aws-availability-zone-failures).

#### Region failure

Weka snapshots uploaded to S3, combined with S3 cross-region replication, protect against an AWS region failure.
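The following is a minimal sketch of an S3 replication configuration for the bucket that holds the uploaded snapshots. It is an assumption-laden example rather than a WEKA-provided procedure: the function name, rule ID, role ARN, and bucket ARN are hypothetical placeholders, and the dictionary follows the shape accepted by the S3 `PutBucketReplication` API. S3 replication requires versioning to be enabled on both the source and destination buckets.

```python
from typing import Any, Dict


def replication_config(role_arn: str, destination_bucket_arn: str) -> Dict[str, Any]:
    """Build a replication configuration that copies all objects in a bucket."""
    return {
        "Role": role_arn,  # IAM role that S3 assumes to replicate objects
        "Rules": [
            {
                "ID": "weka-snapshots-cross-region",  # hypothetical rule name
                "Priority": 1,
                "Status": "Enabled",
                "Filter": {},  # empty filter: the rule applies to every object
                "DeleteMarkerReplication": {"Status": "Disabled"},
                "Destination": {"Bucket": destination_bucket_arn},
            }
        ],
    }


# Placeholder ARNs for illustration only.
config = replication_config(
    role_arn="arn:aws:iam::123456789012:role/example-replication-role",
    destination_bucket_arn="arn:aws:s3:::example-snapshots-replica",
)
print(config["Rules"][0]["Destination"]["Bucket"])

# With boto3 installed and credentials configured, the configuration could be
# applied to a hypothetical source bucket like this:
#   boto3.client("s3").put_bucket_replication(
#       Bucket="example-snapshots", ReplicationConfiguration=config)
```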

### SSH keys rotation

For security reasons, it is advisable to rotate the SSH keys used for the EC2 instances.

To rotate the SSH keys, see the following AWS resources:

* [Adding or replacing a key pair for your instance](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-key-pairs.html#replacing-key-pair)
* [How to use AWS Secrets Manager to securely store and rotate SSH key pairs](https://aws.amazon.com/blogs/security/how-to-use-aws-secrets-manager-securely-store-rotate-ssh-key-pairs/)

