Configure the WEKA cluster using the WEKA Configurator
Detailed workflow for WEKA cluster installation in a multi-container backend architecture using the WEKA Configurator.
The WEKA Configurator tool facilitates cluster configuration. It performs the following:
Scans your environment to detect the network, verifies various attributes such as hostnames, and discovers components such as gateway routers.
Selects the servers that can be included in the cluster and verifies that all servers run the same WEKA version.
Guides you through the configuration options.
Generates a valid configuration file that you can apply to form a WEKA cluster from a group of servers.
Before you begin
Adhere to the following concepts:
STEM mode: STEM mode is the initial state before configuration. The term STEM comes from the concept of stem cells in biology, which are undifferentiated. In WEKA clusters, STEM mode carries the same connotation of being an undifferentiated state.
Reference host: The
wekaconfig
normally runs on one of the servers designated as part of the final cluster. The server thatwekaconfig
runs on is called the reference host. Whenwekaconfig
runs, it expects to find a group of servers in STEM mode. If the reference host is not in STEM mode, an error message is issued, and the program terminates.Same networks: It is assumed that all other servers forming the cluster are connected to the same networks as the
reference host
and have the same configuration (all servers have a homogeneous hardware configuration).Homogeneous configuration: Two or more servers with the same core count, RAM size, number and size of drives, and network configurations are considered homogeneous.
It is best practice to create the WEKA cluster from a group of homogeneous servers (it is typically the case because the hardware is typically purchased all at the same time).
wekaconfig
checks if the servers are homogeneous; if they are not, it points out the discrepancies (such as varying numbers of drives, RAM, or cores).wekaconfig
allows the configuration of heterogeneous clusters. However, because most times, the servers are supposed to be homogeneous, it can be an error that they are not. For example, if one of the drives is defective (DOA) from the factory or a memory stick is defective. These hardware issues are uncommon and can be difficult to discover in large clusters.
Passwordless ssh connection: Enabling passwordless ssh between all the servers is very convenient and makes most tools work more smoothly. At a minimum, a regular user with passwordless
sudo
privileges and passwordless ssh is required for configuration. However, it is most convenient to have theroot
user has passwordless ssh, even if only temporarily during configuration. Ensure you can ssh without a password by doing an ssh to each server.Stripe width: A RAID-like concept refers to the total width of the data stripe for data protection mechanisms. Typically, the DATA and PARITY combined are the stripe width. In WEKA terms, the stripe width must be less than the total number of servers in the cluster. For example, in a 10-server cluster, the stripe width can be 9 (7 data + 2 parity) plus 1 spare.
Prerequisites
WEKA software installation on cluster servers: Verify that the WEKA software is installed on all cluster servers. If it is not installed through the WSA, follow the installation instructions provided in the Install tab of get.weka.io. Once the installation is complete, the WEKA software will be deployed on all allocated servers and running in STEM mode.
Workflow
Access the WEKA tools.
Configure a WEKA cluster with the WEKA Configurator.
Apply the configuration (
config.sh
).
1. Access the WEKA tools
If you do not use WSA or WEKApod: Download the WEKA tools repository to one of the servers by running the following command:
If you use WSA or WEKApod: The tools are pre-installed and can be found at the following location:
2. Configure a WEKA cluster with the WEKA Configurator
Run the
wekaconfig
Tool:Connect to one of the backend servers or the WMS server (if it exists) using SSH.
Navigate to the tools directory directory:
Run the
wekaconfig
tool:
The wekaconfig
tool scans the environment, detects the servers, and evaluates whether the group of servers is homogeneous. The following example demonstrates a scenario where the servers do not have a homogeneous number of CPU cores.
Review the detection results:
If the detected configuration meets your requirements, press Enter to proceed.
Navigate through the available tabs to configure the WEKA settings as needed.
The wekaconfig
displays the data plane networks (DP Networks) detected previously. The list under Select DP Networks reflects the high-speed (100Gb+) networks used for the WEKA storage traffic.
Verify that the list of networks, speed, and number of detected hosts are correct.
If the values are not as expected, such as an incorrect number of servers, incorrect or missing networks, investigate it and check the messages. Typically, network configuration issues are the source of the problem.
Select the required networks to configure WEKA POSIX protocol to run on.
Use the arrow and Tab keys to move between the fields and sections, and the space-bar to select the value.
Note: The green labels have entry fields. The yellow labels have read-only fields.
Finalize the configuration:
After completing the WEKA configuration, use the arrow keys to select Done and press Enter.
The
wekaconfig
tool generates theconfig.sh
file based on your settings.
Advanced users can customize the configuration by editing the config.sh
file using a text editor such as vim
or nano
. If modifications are required, consult the Customer Success Team for guidance.
Regarding drive selection: it is not possible to manually select the data drives (NVMe SSDs) for inclusion in the configuration. WEKA clusters are typically dedicated to running WEKA services and are designed to be homogeneous. As a result, the wekaconfig
tool automatically includes all NVMe drives that are larger than approximately 1.5 GB in size.
To modify the drives used in the cluster, manually edit the config.sh
file. Refer to the example output for config.sh
provided above for guidance.
2. Apply the configuration
From the install directory, run
./config.sh
.
The configuration takes a few minutes and possibly longer for large clusters. See some examples of the configuration process and WEKA status.
What to do next?
Last updated