W E K A
3.14
3.14
  • WEKA v3.14 Documentation
  • Weka System Overview
    • About the WEKA System
    • SSD Capacity Management
    • Filesystems, Object Stores & Filesystem Groups
    • Weka Networking
    • Data Lifecycle Management
    • Weka Client & Mount Modes
    • Glossary
  • Getting Started with Weka
    • Quick Install Guide
    • Managing the Weka System
    • CLI Overview
    • GUI Overview
    • Serving IOs with WekaFS
  • Planning & Installation
    • Prerequisites for Installation
    • Bare Metal Installation
      • Planning a Weka System Installation
      • Setting Up the Hosts
        • SR-IOV Enablement
      • Obtaining the Weka Install File
      • Weka System Installation Process Using the CLI
      • Adding Clients
    • AWS Installation
      • Self-Service Portal
      • CloudFormation Template Generator
      • Deployment Types
      • AWS Outposts Deployment
      • Supported EC2 Instance Types
      • Adding Clients
      • Auto Scaling Group
      • Troubleshooting
  • Performance
    • Testing Weka Performance
      • Test Environment Details
  • WekaFS Filesystems
    • Managing Filesystems, Object Stores & Filesystem Groups
      • Managing Object Stores
      • Managing Filesystem Groups
      • Managing Filesystems
      • Attaching/Detaching Object Stores to/from Filesystems
      • KMS Management
    • Advanced Data Lifecycle Management
      • Advanced Time-based Policies for Data Storage Location
      • Data Management in Tiered Filesystems
      • Transition Between Tiered and SSD-Only Filesystems
      • Manual fetch and release of data
    • Mounting Filesystems
    • Snapshots
    • Snap-To-Object
    • Quota Management
  • Additional Protocols
    • NFS
    • SMB
      • SMB Management Using CLIs
      • SMB Management Using the GUI
    • S3
      • S3 Cluster Management
      • S3 Buckets Management
      • S3 Users and Authentication
      • S3 Information Lifecycle Management
      • Audit S3 APIs
      • S3 Limitations
      • S3 Examples using boto3
  • Operation Guide
    • Alerts
      • List of Alerts
    • Events
      • List of Events
    • Statistics
      • List of Statistics
    • System Congestion
    • Security
      • User Management
      • Organizations
    • Expanding & Shrinking Cluster Resources
      • Expand & Shrink Overview
      • Stages in Adding a Backend Host
      • Expansion of Specific Resources
      • Shrinking a Cluster
    • Background Tasks
    • Upgrading Weka Versions
  • Billing & Licensing
    • License Overview
    • Classic License
    • Pay-As-You-Go License
  • Support
    • Prerequisites and Compatibility
    • Getting Support for Your Weka System
    • The Weka Support Cloud
    • Diagnostics CLI Command
  • Appendix
    • Weka CSI Plugin
    • External Monitoring
    • Snapshot Management
  • REST API
Powered by GitBook
On this page
  • Pre-Fetching API for Data Lifecycle Management
  • Fetching Files from an Object Store
  • Fetching a Directory Containing Many Files
  • Release API for Data Lifecycle Management
  • Releasing Files from SSD to an Object Store
  • Releasing a Directory Containing Many Files
  1. WekaFS Filesystems
  2. Advanced Data Lifecycle Management

Manual fetch and release of data

This page describes how to manually force-fetching tiered data back to SSDs, and force-releasing SSD data to object-store

Pre-Fetching API for Data Lifecycle Management

Fetching Files from an Object Store

Tiered files are always accessible and should generally be treated like regular files. Moreover, while files may be tiered, their metadata is always maintained on the SSDs. This allows traversing files and directories without worrying about how such operations may affect performance.

Sometimes, it may be necessary to access previously-tiered files quickly. In such situations, it is possible to request the Weka system to fetch the files back to the SSD without accessing them directly. This is performed using the prefetch command, which can be issued via theweka fs tier fetchcommand, as follows:

Command: weka fs tier fetch

Use the following command to release files:

weka fs tier fetch <path> [-v]

Parameters in Command Line

Name

Type

Value

Limitations

Mandatory

Default

path

A comma-separated list of string

List of file paths

Yes

​

-v, --verbose

Boolean

Showing fetch requests as they are submitted

No

Off

Fetching a Directory Containing Many Files

In order to fetch a directory that contains a large number of files, it is recommended to use the xargs command in a similar manner, as follows:

find -L <directory path> -type f | xargs -r -n512 -P64 weka fs tier fetch -v

Note: The pre-fetching of files does not guarantee that they will reside on the SSD until they are accessed.

In order to ensure that the fetch is effective, the following must be taken into account:

  • Free SSD Capacity: There has to be sufficient free SSD capacity to retain the filesystems that are to be fetched.

  • Tiering Policy: The tiering policy may release some of the files back to the object store after they have been fetched, or even during the fetch if it takes longer than expected. The policy should be long enough to allow for the fetch to complete and the data to be accessed before it is released again.

Release API for Data Lifecycle Management

Releasing Files from SSD to an Object Store

Using the manual release command, it is possible to clear SSD space in advance (e.g., for shrinking one filesystem SSD capacity for a different filesystem without releasing important data, or for a job that needs more SSDs space from different files). The metadata will still remain on SSD for fast traversal over files and directories but the data will be marked for release and will be released to the object-store as soon as possible, and before any other files are scheduled to release due to other lifecycle policies.

Command: weka fs tier release [-v]

Use the following command to release files:

weka fs tier release <path>

Parameters in Command Line

Name

Type

Value

Limitations

Mandatory

Default

path

A comma-separated list of string

List of file paths

Yes

​

-v, --verbose

Boolean

Showing release requests as they are submitted

No

Off

Releasing a Directory Containing Many Files

In order to release a directory that contains a large number of files, it is recommended to use the xargs command in a similar manner, as follows:

# directory
find -L <directory path> -type f | xargs -r -n512 -P64 weka fs tier release
 
# similarly, a file containing a list of paths can be used
cat file-list | xargs -P32 -n200 weka fs tier release
PreviousTransition Between Tiered and SSD-Only FilesystemsNextMounting Filesystems

Last updated 3 years ago