Analyze storage distribution
Analyze storage distribution with the data catalog. Explore directory hierarchies, visualize file statistics, and perform granular metadata discovery.
Explore filesystem metadata to identify usage patterns and discover specific data sets through the Filesystem Analytics dashboard. Powered by the data catalog, these tools provide macro-level insights and granular discovery to eliminate reliance on external capacity monitoring systems.
Analyze capacity usage: Explore the directory hierarchy and identify storage consumption.
Visualize file distribution: Review file statistics by extension, user, or group.
Monitor storage distribution and trends: Observe how files are distributed by size and age, and track capacity growth.
Search files with discovery queries: Build custom queries to locate files based on metadata attributes.
Use discovery templates: Apply pre-defined query patterns for common analysis tasks.
Export catalog data: Save capacity reports and query results as CSV or JSON files.
Analyze capacity usage
Explore the distribution of storage across different directory levels to identify large data sets and review high-level filesystem metrics.

Before you begin
Verify that the target filesystem is indexed by the data catalog.
Procedure
Select Investigate > Filesystem Analytics.
Select the Capacity Usage tab.
Select the target filesystem from the Filesystem dropdown menu.
Select a specific point in time from the Data Collection dropdown menu.
To display the chart from a custom file path, click the pencil icon and enter the desired path. All chart information will relate to this file path.
Review the high-level metrics:
Filesystem Capacity: Displays used and total provisioned space. Hover over the info icon to view the actual block-level occupancy.
File and Directory counts: Displays the total number of files and directories indexed in the filesystem.
Interact with the sunburst chart to navigate the directory hierarchy:
Select a sector to zoom into a specific directory.
Hover over a sector to view the directory path, total size, and percentage of the total filesystem capacity. Dark purple sectors represent directories, while light purple sectors represent individual files or groups of smaller items.
Select the center of the chart to move up one directory level.
Use the File Statistics chart to view data distribution. Select an option from the dropdown menu:
File Count by Extension
Usage Statistics by Group
Usage Statistics by User
Monitor storage distribution and trends
Observe how files are distributed by size and age, and track capacity growth over time to forecast future storage needs.


Before you begin
Access the Capacity Usage tab and scroll to the Filesystem Analytics section.
Scroll down to view additional distribution metrics.
Procedure
Review the File Size Distribution chart:
Identify the number of files within specific size ranges (for example: 1MB-10MB).
Hover over a bar to view the exact File Count for that range.
Review the Capacity by File Age chart:
Identify the volume of data based on the time elapsed since the last modification (for example: < 1 week or 5+ years).
Hover over a bar to view the Total Size of the files in that age category.
Analyze the Filesystem Capacity Over Time chart:
Observe historical trends for Total Capacity and Used Capacity.
Toggle the Forecast switch to ON to view projected storage needs. The chart displays Total Forecast and Used Forecast lines based on current data patterns. This requires at least 24 hours of historical snapshot data.
Select the Download icon in the top right corner of any chart to export the specific chart data as a CSV file.
Search files with discovery queries
Filter and locate specific files by defining complex metadata conditions such as file size, access time, or owner.

Before you begin
Access the Discovery tab within the Filesystem Analytics view.
Procedure
Select the Filesystem and Data Collection date.
In the Show section, select the columns to display in the results table, for example: File Name, Size, and Created At.
In the Conditions section, define the search criteria:
Select a metadata field (for example: File Size, Access Time, or UID).
Select an operator (for example: In, Between, >, or Regular File).
Enter or select the value for the condition.
Select the + icon to add more conditions. Use the Operator dropdown to select AND or OR logic between conditions.
In the Sort section, select a field and the sort order (ASC or DESC).
Set the number of Rows per Page to display.
Select Run Query.
Apply discovery query templates
Use pre-configured templates to quickly identify common file categories like cold data or recently modified files.

Before you begin
Access the Discovery tab within the Filesystem Analytics view.
Procedure
Select Templates in the Build a New Query section.
Select a template from the list, for example: Files Not Accessed in Last 90 Days (Cold Data).
Review the auto-populated conditions.
Modify the values if required, for example: change the date range or the file size threshold.
Select Run Query.
Query results handling
The query results table supports full filesystems exploration through pagination. Navigate across pages to review the complete result set.
The GUI exports up to 10,000 records per query. To retrieve more records, use the REST API. See Catalog.
Export catalog data
Save the results of a capacity analysis or a discovery query for external reporting or further processing.

Before you begin
Generate a visualization in the Capacity Usage tab or run a query in the Discovery tab.
To export capacity data:
Navigate to the Capacity Usage tab.
Select the Download CSV icon located above the sunburst or distribution charts.
The exported CSV reflects the current visualization scope. It includes the top-level directory statistics displayed in the chart. The “...” entry represents an aggregated summary of additional directories outside the top view.
To export discovery results:
Navigate to the Discovery tab.
Select Export above the results table.
Select the preferred format and scope:
CSV (current page results)
JSON (current page results)
CSV (all results)
JSON (all results)
Retrieve the file from the default downloads folder of the browser.
Last updated