Impala Dashboard

Click Impala --> Dashboard in the left pane to access the Impala dashboard. The dashboard consists of summary panels, a Sankey Diagram with various metrics, and charts that display information about queries based on other criteria such as execution time.

The default time range is Last 24 hrs. To change the time range, click the down arrow in the time selection menu.

The summary tiles display several aggregated values. You can click the number on each field to view detailed information about that metric.

Metric NameDescription
UsersThe total number of users.
# of QueriesThe number of queries being run during the selected timeframe.
Avg CPU TimeThe average of CPU time across all queries.
Per Host Peak Mem UsageThe peak memory usage per host.
SucceededThe number of queries executed successfully.
RunningThe number of queries that are in progress.
FailedThe number of queries that failed to execute.
KilledThe number of queries that were killed.

Context Metric Distributions

The Context Metric distributions panel displays the summary of jobs as a Sankey diagram. By default, the chart displays the distribution by Duration.

You can choose to display the distribution by the following metrics.

Metric NameDescription
DurationThe time taken by the query.
Threads Total TimeThe sum of thread CPU time, storage wait, and network wait times used by all threads of the query.
Thread CPU TimeThe sum of the CPU time used by all threads of the query.
Per Node Peak Mem UsageThe peak memory usage per node.
HDFS Aggr Read MemoryThe sum of HDFS bytes read from memory by the query.

Viewing Other Impala Charts

The following charts are also displayed on the Impala Dashboard.

Chart NameDescription
Query execution countThe number of queries executed on the overall Impala Cluster.
Average query timeDisplays average query execution time and total query execution time of queries executed on the overall Impala Cluster.
Top 20 Users (by query)The top 20 users that ran the highest number of queries within the selected timeframe. By default, you can see the top 20 users for the last 24 hours
Top 20 Tables (by query)The top 20 tables that were accessed within the selected timeframe. By default, you can see the top 20 tables for the last 24 hours