Managing Hive

Acceldata for Hive Management offers advanced Exploratory features, which can enable you to look at various aspects of the Hive Query Execution.

Support

The following execution engines are supported

  • Hive on Tez
  • Hive on MapReduce
  • Hive on Spark (CDH only)
  • Hive LLAP

Hive Query Dashboard

Track and Debug Hive Queries

Queries can be identified with the Yarn Application Id. In addition to that there is a comprehensive search available which allows search by other parameters such as User, Application Type, Time of Execution

Resource Utilization by hive

Resources Utilized by Hive during the selected Calendar interval allows one to look at the VCore and Memory Usage and visually identify debugging ideas. Resource usage can be filtered by Queue.

alt-text

Each of these UI elements are clickable and will take you to the filtered list of queries.

Top K Users & Tables

The following graphs show

  • who the Top K users aggregated by the number of queries fired.
  • which are the Top K tables in use, aggregated by the number of queries on that Table object

alt-text

Query Details

Query Listing

Queries are available for the selected time interval, with each of the header fields sortable.

alt-text

Historical Analysis

SQLs are identified through a complex algorithm separating the predicates and a historical performance of the query is represented.

alt-text

Query, Table and Filters

SQL, along with the tables where the joins are applied in addition to the filter expressions and their effect on the Result set is represented underneath.

alt-text

Map Reduce Stats

Mappers and reducers can be sorted by duration or by the start time of each of the containers. This makes it visually very easy to understand the longest/stalling jobs. alt-text

Query Execution Stats

alt-text

Please note to get data for the above to work seamlessly, Hive Hooks need to be deployed on all the HiveServers which are deployed on the cluster.