Acceldata for Hive Management offers advanced exploratory features that let you examine various aspects of Hive query execution.
The following execution engines are supported:
- Hive on Tez
- Hive on MapReduce
- Hive on Spark (CDH only)
- Hive LLAP
Hive Query Dashboard
Track and Debug Hive Queries
Queries can be identified by their YARN Application ID. A comprehensive search is also available, which lets you search by other parameters such as User, Application Type, and Time of Execution.
Resource Utilization by Hive
The resources utilized by Hive during the selected calendar interval are charted so that you can examine VCore and memory usage and visually spot areas worth debugging. Resource usage can be filtered by queue.
Each of these UI elements is clickable and takes you to the filtered list of queries.
Top K Users & Tables
The following graphs show:
- the Top K users, aggregated by the number of queries fired
- the Top K tables in use, aggregated by the number of queries on that table object
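As a rough illustration of the aggregation behind these graphs, the sketch below counts queries per user and per table and keeps the K largest. The record layout (`user`, `tables`) and the sample data are assumptions made for this example, not Acceldata's actual data model.

```python
from collections import Counter

# Hypothetical query records; the field names are assumptions for illustration.
queries = [
    {"user": "alice", "tables": ["sales", "customers"]},
    {"user": "bob", "tables": ["sales"]},
    {"user": "alice", "tables": ["orders"]},
    {"user": "carol", "tables": ["sales", "orders"]},
    {"user": "alice", "tables": ["sales"]},
]


def top_k_users(records, k):
    """Return the k users with the most queries, as (user, count) pairs."""
    return Counter(r["user"] for r in records).most_common(k)


def top_k_tables(records, k):
    """Return the k most-queried tables; a query counts each table once."""
    counts = Counter()
    for r in records:
        counts.update(set(r["tables"]))
    return counts.most_common(k)


print(top_k_users(queries, 1))
print(top_k_tables(queries, 2))
```

Restricting `queries` to the selected time interval before counting would give the per-interval view described here.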
Queries are listed for the selected time interval, and each header field is sortable.
SQL statements are identified by an algorithm that separates out the predicates, so that the historical performance of each query can be shown.
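The identification algorithm itself is not described here. One common way to group repeated runs of the same statement is to replace literal values with placeholders and use the result as a key. The sketch below shows that general idea only; it is a simplified assumption, not the product's algorithm, and the function name `fingerprint` and the sample runs are hypothetical.

```python
import re
from collections import defaultdict


def fingerprint(sql):
    """Collapse literal values so queries differing only in constants match."""
    text = sql.strip().rstrip(";").lower()
    text = re.sub(r"'(?:[^']|'')*'", "?", text)     # string literals
    text = re.sub(r"\b\d+(?:\.\d+)?\b", "?", text)  # numeric literals
    return re.sub(r"\s+", " ", text)                # normalize whitespace


# Hypothetical (sql, duration in seconds) pairs from past executions.
runs = [
    ("SELECT * FROM sales WHERE region = 'EU' AND amount > 100;", 42.0),
    ("select *  from sales where region = 'US' and amount > 2500", 57.5),
    ("SELECT id FROM orders WHERE status = 'open'", 3.2),
]

# Group durations by fingerprint to build a per-query history.
history = defaultdict(list)
for sql, seconds in runs:
    history[fingerprint(sql)].append(seconds)

for key, durations in history.items():
    print(key, durations)
```

A regex-based approach like this ignores comments and quoted identifiers; a real implementation would more likely work on the parsed query.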
Query, Table and Filters
The SQL is shown underneath, along with the tables on which joins are applied, the filter expressions, and their effect on the result set.
Map Reduce Stats
Mappers and reducers can be sorted by duration or by the start time of each container. This makes it easy to spot the longest-running or stalling jobs visually.
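The two sort orders can be sketched as follows. The `Task` structure, its field names, and the sample values are assumptions for illustration; they do not reflect the product's internal representation.

```python
from dataclasses import dataclass


@dataclass
class Task:
    task_id: str   # hypothetical identifier
    kind: str      # "map" or "reduce"
    start: float   # container start time, seconds since job start
    end: float     # container end time, seconds since job start

    @property
    def duration(self):
        return self.end - self.start


tasks = [
    Task("m_0", "map", 0.0, 12.0),
    Task("m_1", "map", 1.0, 95.0),
    Task("r_0", "reduce", 96.0, 110.0),
    Task("m_2", "map", 0.5, 10.0),
]

# Longest-running containers first: stalled tasks rise to the top.
by_duration = sorted(tasks, key=lambda t: t.duration, reverse=True)

# Chronological order: shows how the containers were scheduled.
by_start = sorted(tasks, key=lambda t: t.start)

print([t.task_id for t in by_duration])
print([t.task_id for t in by_start])
```

In this sample, `m_1` runs far longer than its sibling mappers, which is the kind of outlier the duration sort is meant to surface.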
Query Execution Stats
Note: For the data in the views above to be collected reliably, Hive hooks must be deployed on every HiveServer in the cluster.