Workload System Table
Backgroup
Doris supports analyzing the resource usage of running workloads through the Workload system table, commonly used in the following scenarios:
- View the resource usage of Workload Group in the cluster, including CPU and memory.
- View the TopN SQL with the highest resource usage in the current cluster.
- View the queue status of Workload Groups in the cluster
Users can query this information by submitting SQL, identify the Workload Group or SQL with high resource usage in the system, and perform corresponding processing.
Workload System Table Introduction
At present, the system tables are in database information_schema
.
active_queries
The active_queries
table records the execution information of the current query on FE, and the detailed information of the fields is as follows:
- query_id, query's id
- query_start_time, the start time of the query execution; If query queues, it represents the time when execution starts after the queue ends
- query_time_ms, the query time, in milliseconds
- workload_group_id, workload group's id
- database, sql's database
- frontend_instance, the FE which query is submitted
- queue_start_time, if query queues, it means the time that query begins to queue
- queue_end_time, if query queues, it means the time that query ends to queue
- query_status, query status, it has two value RUNNING and QUEUED, RUNNIG means query is running; QUEUED means query is queued.
- sql, sql content
backend_active_tasks
A query is usually divided into multiple fragments to be executed on multiple BEs, and backend_active_tasks
table represent the total amount of CPU and memory resources used by a query on a single BE. If this query has multiple concurrency and fragments on a single BE, it will also be summarized into one row of data.
The detailed information of the fields is as follows:
- be_id, BE's id
- fe_host, it represents which FE this query was submitted from
- query_id, query's id
- task_time_ms, query execution time on the current BE, in milliseconds
- task_cpu_time_ms,query cpu time on the current BE, in milliseconds
- scan_rows, the number of rows scanned on the current BE. If multiple tables are scanned, it is the cumulative value of multiple tables
- scan_bytes, the number of bytes scanned on the current BE. If multiple tables are scanned, it is the cumulative value of multiple tables
- be_peak_memory_bytes, the peak memory usage on the current BE, in bytes
- current_used_memory_bytes, the amount of memory currently in use on the BE, in bytes
- shuffle_send_bytes, the number of bytes sent as shuffle clients at the BE
- shuffle_send_rows, the number of rows sent as shuffle clients at the BE
Basic Usage
TopN resource usage sql
select
t2.query_id,
t2.workload_group_id,
t2.`database`,
t1.cpu_time,
t1.mem_used,
t2.`sql`
from
(select query_id, sum(task_cpu_time_ms) as cpu_time,sum(current_used_memory_bytes) as mem_used from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
order by cpu_time desc, mem_used desc limit 10;TopN resource usage sql on BE
select
t2.query_id,
t2.workload_group_id,
t2.`database`,
t1.cpu_time,
t1.mem_used,
t2.`sql`
from
(select query_id, sum(task_cpu_time_ms) as cpu_time,sum(current_used_memory_bytes) as mem_used
from backend_active_tasks where be_id=12345 group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
order by cpu_time desc, mem_used desc limit 10;
WorkloadGroup queue details
select
workload_group_id,
sum(case when query_status='QUEUED' then 1 else 0 end) as queue_num,
sum(case when query_status='RUNNING' then 1 else 0 end) as running_query_num
from
active_queries
group by workload_group_idQuery queue time
select
workload_group_id,
query_id,
query_status,
now() - queue_start_time as queued_time
from
active_queries
where query_status='queued'
order by workload_group_id
Application scenarios
When the query latency of the cluster increases and the availability decreases, bottleneck points can be identified through the overall monitoring of the cluster:
- When the CPU resources of BE are fully utilized and the memory usage is not high, it indicates that the main bottleneck should be on the CPU.
- When the CPU and memory resources of BE are both high, it indicates that the main bottleneck is in memory.
- When the CPU and memory resource usage of BE is not high, but the IO usage is high, it indicates that the main bottleneck is in IO.
- The CPU/memory/IO is not high, but there are many queued queries, indicating that the queue parameter configuration is unreasonable. You can try increasing the queue concurrency.
After identifying the bottleneck points of the cluster, the SQL that is currently using more resources can be further analyzed through the workload system table, and then the query can be downgraded.
Find SQL with the highest CPU usage
TopN CPU usage sql
select
t2.query_id,
t2.workload_group_id,
t2.`database`,
t1.cpu_time,
t2.`sql`
from
(select query_id, sum(task_cpu_time_ms) as cpu_time from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
order by cpu_time desc limit 10;Workload group cpu time
select
t2.workload_group_id,
sum(t1.cpu_time) cpu_time
from
(select query_id, sum(task_cpu_time_ms) as cpu_time from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
group by workload_group_id order by cpu_time desc
If the CPU usage of a single SQL is too high, it can be alleviated by canceling the query.
If SQL with longer CPU time comes from the same workload group, CPU usage can be reduced by lowering the CPU priority of this workload group or lowering the number of scan threads.
Find SQL with the highest memory usage
TopN memory usage sql
select
t2.query_id,
t2.workload_group_id,
t1.mem_used
from
(select query_id, sum(current_used_memory_bytes) as mem_used from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
order by mem_used desc limit 10;Workload group memory usage
select
t2.workload_group_id,
sum(t1.mem_used) wg_mem_used
from
(select query_id, sum(current_used_memory_bytes) as mem_used from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
group by t2.workload_group_id order by wg_mem_used desc
If a single query occupies most of the memory, then killing this query can be used to quickly free up memory.
If a lower priority workload group is using more memory, memory can be released by downgrading this workload group:
- If the memory configuration is soft limit, it can be modified to hard limit and the memory limit of the workload group can be reduced
- Reduce the query concurrency of the workload group through its queuing function
Find queries which scan too much data
At present, Doris does not directly collect indicators of disk IO for queries, but it can indirectly find SQL with heavier scans through the number of scan rows and scan bytes.
TopN scan data query
select
t2.query_id,
t2.workload_group_id,
t1.scan_rows,
t1.scan_bytes
from
(select query_id, sum(scan_rows) as scan_rows,sum(scan_bytes) as scan_bytes from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
order by scan_rows desc,scan_bytes desc limit 10;Workload group scan data
select
t2.workload_group_id,
sum(t1.scan_rows) as wg_scan_rows,
sum(t1.scan_bytes) as wg_scan_bytes
from
(select query_id, sum(scan_rows) as scan_rows,sum(scan_bytes) as scan_bytes from backend_active_tasks group by query_id)
t1 left join active_queries t2
on t1.query_id = t2.query_id
group by t2.workload_group_id
order by wg_scan_rows desc,wg_scan_bytes desc
If the scan data volume of a single SQL statement is large, then a kill query can be used to check if there will be any relief.
If the scanning data volume of a certain workload group is large, IO can be reduced by lowering the number of scanning threads of the workload group.