命名空间
Namespace = QCE/DLC
监控指标
指标英文名 | 指标中文名 | 说明 | 单位 | 维度 | 统计规则 [period, statType] |
2xxResponse | 2xx状态码 | 2xx状态码 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
2xxResponseRate | 2xx状态码占比 | 2xx状态码占比 | % | bucket | [ 60s, max ] [ 300s, avg ] |
4xxResponse | 4xx状态码 | 4xx状态码 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
503ResponseRate | 503状态码占比 | 503状态码占比 | % | bucket | [ 60s, max ] [ 300s, avg ] |
5xxResponse | 5xx状态码 | 5xx状态码 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
5xxResponseRate | 5xx状态码占比 | 5xx状态码占比 | % | bucket | [ 60s, max ] [ 300s, avg ] |
ClusterCpuUsageAverage | 引擎所有集群的 cpu 平均使用率 | 引擎所有集群的 cpu 平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ClusterCpuUsageMax | 引擎所有集群的 cpu 最大使用率 | 引擎所有集群的 cpu 最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
ClusterMemUsageAverage | 引擎所有集群的内存平均使用率 | 引擎所有集群的内存平均使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
ClusterMemUsageMax | 引擎所有集群的内存最大使用率 | 引擎所有集群的内存最大使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorCpuUsageAverage | 引擎所有 coordinator 节点 cpu 的平均使用率 | 引擎所有 coordinator 节点 cpu 的平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorCpuUsageMax | 引擎所有 coordinator 节点 cpu 的最大使用率 | 引擎所有 coordinator 节点 cpu 的最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
CoordinatorMemUsageAverage | 引擎所有 coordinator 节点内存的平均使用率 | 引擎所有 coordinator 节点内存的平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorMemUsageMax | 引擎所有 coordinator 节点内存的最大使用率 | 引擎所有 coordinator 节点内存的最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
CoordinatorNetworkReceiveBytesRateAverage | 引擎所有 coordinator 节点网络平均入带宽 | 引擎所有 coordinator 节点网络平均入带宽 | MBytes/s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorNetworkReceiveBytesRateMax | 引擎所有 coordinator 节点网络最大入带宽 | 引擎所有 coordinator 节点网络最大入带宽 | MBytes/s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
CoordinatorNetworkTransmitBytesRateAverage | 引擎所有 coordinator 节点网络平均出带宽 | 引擎所有 coordinator 节点网络平均出带宽 | MBytes/s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorNetworkTransmitBytesRateMax | 引擎所有 coordinator 节点网络最大出带宽 | 引擎所有 coordinator 节点网络最大出带宽 | MBytes/s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
CoordinatorPvcDiskUsageAverage | 引擎所有 coordinator 节点云盘平均使用率 | 引擎所有 coordinator 节点云盘平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
CoordinatorPvcDiskUsageMax | 引擎所有 coordinator 节点云盘最大使用率 | 引擎所有 coordinator 节点云盘最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
CpuLoadAverage | cpu10秒负载 | cpu10秒负载 | % | gatewayid | [ 60s, avg ] [ 300s, avg ] |
CpuLoadAverageInstance | cpu10秒负载 | cpu10秒负载 | % | instanceid gatewayid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageCore | cpu 使用核心数 | cpu 使用核心数 | Count | gatewayid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageCoreInstance | cpu 使用核心数 | cpu 使用核心数 | Count | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageRate | cpu 使用率 | cpu 使用率 | % | gatewayid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageRateInstance | cpu 使用率 | cpu 使用率 | % | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageSeconds | cpu 占用时间 | cpu 占用时间 | ms | gatewayid | [ 60s, avg ] [ 300s, avg ] |
CpuUsageSecondsInstance | cpu 占用时间 | cpu 占用时间 | ms | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
CuUsage | 引擎 CU 占用量 | 引擎 CU 占用量 | Count | dataengineid | [ 60s, max ] [ 300s, max ] |
CuUsageRate | 引擎 CU 使用率 | 引擎 CU 使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
DataTaskFailed | 调度任务失败 | 调度任务失败 | None | datataskid | [ 60s, max ] [ 300s, max ] |
DataTaskTimeout | 调度任务超时 | 调度任务超时 | None | datataskid | [ 60s, max ] [ 300s, max ] |
DataTaskWaittingTimeout | 调度任务等待调度超时 | 调度任务等待调度超时 | None | datataskid | [ 60s, max ] [ 300s, max ] |
DriverCpuUsageAverage | 引擎所有 driver 节点 cpu 的平均使用率 | 引擎所有 driver 节点 cpu 的平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
DriverCpuUsageMax | 引擎所有 driver 节点 cpu 的最大使用率 | 引擎所有 driver 节点 cpu 的最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
DriverMemUsageAverage | 引擎所有 driver 节点内存的平均使用率 | 引擎所有 driver 节点内存的平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
DriverMemUsageMax | 引擎所有 driver 节点内存的最大使用率 | 引擎所有 driver 节点内存的最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
DriverNetworkReceiveBytesRateAverage | 引擎所有 driver 节点网络平均入带宽 | 引擎所有 driver 节点网络平均入带宽 | MBytes/s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
DriverNetworkReceiveBytesRateMax | 引擎所有 driver 节点网络最大入带宽 | 引擎所有 driver 节点网络最大入带宽 | MBytes/s | dataengineid | [ 60s, max ] [ 300s, max ] |
DriverNetworkTransmitBytesRateAverage | 引擎所有 driver 节点网络平均出带宽 | 引擎所有 driver 节点网络平均出带宽 | MBytes/s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
DriverNetworkTransmitBytesRateMax | 引擎所有 driver 节点网络最大出带宽 | 引擎所有 driver 节点网络最大出带宽 | MBytes/s | dataengineid | [ 60s, max ] [ 300s, max ] |
DriverPvcDiskUsageAverage | 引擎所有 driver 节点云盘平均使用率 | 引擎所有 driver 节点云盘平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
DriverPvcDiskUsageMax | 引擎所有 driver 节点云盘最大使用率 | 引擎所有 driver 节点云盘最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
EngineProcessThreadNum | engine 启动的线程数 | engine 启动的线程数 | Count | gatewayid | [ 60s, sum ] [ 300s, sum ] |
EngineProcessThreadNumMd | engine 启动的线程数 | engine 启动的线程数 | Count | engineid gatewayid processid | [ 60s, max ] [ 300s, max ] |
ExecuteStatementNum | 执行 statement 数量 | 执行 statement 数量 | Count | gatewayid | [ 60s, sum ] [ 300s, sum ] |
ExecutorCpuUsageAverage | 引擎所有 executor 节点 cpu 的平均使用率 | 引擎所有 executor 节点 cpu 的平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ExecutorCpuUsageMax | 引擎所有 executor 节点 cpu 的最大使用率 | 引擎所有 executor 节点 cpu 的最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
ExecutorMemUsageAverage | 引擎所有 executor 节点内存的平均使用率 | 引擎所有 executor 节点内存的平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ExecutorMemUsageMax | 引擎所有 executor 节点内存的最大使用率 | 引擎所有 executor 节点内存的最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
ExecutorNetworkReceiveBytesRateAverage | 引擎所有 executor 节点网络平均入带宽 | 引擎所有 executor 节点网络平均入带宽 | MBytes/s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ExecutorNetworkReceiveBytesRateMax | 引擎所有 executor 节点网络最大入带宽 | 引擎所有 executor 节点网络最大入带宽 | MBytes/s | dataengineid | [ 60s, max ] [ 300s, max ] |
ExecutorNetworkTransmitBytesRateAverage | 引擎所有 executor 节点网络平均出带宽 | 引擎所有 executor 节点网络平均出带宽 | MBytes/s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ExecutorNetworkTransmitBytesRateMax | 引擎所有 executor 节点网络最大出带宽 | 引擎所有 executor 节点网络最大出带宽 | MBytes/s | dataengineid | [ 60s, max ] [ 300s, max ] |
ExecutorPvcDiskUsageAverage | 引擎所有 executor 节点云盘平均使用率 | 引擎所有 executor 节点云盘平均使用率 | % | dataengineid | [ 60s, avg ] [ 300s, avg ] |
ExecutorPvcDiskUsageMax | 引擎所有 executor 节点云盘最大使用率 | 引擎所有 executor 节点云盘最大使用率 | % | dataengineid | [ 60s, max ] [ 300s, max ] |
FsLimitBytes | 磁盘容量 | 磁盘容量 | Bytes | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsLimitBytesInstance | 磁盘容量 | 磁盘容量 | Bytes | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
FsReadsBytes | 磁盘读字节数 | 磁盘读字节数 | Bytes | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsReadsBytesInstance | 磁盘读字节数 | 磁盘读字节数 | Bytes | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
FsReadsCounts | 磁盘读次数 | 磁盘读次数 | Count/s | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsReadsCountsInstance | 磁盘读次数 | 磁盘读次数 | Count/s | instanceid gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsUsageBytes | 磁盘使用量 | 磁盘使用量 | Bytes | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsUsageBytesInstance | 磁盘使用量 | 磁盘使用量 | Bytes | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
FsWritesBytes | 磁盘写字节数 | 磁盘写字节数 | Bytes | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsWritesBytesInstance | 磁盘写字节数 | 磁盘写字节数 | Bytes | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
FsWritesCounts | 磁盘写次数 | 磁盘写次数 | Count/s | gatewayid | [ 60s, avg ] [ 300s, avg ] |
FsWritesCountsInstance | 磁盘写次数 | 磁盘写次数 | Count/s | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
GetRequests | Get 类总请求数 | Get 类总请求数 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
GetRequestsPs | Get 类请求 QPS | Get 类请求 QPS | Count/s | bucket | [ 60s, max ] [ 300s, avg ] |
GovernTableTaskCancelNum | 数据优化任务取消任务个数 | 数据优化任务取消任务个数 | Count | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, max ] [ 300s, max ] |
GovernTableTaskFailedNum | 数据优化任务失败任务个数 | 数据优化任务失败任务个数 | Count | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, max ] [ 300s, max ] |
GovernTableTaskInitNum | 数据优化任务初始化任务个数 | 数据优化任务初始化任务个数 | Count | dlcdatabase dlctable dlctasktype dlccatalog | [ 60s, max ] [ 300s, max ] |
GovernTableTaskInitTimeAverage | 数据优化任务平均初始化时长 | 数据优化任务平均初始化时长 | s | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, avg ] [ 300s, avg ] |
GovernTableTaskInitTimeMax | 数据优化任务最大初始化时长 | 数据优化任务最大初始化时长 | s | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, max ] [ 300s, max ] |
GovernTableTaskQueueNum | 数据优化排队任务个数 | 数据优化排队任务个数 | Count | dlctasktype dlccatalog dlcdatabase dlctable | [ 60s, max ] [ 300s, max ] |
GovernTableTaskQueueTimeAverage | 数据优化任务平均排队时长 | 数据优化任务平均排队时长 | s | dlcdatabase dlctable dlctasktype dlccatalog | [ 60s, avg ] [ 300s, avg ] |
GovernTableTaskQueueTimeMax | 数据优化任务最大排队时长 | 数据优化任务最大排队时长 | s | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, max ] [ 300s, max ] |
GovernTableTaskRunningNum | 数据优化运行中任务个数 | 数据优化运行中任务个数 | Count | dlcdatabase dlctable dlctasktype dlccatalog | [ 60s, max ] [ 300s, max ] |
GovernTableTaskSuccNum | 数据优化成功任务个数 | 数据优化成功任务个数 | Count | dlccatalog dlcdatabase dlctable dlctasktype | [ 60s, max ] [ 300s, max ] |
GovernTaskCancelNum | 数据优化取消任务个数 | 数据优化取消任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
GovernTaskFailedNum | 数据优化失败任务个数 | 数据优化失败任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
GovernTaskInitNum | 数据优化初始化任务个数 | 数据优化初始化任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
GovernTaskInitTimeAverage | 数据优化任务平均初始化时长 | 数据优化任务平均初始化时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
GovernTaskInitTimeMax | 数据优化任务最大初始化时长 | 数据优化任务最大初始化时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
GovernTaskQueueNum | 数据优化排队任务个数 | 数据优化排队任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
GovernTaskQueueTimeAverage | 数据优化任务平均排队时长 | 数据优化任务平均排队时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
GovernTaskQueueTimeMax | 数据优化任务最大排队时长 | 数据优化任务最大排队时长 | s | dataengineid | [ 60s, max ] [ 300s, max ] |
GovernTaskRunningNum | 数据优化运行中任务个数 | 数据优化运行中任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
GovernTaskSuccNum | 数据优化成功任务个数 | 数据优化成功任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
InternalTraffic | 内网下行流量 | 内网下行流量 | Bytes | bucket | [ 60s, max ] [ 300s, sum ] |
InternalTrafficDownBandwidth | 内网下行带宽 | 内网下行带宽 | Mbps | bucket | [ 60s, max ] [ 300s, sum ] |
InternalTrafficUp | 内网上行流量 | 内网上行流量 | Bytes | bucket | [ 60s, max ] [ 300s, sum ] |
InternalTrafficUpBandwidth | 内网上行带宽 | 内网上行带宽 | Mbps | bucket | [ 60s, max ] [ 300s, sum ] |
InternetTraffic | 外网下行流量 | 外网下行流量 | Bytes | bucket | [ 60s, max ] [ 300s, sum ] |
InternetTrafficDownBandwidth | 外网下行带宽 | 外网下行带宽 | Mbps | bucket | [ 60s, max ] [ 300s, sum ] |
InternetTrafficUp | 外网上行流量 | 外网上行流量 | Bytes | bucket | [ 60s, max ] [ 300s, sum ] |
InternetTrafficUpBandwidth | 外网上行带宽 | 外网上行带宽 | Mbps | bucket | [ 60s, max ] [ 300s, sum ] |
JobLogErrorNum | spark 作业 ERROR 日志数 | spark 作业 ERROR 日志数 | Count | sparkappid | [ 60s, avg ] [ 300s, sum ] |
JobLogWarnNum | spark 作业 WARN 日志数 | spark 作业 WARN 日志数 | Count | sparkappid | [ 60s, avg ] [ 300s, sum ] |
LaunchEngineNum | 启动的引擎数量 | 启动的引擎数量 | Count | gatewayid | [ 60s, sum ] [ 300s, sum ] |
MemoryUsage | 内存使用量 | 内存使用量 | Bytes | gatewayid | [ 60s, avg ] [ 300s, avg ] |
MemoryUsageInstance | 内存使用量 | 内存使用量 | Bytes | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
MemoryUsageRate | 内存使用率 | 内存使用率 | % | gatewayid | [ 60s, avg ] [ 300s, avg ] |
MemoryUsageRateInstance | 内存使用率 | 内存使用率 | % | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
NetworkReceiveRate | 网络入流量 | 网络入流量 | KBytes/s | gatewayid | [ 60s, avg ] [ 300s, avg ] |
NetworkReceiveRateInstance | 网络入流量 | 网络入流量 | KBytes/s | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
NetworkTransmitRate | 网络出流量 | 网络出流量 | KBytes/s | gatewayid | [ 60s, avg ] [ 300s, avg ] |
NetworkTransmitRateInstance | 网络出流量 | 网络出流量 | KBytes/s | gatewayid instanceid | [ 60s, avg ] [ 300s, avg ] |
OpenedOperationNum | 打开的 operation 数量 | 打开的 operation 数量 | Count | gatewayid | [ 60s, sum ] [ 300s, sum ] |
PrestoClusterCpuUsageAverage | presto 引擎所有集群的 cpu平均使用率 | presto 引擎所有集群的 cpu 平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
PrestoClusterCpuUsageMax | presto 引擎所有集群的 cpu 最大使用率 | presto 引擎所有集群的 cpu 最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
PrestoClusterMemUsageAverage | 引擎所有集群的内存平均使用率 | 引擎所有集群的内存平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
PrestoClusterMemUsageMax | presto 引擎所有集群的内存最大使用率 | presto 引擎所有集群的内存最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
PrestoTaskCancelNum | 取消任务个数 | 取消任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PrestoTaskFailedNum | 失败任务个数 | 失败任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PrestoTaskInitNum | 初始化任务个数 | 初始化任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PrestoTaskInitTimeAverage | 任务平均初始化时长 | 任务平均初始化时长 | s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
PrestoTaskInitTimeMax | 任务最大初始化时长 | 任务最大初始化时长 | s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
PrestoTaskQueueNum | 排队任务个数 | 排队任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PrestoTaskQueueTimeAverage | 任务平均排队时长 | 任务平均排队时长 | s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
PrestoTaskQueueTimeMax | 任务最大排队时长 | 任务最大排队时长 | s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
PrestoTaskRunningNum | 运行中任务个数 | 运行中任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PrestoTaskSuccNum | 成功任务个数 | 成功任务个数 | Count | prestodataengineid | [ 60s, avg ] [ 300s, sum ] |
PutRequests | Put 类总请求数 | Put 类总请求数 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
PutRequestsPs | Put 类请求 QPS | Put 类请求 QPS | Count/s | bucket | [ 60s, max ] [ 300s, avg ] |
RequestLatency | 请求平均时延 | 请求平均时延 | ms | bucket | [ 60s, expr ] [ 300s, expr ] |
RewriteTaskCancelNum | 小文件合并取消任务个数 | 小文件合并取消任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
RewriteTaskFailedNum | 小文件合并失败任务个数 | 小文件合并失败任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
RewriteTaskInitNum | 小文件合并初始化任务个数 | 小文件合并初始化任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
RewriteTaskInitTimeAverage | 小文件合并任务平均初始化时长 | 小文件合并任务平均初始化时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
RewriteTaskInitTimeMax | 小文件合并任务最大初始化时长 | 小文件合并任务最大初始化时长 | s | dataengineid | [ 60s, max ] [ 300s, max ] |
RewriteTaskQueueNum | 小文件合并排队任务个数 | 小文件合并排队任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
RewriteTaskQueueTimeAverage | 小文件合并任务平均排队时长 | 小文件合并任务平均排队时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
RewriteTaskQueueTimeMax | 小文件合并任务最大排队时长 | 小文件合并任务最大排队时长 | s | dataengineid | [ 60s, max ] [ 300s, max ] |
RewriteTaskRunningNum | 小文件合并运行中任务个数 | 小文件合并运行中任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
RewriteTaskSuccNum | 小文件合并成功任务个数 | 小文件合并成功任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
Slowqueries | 慢查询 | 执行 SQL 超过定义正常阈值 | Count | appId | [ 60s, last ] [ 300s, last ] [ 3600s, last ] [ 86400s, last ] |
SqlTaskFailed | sql 任务失败 | sql 任务失败 | None | sparkappid taskid creator dataengineid | [ 60s, max ] [ 300s, max ] |
SqlTaskInitTime | sql 任务初始化时间 | sql 任务初始化时间 | s | creator dataengineid sparkappid taskid | [ 60s, max ] [ 300s, max ] |
SqlTaskQueueTime | sql 任务排队时间 | sql 任务排队时间 | s | sparkappid taskid creator dataengineid | [ 60s, max ] [ 300s, max ] |
SqlTaskRunningTime | sql 任务运行时间 | sql 任务运行时间 | s | taskid creator dataengineid sparkappid | [ 60s, max ] [ 300s, max ] |
SqlTaskTotalExeTime | sql 任务总执行时间 | sql 任务总执行时间 | s | sparkappid taskid creator dataengineid | [ 60s, max ] [ 300s, max ] |
TaskCancelNum | 取消任务个数 | 取消任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
TaskFailedNum | 失败任务个数 | 失败任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
TaskInitNum | 初始化任务个数 | 初始化任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
TaskInitTimeAverage | 任务平均初始化时长 | 任务平均初始化时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
TaskInitTimeMax | 任务最大初始化时长 | 任务最大初始化时长 | s | dataengineid | [ 60s, max ] [ 300s, max ] |
TaskQueueNum | 排队任务个数 | 排队任务个数 | Count | dataengineid | [ 60s, max ] [ 300s, sum ] |
TaskQueueTimeAverage | 任务平均排队时长 | 任务平均排队时长 | s | dataengineid | [ 60s, avg ] [ 300s, avg ] |
TaskQueueTimeMax | 任务最大排队时长 | 任务最大排队时长 | s | dataengineid | [ 60s, max ] [ 300s, max ] |
TaskRunningNum | 运行中任务个数 | 运行中任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
TaskSuccNum | 成功任务个数 | 成功任务个数 | Count | dataengineid | [ 60s, avg ] [ 300s, sum ] |
TotalRequestLatency | 总请求平均延时 | 总请求平均延时 | ms | bucket | [ 60s, max ] [ 300s, avg ] |
TotalRequests | 总请求数 | 总请求数 | Count | bucket | [ 60s, max ] [ 300s, sum ] |
TotalRequestsPs | 总请求 QPS | 总请求 QPS | Count/s | bucket | [ 60s, max ] [ 300s, sum ] |
WorkerCpuUsageAverage | 引擎所有 worker 节点 cpu 的平均使用率 | 引擎所有 worker 节点 cpu 的平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
WorkerCpuUsageMax | 引擎所有 worker 节点 cpu 的最大使用率 | 引擎所有 worker 节点 cpu 的最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
WorkerMemUsageAverage | 引擎所有 worker 节点内存的平均使用率 | 引擎所有 worker 节点内存的平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
WorkerMemUsageMax | 引擎所有 worker 节点内存的最大使用率 | 引擎所有 worker 节点内存的最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
WorkerNetworkReceiveBytesRateAverage | 引擎所有 worker 节点网络平均入带宽 | 引擎所有 worker 节点网络平均入带宽 | MBytes/s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
WorkerNetworkReceiveBytesRateMax | 引擎所有 worker 节点网络最大入带宽 | 引擎所有 worker 节点网络最大入带宽 | MBytes/s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
WorkerNetworkTransmitBytesRateAverage | 引擎所有 worker 节点网络平均出带宽 | 引擎所有 worker 节点网络平均出带宽 | MBytes/s | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
WorkerNetworkTransmitBytesRateMax | 引擎所有 worker 节点网络最大出带宽 | 引擎所有 worker 节点网络最大出带宽 | MBytes/s | prestodataengineid | [ 60s, max ] [ 300s, max ] |
WorkerPvcDiskUsageAverage | 引擎所有 worker 节点云盘平均使用率 | 引擎所有 worker 节点云盘平均使用率 | % | prestodataengineid | [ 60s, avg ] [ 300s, avg ] |
WorkerPvcDiskUsageMax | 引擎所有 worker节点云盘最大使用率 | 引擎所有 worker 节点云盘最大使用率 | % | prestodataengineid | [ 60s, max ] [ 300s, max ] |
各维度对应参数总览
参数名称 | 维度名称 | 维度解释 | 格式 |
Instances.N.Dimensions.0.Name | bucket | DLC 内部存储桶的维度名称 | 输入 String 类型维度名称:bucket |
Instances.N.Dimensions.0.Value | bucket | DLC 内部存储桶名称 | 输入 DLC 内部存储桶名称,例如:dlc2v87-100318379117-1658717958-100017507912-1304028854 |
Instances.N.Dimensions.1.Name | dataengineid | Spark 引擎 ID 的维度名称 | 输入 String 类型维度名称:dataengineid |
Instances.N.Dimensions.1.Value | dataengineid | Spark 引擎 ID | 输入 Spark 引擎 ID,例如:DataEngine-btd67pb5 |
Instances.N.Dimensions.2.Name | prestodataengineid | Presto 引擎 ID 的维度名称 | 输入 String 类型维度名称:prestodataengineid |
Instances.N.Dimensions.2.Value | prestodataengineid | Presto 引擎 ID | 输入 Presto 引擎 ID,例如:DataEngine-btd67pb5 |
Instances.N.Dimensions.3.Name | gatewayid | 网关 ID 的维度名称 | 输入 String 类型维度名称:gatewayid |
Instances.N.Dimensions.3.Value | gatewayid | 网关 ID | 输入网关 ID,例如:DataEngine-btd67pb5 |
Instances.N.Dimensions.4.Name | instanceid | 网关实例 ID 的维度名称 | 输入 String 类型维度名称:instanceid |
Instances.N.Dimensions.4.Value | instanceid | 网关实例 ID | 输入网关实例 ID,例如:emr-k5rvr4g6-kyuubi-kyuubiserver-0 |
Instances.N.Dimensions.5.Name | dlccatalog | DLC 数据目录名称的维度名称 | 输入 String 类型维度名称:dlccatalog |
Instances.N.Dimensions.5.Value | dlccatalog | DLC 数据目录名称 | 输入 DLC 数据目录名称,例如:datalakecatalog |
Instances.N.Dimensions.6.Name | dlcdatabase | DLC 数据库名称的维度名称 | 输入 String 类型维度名称:dlcdatabase |
Instances.N.Dimensions.6.Value | dlcdatabase | DLC 数据库名称 | 输入 DLC 数据库名称,例如:test_db |
Instances.N.Dimensions.7.Name | dlctable | DLC 数据库名称的维度名称 | 输入 String 类型维度名称:dlctable |
Instances.N.Dimensions.7.Value | dlctable | DLC 数据表名称 | 输入 DLC 数据表名称,例如:test_table |
Instances.N.Dimensions.8.Name | dlctasktype | DLC 任务类型的维度名称 | 输入 String 类型维度名称:dlctasktype |
Instances.N.Dimensions.8.Value | dlctasktype | DLC 任务类型名称 | 输入 DLC 任务类型名称,例如:SparkSQL |
Instances.N.Dimensions.9.Name | sparkappid | Spark 作业 ID 的维度名称 | 输入 String 类型维度名称:sparkappid |
Instances.N.Dimensions.9.Value | sparkappid | Spark 作业 ID | 输入 Spark 作业 ID,例如:batch_18f0df97-a15a-45d5-802b-c16111d84551 |
Instances.N.Dimensions.10.Name | taskid | 任务 ID 的维度名称 | 输入String类型维度名称:taskid |
Instances.N.Dimensions.10.Value | taskid | 任务 ID | 输入任务 ID,例如:aa241405f36d11ef9941525400333554 |
Instances.N.Dimensions.11.Name | creator | 创建人名称的维度名称 | 输入String类型维度名称:creator |
Instances.N.Dimensions.11.Value | creator | 创建人名称 | 输入创建人名称,例如:Bob |
Instances.N.Dimensions.12.Name | appId | 主账号 appid 的维度名称 | 输入 String 类型维度名称:appId |
Instances.N.Dimensions.12.Value | appId | 主账号 appid | 输入具体 appid,例如:10001234567 |
入参说明
查询数据湖计算 DLC监控数据,入参取值如下:
&Namespace=QCE/DLC
&Instances.N.Dimensions.0.Name=bucket
&Instances.N.Dimensions.0.Value=DLC 内部存储桶名称
&Instances.N.Dimensions.1.Name=dataengineid
&Instances.N.Dimensions.1.Value=Spark 引擎 ID
&Instances.N.Dimensions.2.Name=prestodataengineid
&Instances.N.Dimensions.2.Value=Presto 引擎 ID
&Instances.N.Dimensions.3.Name=gatewayid
&Instances.N.Dimensions.3.Value=网关 ID
&Instances.N.Dimensions.4.Name=instanceid
&Instances.N.Dimensions.4.Value=网关实例 ID
&Instances.N.Dimensions.5.Name=dlccatalog
&Instances.N.Dimensions.5.Value=DLC 数据目录名称
&Instances.N.Dimensions.6.Name=dlcdatabase
&Instances.N.Dimensions.6.Value=DLC 数据库名称
&Instances.N.Dimensions.7.Name=dlctable
&Instances.N.Dimensions.7.Value=DLC 数据表名称
&Instances.N.Dimensions.8.Name=dlctasktype
&Instances.N.Dimensions.8.Value=DLC 任务类型名称
&Instances.N.Dimensions.9.Name=sparkappid
&Instances.N.Dimensions.9.Value=Spark 作业 ID
&Instances.N.Dimensions.10.Name=taskid
&Instances.N.Dimensions.10.Value=任务 ID
&Instances.N.Dimensions.11.Name=creator
&Instances.N.Dimensions.11.Value=创建人名称
&Instances.N.Dimensions.12.Name=appId
&Instances.N.Dimensions.12.Value=主账号 appid