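Message Queue for MQTT provides a comprehensive, multi-dimensional set of monitoring metrics covering the public network, connections, production and consumption, and authentication and authorization, so that you can track the running status of your resources in real time. This document describes what each MQTT monitoring metric means. You can use these metrics to assess resource health quickly and decide whether operations such as scaling up or down are needed to keep the service running stably.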
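Monitoring Metrics
The following tables describe all monitoring metrics that MQTT provides. Configure monitoring and alarm policies according to your actual business needs: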
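Category | Metric | Metric Identifier | Unit | Description | Statistical Rule |
Public network metrics | Public network outbound traffic | MqttPublicNetworkOutboundTrafficInstance | MB | Change in the public network traffic flowing out of the MQTT cluster. Public network fees are calculated from this metric | [ 60s, sum ] [ 300s, last ] |
| Public network inbound traffic | MqttPublicNetworkInboundTrafficInstance | MB | Change in the public network traffic of client requests to the MQTT cluster | [ 60s, sum ] [ 300s, last ] |
| Public network outbound bandwidth | MqttPublicNetworkOutboundBandwidthInstance | Mbps | Change in the public network bandwidth flowing out of the MQTT cluster | [ 60s, sum ] [ 300s, last ] |
| Public network inbound bandwidth | MqttPublicNetworkInboundBandwidthInstance | Mbps | Change in the public network bandwidth of client requests to the MQTT cluster | [ 60s, sum ] [ 300s, last ] |
| Public network dropped outbound bandwidth | MqttPublicNetworkDroppedOutboundBandwidthInstance | Bit/s | Reports a value when public network bandwidth is throttled. If it is greater than 0, increase the cluster's public network bandwidth limit | [ 60s, sum ] [ 300s, last ] |
| Public network dropped inbound bandwidth | MqttPublicNetworkDroppedInboundBandwidthInstance | Bit/s | Reports a value when public network bandwidth is throttled. If it is greater than 0, increase the cluster's public network bandwidth limit | [ 60s, sum ] [ 300s, last ] |
| Public network outbound bandwidth utilization | MqttPublicNetworkOutboundBandwidthUtilizationInstance | % | Bandwidth utilization when the MQTT cluster connects to clients over the public network, equal to current outbound bandwidth / public network bandwidth specification configured for the cluster * 100% | [ 60s, sum ] [ 300s, last ] |
| Public network inbound bandwidth utilization | MqttPublicNetworkInboundBandwidthUtilizationInstance | % | Bandwidth utilization when clients connect to the MQTT cluster over the public network, equal to current inbound bandwidth / public network bandwidth specification configured for the cluster * 100% | [ 60s, sum ] [ 300s, last ] |
| Public network new connections | MqttPublicNetworkNewConnectionsRateInstance | Count/s | Number of new connections per minute from clients to the MQTT cluster over the public network | [ 60s, sum ] [ 300s, last ] |
| Public network active connections | MqttPublicNetworkActiveConnectionsInstance | Count | Number of active connections from clients to the MQTT cluster over the public network | [ 60s, sum ] [ 300s, last ] |
| Public network inactive connections | MqttPublicNetworkInactiveConnectionsInstance | Count | Number of inactive connections from clients to the MQTT cluster over the public network | [ 60s, sum ] [ 300s, last ] |
| Public network concurrent connections | MqttPublicNetworkConcurrentConnectionsInstance | Count | Number of concurrent connections from clients to the MQTT cluster over the public network | [ 60s, sum ] [ 300s, last ] |
| Public network outbound packets | MqttPublicNetworkOutboundPacketsRateInstance | Count/s | Change in the public network outbound packet volume between clients and the MQTT cluster | [ 60s, sum ] [ 300s, last ] |
| Public network inbound packets | MqttPublicNetworkInboundPacketsRateInstance | Count/s | Change in the public network inbound packet volume between clients and the MQTT cluster | [ 60s, sum ] [ 300s, last ] |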
Connection metrics | Online clients | MqttClientCount | Count | Number of client connections currently established in the cluster. | [ 60s, sum ] [ 300s, last ] |
| Total CONNECT packets | MqttConnectPacketCount | Count | Change in the total number of CONNECT packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| CONNECT throttled count | MqttConnectLimitedCount | Count | - | [ 60s, sum ] [ 300s, last ] |
| Successful connections | MqttConnectSuccessCount | Count | Change in the number of CONNECT packets that successfully established a new connection. | [ 60s, sum ] [ 300s, last ] |
| Failed connections | MqttConnectFailCount | Count | Change in the number of CONNECT packets for which the connection failed. | [ 60s, sum ] [ 300s, last ] |
| DISCONNECT packets | MqttDisconnectPacketCount | Count | Change in the total number of DISCONNECT packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| PINGREQ packets | MqttPingreqPacketCount | Count | Number of PINGREQ packets received by the server. | [ 60s, sum ] [ 300s, last ] |
Production metrics | Messages produced | MqttInstanceSendMessageCount | Count | Number of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Average produced message size | MqttPublishAverageBytesInstance | Bytes | Average size of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Production traffic | MqttPublishTrafficBytesInstance | Bytes | Traffic of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Client production errors | MqttPublishClientErrorCountInstance | Count | Total number of times an error code was generated when clients produced the specified messages. | [ 60s, sum ] [ 300s, last ] |
| PUBLISH packets | MqttPublishPacketCount | Count | Number of PUBLISH packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| PUBLISH throttled count | MqttPublishLimitedCount | Count | - | [ 60s, sum ] [ 300s, last ] |
| PUBACK packets | MqttReceivedPubackCount | Count | Number of PUBACK packets sent by the server. | [ 60s, sum ] [ 300s, last ] |
| PUBREC packets | MqttReceivedPubrecCount | Count | Number of PUBREC packets sent by the server. | [ 60s, sum ] [ 300s, last ] |
| PUBREL packets | MqttPubrelPacketCount | Count | Number of PUBREL packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| PUBCOMP packets | MqttReceivedPubcompCount | Count | Number of PUBCOMP packets sent by the server. | [ 60s, sum ] [ 300s, last ] |
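Consumption metrics | Messages consumed | MqttInstanceConsumeMessageCount | Count | Number of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Average consumed message size | MqttConsumeAverageBytesInstance | Bytes | Average size of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Consumption traffic | MqttConsumeTrafficBytesInstance | Bytes | Traffic of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Client consumption errors | MqttConsumeClientErrorCountInstance | Count | Total number of times an error code was generated when clients consumed the specified messages. | [ 60s, sum ] [ 300s, last ] |
| Consumption retries | MqttConsumeRetryCountInstance | Count | Change in the number of automatic consumption retries after a consumption failure. | [ 60s, sum ] [ 300s, last ] |
| Online subscriptions | MqttSubscriptionCount | Count | Number of client subscriptions currently established in the cluster. | [ 60s, sum ] [ 300s, last ] |
| SUBSCRIBE packets | MqttSubscribePacketCount | Count | Number of SUBSCRIBE packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| SUBSCRIBE throttled count | MqttConsumeLimitedCount | Count | - | [ 60s, sum ] [ 300s, last ] |
| UNSUBSCRIBE packets | MqttUnsubscribePacketCount | Count | Number of UNSUBSCRIBE packets received by the server. | [ 60s, sum ] [ 300s, last ] |
| Shared subscription unconsumed messages | MqttShareBacklogCountInstance | Count | - | [ 60s, sum ] [ 300s, last ] |
| Shared subscription messages consumed | MqttShareConsumeCountInstance | Count | Number of messages consumed through shared subscriptions within a statistical period, that is, the Shared Subscription message consumption rate. | [ 60s, sum ] [ 300s, last ] |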
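Authentication and authorization metrics | Successful authentications | MqttAuthenticationSuccessCount | Count | Number of times clients passed authentication within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Failed authentications | MqttAuthenticationFailureCount | Count | Number of times client authentication failed within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Successful authorizations | MqttAuthorizationSuccessCount | Count | Number of successful ACL authorizations within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Failed authorizations | MqttAuthorizationFailureCount | Count | Number of failed ACL authorizations within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Successful CONNECT authorizations | MqttConnectAuthorizationSuccessCount | Count | Number of successful authorizations when clients send CONNECT packets. | [ 60s, sum ] [ 300s, last ] |
| Failed CONNECT authorizations | MqttConnectAuthorizationFailureCount | Count | Number of failed authorizations when clients send CONNECT packets. | [ 60s, sum ] [ 300s, last ] |
| Successful PUBLISH authorizations | MqttPublishAuthorizationSuccessCount | Count | Number of successful authorizations when clients send PUBLISH packets. | [ 60s, sum ] [ 300s, last ] |
| Failed PUBLISH authorizations | MqttPublishAuthorizationFailureCount | Count | Number of failed authorizations when clients send PUBLISH packets. | [ 60s, sum ] [ 300s, last ] |
| Successful SUBSCRIBE authorizations | MqttSubscribeAuthorizationSuccessCount | Count | Number of successful authorizations when clients send SUBSCRIBE packets. | [ 60s, sum ] [ 300s, last ] |
| Failed SUBSCRIBE authorizations | MqttSubscribeAuthorizationFailureCount | Count | Number of failed authorizations when clients send SUBSCRIBE packets. | [ 60s, sum ] [ 300s, last ] |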
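The bandwidth utilization formula and the dropped-bandwidth guidance in the table above can be sketched in a few lines of Python. This is a minimal illustration only: the function names and the sample numbers are hypothetical and do not come from any MQTT or monitoring SDK, and the inputs are assumed to be values you have already read from the corresponding metrics.

```python
def bandwidth_utilization(current_mbps, spec_mbps):
    """Utilization (%) = current bandwidth / configured public bandwidth spec * 100."""
    if spec_mbps <= 0:
        raise ValueError("spec_mbps must be positive")
    return current_mbps / spec_mbps * 100.0


def needs_bandwidth_increase(dropped_out_bps, dropped_in_bps):
    """Per the table, a dropped-bandwidth value above 0 means the limit should be raised."""
    return dropped_out_bps > 0 or dropped_in_bps > 0


# Hypothetical readings: 25 Mbps of outbound traffic on a 50 Mbps specification.
utilization = bandwidth_utilization(25.0, 50.0)
# Hypothetical readings: no outbound drops, some inbound drops (Bit/s).
raise_limit = needs_bandwidth_increase(0, 1200)
print(utilization, raise_limit)
```

A check like `needs_bandwidth_increase` is one possible basis for an alarm condition; the threshold you actually alert on depends on your own policy.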
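Category | Metric | Metric Identifier | Unit | Description | Statistical Rule |
Connection metrics | Online clients | MqttTopicClientCount | Count | Number of client connections currently established in the cluster. | [ 60s, sum ] [ 300s, last ] |
Production metrics | Messages produced | MqttTopicSendMessageCount | Count | Number of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Average produced message size | MqttPublishAverageBytesInstanceTopic | Bytes | Average size of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Production traffic | MqttPublishTrafficBytesInstanceTopic | Bytes | Traffic of messages sent on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Client production errors | MqttPublishClientErrorCountInstanceTopic | Count | Total number of times an error code was generated when clients produced the specified messages. | [ 60s, sum ] [ 300s, last ] |
Consumption metrics | Messages consumed | MqttTopicConsumeMessageCount | Count | Number of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Average consumed message size | MqttConsumeAverageBytesInstanceTopic | Bytes | Average size of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Consumption traffic | MqttConsumeTrafficBytesInstanceTopic | Bytes | Traffic of messages consumed on the current topic within a statistical period. | [ 60s, sum ] [ 300s, last ] |
| Client consumption errors | MqttConsumeClientErrorCountInstanceTopic | Count | Total number of times an error code was generated when clients consumed the specified messages. | [ 60s, sum ] [ 300s, last ] |
| Consumption retries | MqttConsumeRetryCountInstanceTopic | Count | Change in the number of automatic consumption retries after a consumption failure. | [ 60s, sum ] [ 300s, last ] |
| Online subscriptions | MqttTopicSubscriptionCount | Count | Number of client subscriptions currently established in the cluster. | [ 60s, sum ] [ 300s, last ] |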
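Overview of Parameters for Each Dimension
Parameter Name | Dimension Name | Dimension Description | Format |
Instances.N.Dimensions.0.Name | instanceId | Dimension name for the MQTT instance ID | Enter the String-type dimension name: instanceId |
Instances.N.Dimensions.0.Value | instanceId | ID of the specific MQTT instance | Enter the specific instance ID, for example: mqtt-qg44xwvj |
Instances.N.Dimensions.1.Name | qos | Dimension name for the MQTT QoS | Enter the String-type dimension name: qos |
Instances.N.Dimensions.1.Value | qos | Specific MQTT QoS level | Enter the specific QoS level, for example: 1 |
Instances.N.Dimensions.2.Name | topic | Dimension name for the MQTT topic filter | Enter the String-type dimension name: topic |
Instances.N.Dimensions.2.Value | topic | Specific MQTT topic filter | Enter the specific topic filter, for example: mytopic |
Instances.N.Dimensions.3.Name | sharegroup | Dimension name for the MQTT shared subscription group | Enter the String-type dimension name: sharegroup |
Instances.N.Dimensions.3.Value | sharegroup | Name of the specific MQTT shared subscription group | Enter the specific group, for example: mygroup |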
Input Parameters
To query QCE/MQTT monitoring data, set the input parameters as follows:
&Namespace=QCE/MQTT
&Instances.N.Dimensions.0.Name=instanceId
&Instances.N.Dimensions.0.Value=instance ID
&Instances.N.Dimensions.1.Name=qos
&Instances.N.Dimensions.1.Value=specific QoS level
&Instances.N.Dimensions.2.Name=topic
&Instances.N.Dimensions.2.Value=specific topic filter
&Instances.N.Dimensions.3.Name=sharegroup
&Instances.N.Dimensions.3.Value=name of the specific shared subscription group
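
The parameter layout above can be assembled programmatically. The following is a minimal sketch, assuming that `N` is a zero-based instance index and that dimensions are numbered in the order listed above. The helper name `build_dimension_params` and the sample values are hypothetical. The sketch only builds the query string: the metric name, statistical period, request signing, and endpoint are outside its scope.

```python
from urllib.parse import urlencode


def build_dimension_params(dimensions, instance_index=0, namespace="QCE/MQTT"):
    """Flatten ordered (name, value) pairs into Instances.N.Dimensions.M.* keys."""
    params = {"Namespace": namespace}
    for m, (name, value) in enumerate(dimensions):
        prefix = f"Instances.{instance_index}.Dimensions.{m}"
        params[f"{prefix}.Name"] = name
        params[f"{prefix}.Value"] = str(value)
    return params


# Sample values are placeholders; replace them with your own instance, QoS, topic filter, and group.
params = build_dimension_params([
    ("instanceId", "mqtt-qg44xwvj"),
    ("qos", 1),
    ("topic", "mytopic"),
    ("sharegroup", "mygroup"),
])
query = urlencode(params)  # percent-encodes reserved characters such as "/"
print(query)
```

Passing the dimensions as an ordered list keeps the `Dimensions.M` indices contiguous, so a query that needs only `instanceId` can pass a one-element list.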