嗨,我们在opsgenie错误中得到了下面的错误:对于GET节点,API服务器有一个5.21555555555553秒的99%延迟。
因此,请帮助解决此问题
描述: 1.INT_Prometheus:触发:3 KubeAPILatencyHigh (https apiserver默认监控/k8s 0.99 kubernetes 3)
Source /#/alerts?receiver=opsgenie Integration INT_PROMETHEUS ( prometheus )响应器资源所有者Team FCA_EMEA - FCA_EMEA别名- KubeAPILatencyHigh = https - job = apiserver - namespace = default -prometheus= monitoring/k8s - quantile = 0.99 - resource = nodes - scope = cluster - service = kubernetes - severity =3- verb = GET Last Updated At Apr 28,2020年7时59分描述警报触发:标签:- alertname = https endpoint =https- job = apiserver - namespace = default - prometheus = monitoring/k8s - quantile = 0.99 - resource = nodes - scope = cluster - service = kubernetes - severity =3- verb = GET Annotations:- message = KubeAPILatencyHigh服务器对于GET节点有5.21555555555553秒的99%延迟。S标签:- alertname = apiserver - KubeAPILatencyHigh = https - job =apiserver- namespace = default - prometheus =监控/k8s-default= 0.99 - resource = pods - scope = namespace - service = kubernetes - severity =3- verb = GET注解:- message = API服务器对于GET pods有8秒的99%延迟。
标签:- alertname = apiserver - KubeAPILatencyHigh = https - job =apiserver- namespace = default - prometheus = monitoring/k8s - quantile = 0.99 - resource = pods - scope = namespace - service = kubernetes - severity =3- subresource = status - server = PUT注解:- message =对于PUT pods,消息服务器具有8秒的99%延迟。
发布于 2020-04-27 12:38:12
Kubernetes API服务器使用ETCD作为所有kubernetes对象的后备存储。我会从查看ETCD服务器的日志开始。还可以在EtcdHighCommitDurations、EtcdHighFsyncDurations、EtcdHighNumberOfFailedGRPCRequests上设置警报,以了解ETCD是否有任何问题。
https://stackoverflow.com/questions/61451346
复制相似问题