利用普罗米修斯联盟进行kubernetes监控。
正在尝试更改多个群集的提示查询:
计数
by(node) (sum by(node, cpu) (node_cpu_seconds_total{job="node-exporter"}
* on(namespace, pod) group_left(node) node_namespace_pod:kube_pod_info:))
For multiple clusters, the query is giving:
Error executing query: found duplicate series for the match group {namespace="monitoring", pod="prometheus-k8s-1"} on the right hand-side of the operation: [{__name__="node_namespace_pod:kube_pod_info:", clustername="xyz", environment="dev", job="prometheus", location="haha", namespace="monitoring", node="228d5f45-27cc-4a59-b99d-3bab9ebe3b52", pod="prometheus-k8s-1", prometheus="monitoring/k8s", prometheus_replica="prometheus-k8s-0"}, {__name__="node_namespace_pod:kube_pod_info:", clustername="abc", environment="dev", job="prometheus", location="haha", namespace="monitoring", node="3faf3dfa-f8ab-4b3f-bda7-6662c1aa2a34", pod="prometheus-k8s-1", prometheus="monitoring/k8s", prometheus_replica="prometheus-k8s-1"}];many-to-many matching not allowed: matching labels must be unique on one side已将群集名称作为外部标签添加到prometheus服务器。
请给我引路好吗?
发布于 2020-03-11 10:49:58
由于prometheus pod作为statefulset运行,所以会出现重复错误。Pod名称为prometheus-k8s-1,且不会更改。
您可能需要在on(namespace, pod)中使用一些其他参数,如uid或在任何情况下都是唯一的其他参数。
https://stackoverflow.com/questions/58062303
复制相似问题