简单地说,我咀嚼了我的kubeadm
集群,并试图从头开始(我无法对集群进行适当的拆卸,这就是为什么我被迫走kubeadm reset
和sudo yum remove kubeadm kubectl kubelet kubernetes-cni kube*
的这条路径,减去将节点从集群中移除的原因)。
我删除了所有软件包(运行在RHEL 7上),现在正在尝试重新安装,但在启动时获得了错误:
-- Unit kubelet.service has begun starting up.
Nov 13 12:23:50 thalia2.ahc.umn.edu kubelet[31361]: F1113 12:23:50.681719 31361 server.go:190] failed to load Kubelet config file /var/lib/kubelet/config.yaml, error failed to
read kubelet config file "/var/lib/kubelet/config.yaml", error: open /var/lib/kubelet/config.yaml: no such file or directory
Nov 13 12:23:50 thalia2.ahc.umn.edu systemd[1]: kubelet.service: main process exited, code=exited, status=255/n/a
Nov 13 12:23:50 thalia2.ahc.umn.edu systemd[1]: Unit kubelet.service entered failed state.
Nov 13 12:23:50 thalia2.ahc.umn.edu systemd[1]: kubelet.service failed.
Nov 13 12:24:00 thalia2.ahc.umn.edu systemd[1]: kubelet.service holdoff time over, scheduling restart.
Nov 13 12:24:00 thalia2.ahc.umn.edu systemd[1]: Started kubelet: The Kubernetes Node Agent.
-- Subject: Unit kubelet.service has finished start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit kubelet.service has finished starting up.
--
我看到的关于这个错误的每一篇文章都指出,由于我没有运行/var/lib/kubelet/config.yaml
,所以丢失了这个文件( kubeadm init
),但是我没有创建一个主节点--我只想像运行安装-kubeadm-kubelet-和-kubectl一样运行这个新的安装。
我漏掉了什么明显的东西吗?我不记得在几个月前的初始安装中启动kubelet service
之前在每个节点上运行过D8
,另外,在安装软件包的说明中,没有哪个地方说要执行Kubeadm init.
。
还要注意的是:在我以前的主节点上,在尝试重新安装和启动kubectl
之后,我得到了错误:
Get https://134.84.53.162:6443/api/v1/pods?fieldSelector=spec.nodeName%3Dthalia0.ahc.umn.edu&limit=500&resourceVersion=0: dial tcp 134.84.53.162:6443: connect: connection refused
Nov 13 13:40:40 thalia0.ahc.umn.edu kubelet[17689]: W1113 13:40:40.487124 17689 cni.go:188] Unable to update cni config: No networks found in /etc/cni/net.d
Nov 13 13:40:40 thalia0.ahc.umn.edu kubelet[17689]: E1113 13:40:40.487404 17689 kubelet.go:2167] Container runtime network not ready: NetworkReady=false reason:NetworkPluginNot
Ready message:docker: network plugin is not ready: cni config uninitialized
但是,我以为我通过做kubeadm reset
删除了所有的东西?
我确实删除了前主节点( calico
)上的sudo rm -rf /var/etcd/calico-data
文件夹,然后重新启动了kubectl,现在正在获取此错误:
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.447381 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.547732 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.648002 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.748219 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.849201 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:37 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:37.950018 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:38 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:38.050193 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:38 thalia0.ahc.umn.edu kubelet[29725]: E1113 13:55:38.150479 29725 kubelet.go:2236] node "thalia0.ahc.umn.edu" not found
Nov 13 13:55:38 thalia0.ahc.umn.edu kubelet[29725]: I1113 13:55:38.173736 29725 kubelet_node_status.go:276] Setting node annotation to enable volume controller attach/detach
Nov 13 13:55:38 thalia0.ahc.umn.edu kubelet[29725]: I1113 13:55:38.180154 29725 kubelet_node_status.go:70] Attempting to register node thalia0.ahc.umn.edu
所以,我原来的集群里似乎有“残余物”。是否有一个kubeadm reset
和sudo yum remove kubeadm kubectl kubelet kubernetes-cni kube*
没有删除的隐藏包?
发布于 2018-11-14 12:21:12
问题是/var
目录已经满了。把它清理干净,现在一切都按预期开始了。
发布于 2018-11-14 08:29:36
我以为我做了
kubeadm reset
就把所有的东西都删除了?
关于转到文件,它不是删除所有内容。它只会逆转更改,所以在删除包并破坏所有路径的地方,运行kubeadm init
重新初始化集群。
https://serverfault.com/questions/939867
复制相似问题