Skip to content
This repository was archived by the owner on Oct 16, 2020. It is now read-only.
This repository was archived by the owner on Oct 16, 2020. It is now read-only.

oem-gce.service crashlooping on version 2191.4.1 #2608

@george-angel

Description

@george-angel

Provider: GCE
CoreOS Container Linux version: 2191.4.1

$ rkt list
UUID            APP     IMAGE NAME                      STATE           CREATED         STARTED         NETWORKS
02c3d817        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  21 minutes ago  21 minutes ago
07b06c28        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  13 minutes ago  13 minutes ago
09314175        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  5 minutes ago   5 minutes ago
0be9b554        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  2 minutes ago   2 minutes ago
0ea0572f        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  22 minutes ago  22 minutes ago
11d1439d        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  16 minutes ago  16 minutes ago
130ecdf9        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  30 minutes ago  30 minutes ago
15fff556        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  27 minutes ago  27 minutes ago
16d68799        oem-gce coreos.com/oem-gce:2191.4.1     exited garbage  16 minutes ago  16 minutes ago
Aug 29 09:58:53 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: Starting GCE Linux Agent...
Aug 29 09:59:30 etcd-0-k8s-rlpv.c.uw-prod.internal rkt[11438]: + '[' -e /etc/default/instance_configs.cfg.template ']'
Aug 29 09:59:30 etcd-0-k8s-rlpv.c.uw-prod.internal rkt[11438]: + /usr/bin/google_instance_setup
Aug 29 09:59:30 etcd-0-k8s-rlpv.c.uw-prod.internal rkt[11438]: /init.sh: /usr/bin/google_instance_setup: /usr/lib/python-exec/python2.7/python: bad interpreter: No such file or directory
Aug 29 09:59:31 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: oem-gce.service: Main process exited, code=exited, status=126/n/a
Aug 29 09:59:31 etcd-0-k8s-rlpv.c.uw-prod.internal rkt[11486]: gc: moving pod "3de37879-8f93-4d1c-9717-997fd56715e2" to garbage
Aug 29 09:59:31 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: oem-gce.service: Failed with result 'exit-code'.
Aug 29 09:59:31 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: Failed to start GCE Linux Agent.
Aug 29 09:59:36 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: oem-gce.service: Service RestartSec=5s expired, scheduling restart.
Aug 29 09:59:36 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: oem-gce.service: Scheduled restart job, restart counter is at 285.
Aug 29 09:59:36 etcd-0-k8s-rlpv.c.uw-prod.internal systemd[1]: Stopped GCE Linux Agent.

This seems to be pushing the loadavg on the node and resulted in a high sync duration of the etcd on the nodes.

2135.6.0 - does not appear to have this problem.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions