NE 484: Use ingress-operator subcommand instead of hello-openshift #561
Conversation
|
/retest |
|
Operator unavailable (OAuthServiceCheckEndpointAccessibleController_EndpointUnavailable::OAuthServiceEndpointsCheckEndpointAccessibleController_EndpointUnavailable::OAuthVersionDeployment_MissingDeployment::ReadyIngressNodes_NoReadyIngressNodes::WellKnown_NotReady): OAuthServiceEndpointsCheckEndpointAccessibleControllerAvailable: Failed to get oauth-openshift enpoints
/retest |
718c8f3 to e266bc4
fae1c1a to a4e07bc
|
/retest |
a4e07bc to 24441ba
|
/retest |
fbf5afe to b96e462
|
fail [github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/leafnodes/runner.go:64]: kube-apiserver reports a non-graceful termination: ... Message:"Previous pod kube-apiserver-ip-10-0-236-113.ec2.internal started at 2021-03-08 23:10:05.437889569 +0000 UTC did not terminate gracefully",... Probably kubelet or CRI-O is not giving the time to cleanly shut down. This can lead to connection refused and network I/O timeout errors in other components. |
|
Observed 1.0416666666666667 leader changes in 22m55s: Leader changes are a result of stopping the etcd leader process or from latency (disk or network), review etcd performance metrics |
|
fail [github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/leafnodes/runner.go:64]: kube-apiserver reports a non-graceful termination |
b96e462 to 7edb66d
|
fail [github.com/openshift/origin/test/extended/util/disruption/controlplane/controlplane.go:118]: Mar 10 19:27:57.021: API "oauth-api-available-new-connections" was unreachable during disruption for at least 32s of 57m30s (1%): |
|
fail [github.com/openshift/origin/test/extended/util/disruption/controlplane/controlplane.go:118]: Mar 10 21:59:37.344: API "kubernetes-api-available-reused-connections" was unreachable during disruption for at least 44s of 1h0m46s (1%): |
|
|
Mar 15 19:52:06.573: INFO: cluster upgrade is Progressing: Unable to apply 4.8.0-0.ci.test-2021-03-15-182041-ci-op-ivhwh5xi: an unknown error has occurred: MultipleErrors
|
|
Mar 15 23:32:05.850: FAIL: API "oauth-api-available-new-connections" was unreachable during disruption for at least 3s of 1h3m21s (0%): |
|
fail [github.com/openshift/origin/test/extended/util/disruption/controlplane/controlplane.go:118]: Mar 16 15:44:32.024: API "openshift-api-available-new-connections" was unreachable during disruption for at least 13s of 1h3m26s (0%): |
Miciah left a comment
Looks great overall. Some very minor comments.
func TestDesiredCanaryDaemonSet(t *testing.T) {
	canaryImage := "openshift/hello-openshift:latest"
	// canaryImageName is the ingress-operator image
	var canaryImageName = "openshift/origin-cluster-ingress-operator:latest"
Suggested change:
- var canaryImageName = "openshift/origin-cluster-ingress-operator:latest"
+ const canaryImageName = "openshift/origin-cluster-ingress-operator:latest"
Moved it to local variable in the only function it is used.
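For readers following the thread, here is a minimal sketch of what a function-local constant can look like in Go. It is not the code from this PR: `desiredImage` and `checkDesiredImage` are hypothetical stand-ins, and whether the author ended up with `const` or `var` after the move is not stated here.

```go
package main

import "fmt"

// desiredImage is a hypothetical stand-in for the code under test.
// It only echoes the image it is given.
func desiredImage(image string) string {
	return image
}

// checkDesiredImage keeps the image name local to the only function that
// uses it. const fits because the value never changes after declaration.
func checkDesiredImage() error {
	const canaryImageName = "openshift/origin-cluster-ingress-operator:latest"

	if got := desiredImage(canaryImageName); got != canaryImageName {
		return fmt.Errorf("expected image %q, got %q", canaryImageName, got)
	}
	return nil
}

func main() {
	if err := checkDesiredImage(); err != nil {
		fmt.Println("check failed:", err)
		return
	}
	fmt.Println("check passed")
}
```

Scoping the name to one function avoids a package-level identifier that nothing else needs.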
|
Oh, one question. The PR's description has the following
Was that supposed to be an environment variable? I don't see that change. |
|
pods should never transition back to pending |
No, it is an artifact of a different design. I will remove that from the description. |
7edb66d to 63bbc10
|
failed to acquire lease |
if err == nil {
	fmt.Println("Servicing request.")
} else {
	log.Error(err, "could not serve health check")
Hm, if we can use the logger for errors, can we use it also (log.Info) for success?
Changed to use fmt.Println for both to be consistent with a web server pattern.
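A rough sketch of the pattern settled on in this thread, where success and failure are both reported through `fmt.Println`. This is an illustration under assumptions, not the handler from the PR: the names `healthCheckHandler` and `probe` and the response body are hypothetical. The handler is exercised in-process with `net/http/httptest` so the example terminates without opening a port.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
)

// healthCheckHandler is a hypothetical health check handler. It writes a
// fixed body (an assumed value) and reports both outcomes with fmt.Println,
// so success and failure messages go through the same channel.
func healthCheckHandler(w http.ResponseWriter, r *http.Request) {
	if _, err := io.WriteString(w, "Healthcheck requested\n"); err != nil {
		fmt.Println("could not serve health check:", err)
		return
	}
	fmt.Println("Servicing request.")
}

// probe calls a handler in-process and returns the status code and body.
// It is a helper for this sketch only.
func probe(h http.HandlerFunc) (int, string) {
	rec := httptest.NewRecorder()
	req := httptest.NewRequest(http.MethodGet, "/", nil)
	h(rec, req)
	return rec.Code, rec.Body.String()
}

func main() {
	code, body := probe(healthCheckHandler)
	fmt.Printf("status=%d body=%q\n", code, body)
}
```

In a real server the handler would be registered with `http.HandleFunc` and served with `http.ListenAndServe`; the listen address is left out here because it is not given in this thread.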
63bbc10 to 8332af2
|
E0318 16:32:18.173056 25 portforward.go:400] an error occurred forwarding 37587 -> 80: error forwarding port 80 to pod 2dbbd646c9283c765567808154bf8078e306d5411aa5d9f7f5e3bb7fd208b598, uid : port forward into network namespace "/var/run/netns/0025377f-b51d-4534-8602-cced2e88d979": read tcp 127.0.0.1:45086->127.0.0.1:80: read: connection reset by peer
/test e2e-aws-operator |
|
fail [github.com/openshift/origin/test/extended/util/disruption/controlplane/controlplane.go:118]: Mar 18 17:18:03.551: API "kubernetes-api-available-reused-connections" was unreachable during disruption for at least 22s of 1h4m12s (1%): |
|
error: failed to push image registry.build02.ci.openshift.org/ci-op-8sfgxk35/release:latest: unable to upload the new image manifest: received unexpected HTTP status: 500 Internal Server Error |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: candita, Miciah |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
3 similar comments
All consumers have been moved to other images: openshift/cluster-ingress-operator#561 openshift/console#8459
ART hasn't been building these images since 4.8's a6286e7 (rm hello-openshift, 2021-03-29), with the router/ingress folks pivoting away in openshift/cluster-ingress-operator@8332af2f56 (NE 484: Use ingress-operator subcommand instead of hello-openshift for canary server, 2021-03-18, openshift/cluster-ingress-operator#561). Samples had been installing a hello-openshift ImageStream, but they also got away from that in 2021 with openshift/cluster-samples-operator@d94ad97e49 (delete hello-openshift in payload imagestream via CVO annotation, 2021-06-24, openshift/cluster-samples-operator#380). Origin hasn't entirely gotten out of the hello-openshift business yet [1], but regardless of whether they still have an interest, neither ART nor ingress is involved anymore, and it's been a long time since 4.7 went end-of-life:

  $ curl -s 'https://access.redhat.com/product-life-cycles/api/v1/products?name=Openshift+Container+Platform+4' | jq -r '.data[].versions[] | select(.name == "4.7").phases[] | .date + " " + .name'
  2021-02-24T00:00:00.000Z General availability
  2021-10-27T00:00:00.000Z Full support
  2022-08-24T00:00:00.000Z Maintenance support
  N/A Extended update support
  N/A Extended update support Term 2
  N/A Extended life phase

So I'm dropping those lines from the mapping.

[1]: https://github.com/openshift/origin/tree/94a4d2a6f202f64f89a1b747b44484c358f8299d/examples/hello-openshift
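The jq filter in the message above selects the version named "4.7" and prints each phase as date plus name. A rough Go equivalent is sketched here for illustration. The struct layout is inferred only from the field names the filter touches (`data`, `versions`, `name`, `phases`, `date`), the embedded `sample` is a trimmed, hypothetical payload reusing two of the dates shown in the output, and `phasesFor` is a made-up helper name. The real API response may carry more fields.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// lifecycle mirrors only the fields the jq filter reads. The shape is an
// assumption derived from the filter, not from API documentation.
type lifecycle struct {
	Data []struct {
		Versions []struct {
			Name   string `json:"name"`
			Phases []struct {
				Name string `json:"name"`
				Date string `json:"date"`
			} `json:"phases"`
		} `json:"versions"`
	} `json:"data"`
}

// sample is a trimmed, hypothetical payload for illustration only.
const sample = `{"data":[{"versions":[{"name":"4.7","phases":[
  {"name":"General availability","date":"2021-02-24T00:00:00.000Z"},
  {"name":"Maintenance support","date":"2022-08-24T00:00:00.000Z"}
]}]}]}`

// phasesFor returns one "date name" line per phase of the given version,
// matching the jq expression: select(.name == version).phases[].
func phasesFor(raw []byte, version string) ([]string, error) {
	var lc lifecycle
	if err := json.Unmarshal(raw, &lc); err != nil {
		return nil, err
	}
	var out []string
	for _, product := range lc.Data {
		for _, v := range product.Versions {
			if v.Name != version {
				continue
			}
			for _, p := range v.Phases {
				out = append(out, p.Date+" "+p.Name)
			}
		}
	}
	return out, nil
}

func main() {
	lines, err := phasesFor([]byte(sample), "4.7")
	if err != nil {
		fmt.Println("decode failed:", err)
		return
	}
	for _, line := range lines {
		fmt.Println(line)
	}
}
```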
NE 484: Use ingress-operator subcommand instead of hello-openshift for canary server
serve-healthcheck