Skip to content

Eliminate startup-script hasn't started message#2001

Merged
tpdownes merged 1 commit into
GoogleCloudPlatform:release-candidatefrom
tpdownes:rc_eliminate_not_yet_started_message
Nov 28, 2023
Merged

Eliminate startup-script hasn't started message#2001
tpdownes merged 1 commit into
GoogleCloudPlatform:release-candidatefrom
tpdownes:rc_eliminate_not_yet_started_message

Conversation

@tpdownes

Copy link
Copy Markdown
Contributor

The /etc/profile.d login prompt informational message makes the assumption that the VM is running a startup-script that uses our startup-script module. This assumption is broken when an image is built using our startup-script module and then a VM is booted with that image that does not execute our startup-script module. This assumption is also broken upon reboots of Slurm VMs because our script is wrapped inside a startup script solution developed by SchedMD that exits early when Slurm has previously started successfully. We can reconsider enabling this message more robustly as part of future work.

Submission Checklist

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cloud HPC Toolkit Contribution guidelines #

The /etc/profile.d login prompt informational message makes the
assumption that the VM is running a startup-script that uses our
startup-script module. This assumption is broken when an image is built
using our startup-script module and then a VM is booted with that image
that does not execute our startup-script module. This assumption is also
broken upon reboots of Slurm VMs because our script is wrapped inside
a startup script solution developed by SchedMD that exits early when
Slurm has previously started successfully. We can reconsider enabling
this message more robustly as part of future work.
@tpdownes

Copy link
Copy Markdown
Contributor Author

Can reproduce problem for Slurm using examples/image-builder.yaml. Can test behavior on vm-instance using:

# Copyright 2022 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#      http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

---
blueprint_name: startup-vm-instance

vars:
  project_id:  ## Set project id here
  deployment_name: testfix
  region: us-east4
  zone: us-east4-c

deployment_groups:
- group: first
  modules:
  - id: network1
    source: modules/network/vpc
  - id: script
    source: modules/scripts/startup-script
    settings:
      runners:
      - type: shell
        destination: tmp.sh
        content: |
          #!/bin/bash
          sleep 180
  - id: vm0
    source: modules/compute/vm-instance
    use:
    - network1
    - script
    settings:
      name_prefix: vm0
      machine_type: n1-standard-2
  - id: vm1
    source: modules/compute/vm-instance
    use:
    - network1
    settings:
      name_prefix: vm1
      machine_type: n1-standard-2

@tpdownes tpdownes enabled auto-merge November 28, 2023 19:51
@tpdownes tpdownes merged commit dc7deb9 into GoogleCloudPlatform:release-candidate Nov 28, 2023
@rohitramu rohitramu assigned tpdownes and unassigned rohitramu Nov 28, 2023
@tpdownes tpdownes deleted the rc_eliminate_not_yet_started_message branch November 28, 2023 23:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants