Skip to content

Version 0.0.31 Sandbox creation failed (exit 1). #2760

@stephonye

Description

@stephonye

Description

Building image openshell/sandbox-from:1777544316 from /tmp/nemoclaw-build-eYmlS9/Dockerfile
Context: /tmp/nemoclaw-build-eYmlS9
Gateway: nemoclaw
Building image openshell/sandbox-from:1777544316 from /tmp/nemoclaw-build-eYmlS9/Dockerfile
Step 1/56 : ARG BASE_IMAGE=ghcr.io/nvidia/nemoclaw/sandbox-base@sha256:37311755be54909e10cfc64d496a5d5fcb4587fe1a7754ecfbe00e377117c93c
Step 2/56 : FROM node:22-slim@sha256:4f77a690f2f8946ab16fe1e791a3ac0667ae1c3575c3e4d0d4589e9ed5bfaf3d AS builder
---> c7d2ba7d15bc
Step 3/56 : ENV NPM_CONFIG_AUDIT=false NPM_CONFIG_FUND=false NPM_CONFIG_UPDATE_NOTIFIER=false
---> Using cache
---> 8c550f886edb
Step 4/56 : COPY nemoclaw/package.json nemoclaw/package-lock.json nemoclaw/tsconfig.json /opt/nemoclaw/
---> Using cache
---> 8a0811198a59
Step 5/56 : COPY nemoclaw/src/ /opt/nemoclaw/src/
---> Using cache
---> 9faa35b80202
Step 6/56 : WORKDIR /opt/nemoclaw
---> Using cache
---> b2736295f219
Step 7/56 : RUN npm ci && npm run build
---> Running in 728ada42b827
npm error code EUSAGE
npm error
npm error npm ci can only install packages when your package.json and package-lock.json or npm-shrinkwrap.json are in sync. Please update your lock file with npm install before continuing.
npm error
npm error Missing: @emnapi/core@1.10.0 from lock file
npm error Missing: @emnapi/runtime@1.10.0 from lock file
npm error Missing: @emnapi/core@1.9.2 from lock file
npm error Missing: @emnapi/runtime@1.9.2 from lock file

npm error
npm error Clean install a project
npm error
npm error Usage:
npm error npm ci
npm error
npm error Options:
npm error [--install-strategy <hoisted|nested|shallow|linked>] [--legacy-bundling]
npm error [--global-style] [--omit <dev|optional|peer> [--omit <dev|optional|peer> ...]]
npm error [--include <prod|dev|optional|peer> [--include <prod|dev|optional|peer> ...]]
npm error [--strict-peer-deps] [--foreground-scripts] [--ignore-scripts] [--no-audit]
npm error [--no-bin-links] [--no-fund] [--dry-run]
npm error [-w|--workspace [-w|--workspace ...]]
npm error [-ws|--workspaces] [--include-workspace-root] [--install-links]
npm error
npm error aliases: clean-install, ic, install-clean, isntall-clean
npm error
npm error Run "npm help ci" for more info

npm error A complete log of this run can be found in: /root/.npm/_logs/2026-04-30T10_18_36_626Z-debug-0.log

Error: × Docker build stream error
╰─▶ Docker stream error: The command '/bin/sh -c npm ci && npm run build'
returned a non-zero code: 1
Try: openshell sandbox list # check gateway state
Recovery: nemoclaw onboard --resume
Or: nemoclaw onboard

Reproduction Steps

1.curl -fsSL https://www.nvidia.com/nemoclaw.sh | bash
2. npm error exit

Environment

-Device: Intel NUC amd64
-OS: Ubuntu 24.04
-Architecture: amd64
-Node.js: v24.13.0
-npm: 11.6.2
-Docker: 29.4.1 (build 055a478)
-OpenShell CLI: 0.0.36
-NemoClaw: v0.0.31
-OpenClaw: 2026.4.9

Debug Output

devops01@ailab:~$ nemoclaw debug --quick
[debug] Collecting diagnostics for sandbox 'No'...
[debug] Quick mode: true


═══ System ═══

Thu Apr 30 06:30:28 PM +08 2026
Linux ailab 6.8.0-110-generic #110-Ubuntu SMP PREEMPT_DYNAMIC Thu Mar 19 15:09:20 UTC 2026 x86_64 x86_64 x86_64 GNU/Linux
 18:30:28 up 58 min,  2 users,  load average: 0.24, 0.32, 0.53
               total        used        free      shared  buff/cache   available
Mem:           23913        3294       12811          24        8233       20619
Swap:           8191           0        8191

═══ Processes ═══

    PID    PPID CMD                         %MEM %CPU
 127439    7806 node /home/devops01/.nvm/ve  0.2 64.7
  14685   14513 k3s server                   2.2  7.8
   2703    2226 java -Xmx1G -Xms1G -server   2.6  2.1
   1376       1 /usr/bin/dockerd -H fd:// -  0.5  1.6
   2727    2303 /usr/local/bin/python3 -m u  3.3  1.2
   2782    2322 redis-server *:6379          0.0  0.8
  14728   14685 containerd                   0.6  0.7
  19013   16925 /metrics-server --cert-dir=  0.2  0.5
   1103       1 /usr/bin/containerd          0.2  0.3
   2661    2118 /usr/local/bin/python3.10 /  0.3  0.2
   2540    1982 /usr/local/bin/python3.10 /  0.3  0.2
  17854   16971 /coredns -conf /etc/coredns  0.2  0.2
   1033       1 /snap/snapd/current/usr/lib  0.1  0.1
   1150       1 /usr/lib/systemd/systemd --  0.0  0.1
      1       0 /sbin/init                   0.0  0.1
   2490    1991 /portainer                   0.4  0.0
  14490       1 /usr/bin/containerd-shim-ru  0.0  0.0
   2322       1 /usr/bin/containerd-shim-ru  0.0  0.0
     17       2 [rcu_preempt]                0.0  0.0
   1083       1 /opt/Synology/ActiveBackupf  0.0  0.0
   1101       1 /snap/canonical-livepatch/3  0.0  0.0
  17400   16880 /agent-sandbox-controller    0.1  0.0
    444       1 /usr/lib/systemd/systemd-jo  0.0  0.0
  14513   14490 /bin/k3s init                0.5  0.0
  92403       2 [kworker/u24:6-iou_exit]     0.0  0.0
    377       2 [jbd2/nvme0n1p2-8]           0.0  0.0
  20517   19656 openshell-gateway --port 80  0.0  0.0
  59138       2 [kworker/u24:1-iou_exit]     0.0  0.0
   1098       1 /usr/local/bin/ollama serve  0.1  0.0

═══ GPU ═══

  (nvidia-smi not found, skipping)

═══ Docker ═══

CONTAINER ID   IMAGE                                                                       COMMAND                  CREATED          STATUS                      PORTS                                                                                                NAMES
728ada42b827   b2736295f219                                                                "/bin/sh -c 'npm ci …"   11 minutes ago   Exited (1) 11 minutes ago                                                                                                        elated_villani
caf1ab7a430b   b2736295f219                                                                "/bin/sh -c 'npm ci …"   27 minutes ago   Exited (1) 27 minutes ago                                                                                                        exciting_hellman
c85f9b48f071   b2736295f219                                                                "/bin/sh -c 'npm ci …"   37 minutes ago   Exited (1) 37 minutes ago                                                                                                        zen_keldysh
2cd1c1822798   b2736295f219                                                                "/bin/sh -c 'npm ci …"   46 minutes ago   Exited (1) 46 minutes ago                                                                                                        upbeat_feynman
a28f0f9d4931   ghcr.io/nvidia/openshell/cluster:0.0.36                                     "/usr/local/bin/clus…"   51 minutes ago   Up 51 minutes (healthy)     0.0.0.0:8080->30051/tcp                                                                              openshell-cluster-nemoclaw
c8abdc2e6a1c   redis:7-alpine                                                              "docker-entrypoint.s…"   11 days ago      Up 58 minutes (healthy)     0.0.0.0:16379->6379/tcp, [::]:16379->6379/tcp                                                        uniglobal-redis
fb21133964f6   portainer/portainer-ee:latest                                               "/portainer"             5 weeks ago      Up 58 minutes               0.0.0.0:8000->8000/tcp, [::]:8000->8000/tcp, 0.0.0.0:9443->9443/tcp, [::]:9443->9443/tcp, 9000/tcp   portainer
97bb314c6242   glm-ocr-api-api-gateway                                                     "uvicorn main:app --…"   6 weeks ago      Up 58 minutes               0.0.0.0:28801->28801/tcp, [::]:28801->28801/tcp                                                      uniglobal-fastapi
8ff275e6801a   glm-ocr-api-celery-worker                                                   "celery -A tasks.cel…"   6 weeks ago      Up 58 minutes               28801/tcp                                                                                            uniglobal-celery-worker
e78f0de0f189   confluentinc/cp-kafka:7.7.0                                                 "/etc/confluent/dock…"   7 weeks ago      Up 58 minutes               0.0.0.0:9092-9093->9092-9093/tcp, [::]:9092-9093->9092-9093/tcp                                      kafka
5dece4169940   sandbox-registry.cn-zhangjiakou.cr.aliyuncs.com/opensandbox/vscode:latest   "code-server --auth …"   8 weeks ago      Up 58 minutes               0.0.0.0:9090->8080/tcp                                                                               vscode-sandbox
544e159ba62a   ghcr.io/open-webui/open-webui:main                                          "bash start.sh"          8 weeks ago      Up 58 minutes (healthy)     0.0.0.0:12000->8080/tcp, [::]:12000->8080/tcp                                                        open-webui
d1a6b073cdab   rostislavdugin/postgresus:latest                                            "/app/start.sh"          4 months ago     Up 58 minutes               0.0.0.0:4005->4005/tcp, [::]:4005->4005/tcp                                                          postgresus
438faac12343   gc_kb_server-ubuntu                                                         "bash -c 'service ss…"   5 months ago     Up 58 minutes                                                                                                                    gc-kb-server
CONTAINER ID   NAME                         CPU %     MEM USAGE / LIMIT     MEM %     NET I/O           BLOCK I/O        PIDS
a28f0f9d4931   openshell-cluster-nemoclaw   0.57%     337.1MiB / 23.35GiB   1.41%     189MB / 3.44MB    430kB / 852MB    88
c8abdc2e6a1c   uniglobal-redis              2.99%     15.62MiB / 23.35GiB   0.07%     2.29MB / 2.15MB   12.7MB / 0B      6
fb21133964f6   portainer                    0.00%     132.2MiB / 23.35GiB   0.55%     14MB / 4.43MB     106MB / 14.5MB   15
97bb314c6242   uniglobal-fastapi            0.20%     93.64MiB / 23.35GiB   0.39%     9.78kB / 126B     36.3MB / 0B      1
8ff275e6801a   uniglobal-celery-worker      0.18%     210.4MiB / 23.35GiB   0.88%     2.16MB / 2.28MB   27.4MB / 4.1kB   5
e78f0de0f189   kafka                        1.29%     636.2MiB / 23.35GiB   2.66%     631kB / 953kB     149MB / 42.1MB   110
5dece4169940   vscode-sandbox               0.00%     90.86MiB / 23.35GiB   0.38%     9.48kB / 126B     60.5MB / 4.1kB   22
544e159ba62a   open-webui                   0.20%     996.7MiB / 23.35GiB   4.17%     22.8kB / 2.81kB   593MB / 193kB    43
d1a6b073cdab   postgresus                   0.00%     142MiB / 23.35GiB     0.59%     84.1kB / 61.4kB   103MB / 3.97MB   22
438faac12343   gc-kb-server                 0.00%     180MiB / 23.35GiB     0.75%     0B / 0B           132MB / 4.1kB    2

═══ OpenShell ═══

Server Status

  Gateway: nemoclaw
  Server: https://127.0.0.1:8080
  Status: Connected
  Version: 0.0.36
No sandboxes found.

Error:   × status: NotFound, message: "sandbox not found", details: [], metadata:
  │ MetadataMap { headers: {"content-type": "application/grpc", "date": "Thu,
  │ 30 Apr 2026 10:30:30 GMT"} }
  (command exited with non-zero status)

Error:   × status: NotFound, message: "sandbox not found", details: [], metadata:
  │ MetadataMap { headers: {"content-type": "application/grpc", "date": "Thu,
  │ 30 Apr 2026 10:30:30 GMT"} }
  (command exited with non-zero status)

═══ Onboard Session ═══

{
  "version": 1,
  "sessionId": "1777542657975-toejet1t",
  "status": "failed",
  "resumable": true,
  "mode": "interactive",
  "startedAt": "2026-04-30T09:50:57.974Z",
  "updatedAt": "2026-04-30T10:18:37.334Z",
  "sandboxName": "dtms-claw-ts",
  "provider": "compatible-endpoint",
  "model": "Qwen3.6-35B-A3B-FP8",
  "endpointUrl": "http://10.110.18.230:12600/v1",
  "credentialEnv": "COMPATIBLE_API_KEY",
  "preferredInferenceApi": "openai-completions",
  "nimContainer": null,
  "policyPresets": null,
  "lastStepStarted": "sandbox",
  "lastCompletedStep": "inference",
  "failure": {
    "step": "sandbox",
    "message": "Onboarding exited before the step completed.",
    "recordedAt": "2026-04-30T10:18:37.334Z"
  },
  "steps": {
    "preflight": {
      "status": "complete",
      "startedAt": "2026-04-30T09:50:57.976Z",
      "completedAt": "2026-04-30T09:50:58.424Z",
      "error": null
    },
    "gateway": {
      "status": "complete",
      "startedAt": "2026-04-30T10:01:55.991Z",
      "completedAt": "2026-04-30T10:01:56.052Z",
      "error": null
    },
    "sandbox": {
      "status": "failed",
      "startedAt": "2026-04-30T10:18:15.773Z",
      "completedAt": null,
      "error": "Onboarding exited before the step completed."
    },
    "provider_selection": {
      "status": "complete",
      "startedAt": "2026-04-30T09:50:58.490Z",
      "completedAt": "2026-04-30T09:51:51.555Z",
      "error": null
    },
    "inference": {
      "status": "complete",
      "startedAt": "2026-04-30T09:52:02.448Z",
      "completedAt": "2026-04-30T10:18:02.665Z",
      "error": null
    },
    "openclaw": {
      "status": "pending",
      "startedAt": null,
      "completedAt": null,
      "error": null
    },
    "agent_setup": {
      "status": "pending",
      "startedAt": null,
      "completedAt": null,
      "error": null
    },
    "policies": {
      "status": "pending",
      "startedAt": null,
      "completedAt": null,
      "error": null
    }
  }
}

═══ Sandbox Internals ═══


Error:   × status: NotFound, message: "sandbox not found", details: [], metadata:
  │ MetadataMap { headers: {"content-type": "application/grpc", "date": "Thu,
  │ 30 Apr 2026 10:30:30 GMT"} }
  (command exited with non-zero status)

Error:   × status: NotFound, message: "sandbox not found", details: [], metadata:
  │ MetadataMap { headers: {"content-type": "application/grpc", "date": "Thu,
  │ 30 Apr 2026 10:30:30 GMT"} }
  (command exited with non-zero status)

═══ Kernel Messages ═══


dmesg: read kernel buffer failed: Operation not permitted

[debug] Done. If filing a bug, run with --output and attach the tarball to your issue:
[debug]   nemoclaw debug --output /tmp/nemoclaw-debug.tar.gz
devops01@ailab:~$

Logs

Checklist

  • I confirmed this bug is reproducible
  • I searched existing issues and this is not a duplicate

Metadata

Metadata

Labels

area: sandboxOpenShell sandbox lifecycle, runtime, config, or recoveryfixed-on-latestVerified by verify-stale skill: bug not reproducible on latestplatform: containerAffects Docker, containerd, Podman, or images

Type

No fields configured for Bug.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions