Skip to content

[All Platforms][Docs] inference/inference-options has wrong note for vLLM in provider status #3684

@zNeill

Description

@zNeill

Description

Issue summary:
In docs/inference/inference-options.md, Provider Status table incorrectly states that Local vLLM requires both NEMOCLAW_EXPERIMENTAL=1 and an already-running server on localhost:8000.
https://docs.nvidia.com/nemoclaw/latest/inference/inference-options.html#provider-status

Actual:

Expected:
The Provider Status docs should distinguish the two paths: already-running vLLM appears when detected on localhost:8000; managed vLLM install/start requires NEMOCLAW_EXPERIMENTAL=1 or explicit NEMOCLAW_PROVIDER=install-vllm.In v0.0.44, an already-running vLLM server is offered in the onboarding provider list without requiring NEMOCLAW_EXPERIMENTAL=1.
The experimental flag, or NEMOCLAW_PROVIDER=install-vllm, is only required for the managed install/start path that pulls images and starts a container.

Reference of code logic:
Here’s the v0.0.44 logic.

EXPERIMENTAL is only true when NEMOCLAW_EXPERIMENTAL=1:

const EXPERIMENTAL = process.env.NEMOCLAW_EXPERIMENTAL ===  1 ;

Already-running vLLM is offered regardless of EXPERIMENTAL:

const explicitProvider = (process.env.NEMOCLAW_PROVIDER ||   ).trim().toLowerCase();
const userChoseManagedVllm = explicitProvider ===  install-vllm ;
if (vllmRunning) {
  options.push({
    key:  vllm ,
    label: `Local vLLM [experimental] (localhost:${VLLM_PORT}) — running (suggested)`,
  });
} else if (vllmProfile && (userChoseManagedVllm || EXPERIMENTAL)) {
  const verb = hasVllmImage ?  Start  :  Install ;
  options.push({ key:  install-vllm , label: `${verb} vLLM (${vllmProfile.name})` });
}

Managed install/start is only offered when vLLM is not running and either NEMOCLAW_PROVIDER=install-vllm or NEMOCLAW_EXPERIMENTAL=1 is set:

} else if (vllmProfile && (userChoseManagedVllm || EXPERIMENTAL)) {
  const verb = hasVllmImage ?  Start  :  Install ;
  options.push({ key:  install-vllm , label: `${verb} vLLM (${vllmProfile.name})` });
}

If the user selects managed install/start, it calls installVllm(...), then intentionally falls through to the existing-running vLLM path:

} else if (selected.key ===  install-vllm ) {
  // ...
  const result = await installVllm(vllmProfile, {
    hasImage: hasVllmImage,
    nonInteractive: isNonInteractive(),
    promptFn: prompt,
  });
  // ...
  selected = { key:  vllm , label: `Local vLLM (localhost:${VLLM_PORT}) — running` };
}
if (selected.key ===  vllm ) {
  console.log(`  ✓ Using existing vLLM on localhost:${VLLM_PORT}`);
  provider =  vllm-local ;
  // queries /v1/models, validates endpoint, then forces chat completions
}

Bug Details

Field Value
Priority Unprioritized
Action Dev - Open - To fix
Disposition Open issue
Module Machine Learning - NemoClaw
Keyword NemoClaw, NemoClaw_Docs, NEMOCLAW_GH_SYNC_APPROVAL

[NVB#6186688]

Metadata

Metadata

Assignees

Labels

NV QABugs found by the NVIDIA QA Teamarea: docsDocumentation, examples, guides, or docs build

Type

No fields configured for Bug.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions