Comparing changes

base repository: NVIDIA-NeMo/DataDesigner
base: v0.5.8
head repository: NVIDIA-NeMo/DataDesigner
compare: v0.5.9
  • 5 commits
  • 38 files changed
  • 2 contributors

Commits on Apr 28, 2026

  1. fix: normalize image_url blocks to OpenAI-compliant dict format (#577)

    * fix: normalize image_url blocks to OpenAI-compliant dict format (#576)
    
    ImageContext.get_contexts() produced bare-string and non-standard dict
    shapes for image_url content blocks, which broke the native OpenAI
    adapter (passes blocks through as-is) and only worked with Anthropic
    by accident via defensive handling in the translation layer.
    
    - Wrap all image_url values in {"url": ...} dict (OpenAI spec)
    - Remove non-standard "format" key from base64 dicts
    - Tighten Anthropic translate_image_url_block to require dict input
    
    Fixes #576
    
    Made-with: Cursor
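
The normalization described in this commit can be sketched roughly as follows. This is a minimal illustration and not the repository's code: `normalize_image_url_block` is a hypothetical helper, and the input shapes it accepts (a bare string, or a dict carrying an extra "format" key) are assumptions drawn from the commit message.

```python
from typing import Any


def normalize_image_url_block(block: dict[str, Any]) -> dict[str, Any]:
    """Return an image_url content block in the OpenAI dict shape.

    Hypothetical helper: wraps a bare string in {"url": ...} and strips
    the non-standard "format" key from dict values.
    """
    value = block["image_url"]
    if isinstance(value, str):
        # Bare URL or bare data URI: wrap it in the spec-compliant dict.
        cleaned = {"url": value}
    elif isinstance(value, dict) and isinstance(value.get("url"), str):
        # Already a dict: keep everything except the non-standard key.
        cleaned = {k: v for k, v in value.items() if k != "format"}
    else:
        raise TypeError(f"unsupported image_url value: {type(value).__name__}")
    return {"type": "image_url", "image_url": cleaned}
```

Both a plain URL and a base64 data URI end up under the same "url" key, so an adapter that passes blocks through unchanged only ever sees one shape.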
    
    * fix: reject malformed image_url blocks instead of silently dropping them
    
    translate_image_url_block now raises TypeError when image_url is not a
    dict. Since all image_url blocks are constructed internally, a bare
    string indicates an internal bug and should fail loudly.
    
    Made-with: Cursor
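
A fail-loud translation of this kind might look like the sketch below. It is an assumption-laden illustration, not the project's `translate_image_url_block`: the function name carries a `_sketch` suffix to mark it as hypothetical, the data-URI regex is illustrative, and the output uses the Anthropic image block shapes with a "base64" or "url" source.

```python
import re
from typing import Any

# Illustrative pattern for "data:<media type>;base64,<payload>" URIs.
_DATA_URI = re.compile(
    r"^data:(?P<media_type>[\w.+-]+/[\w.+-]+);base64,(?P<data>.+)$", re.DOTALL
)


def translate_image_url_block_sketch(block: dict[str, Any]) -> dict[str, Any]:
    """Translate an OpenAI-style image_url block to an Anthropic image block.

    Hypothetical sketch: raises TypeError when image_url is not a dict,
    instead of silently dropping the block.
    """
    image_url = block.get("image_url")
    if not isinstance(image_url, dict):
        raise TypeError(
            f"image_url must be a dict like {{'url': ...}}, "
            f"got {type(image_url).__name__}"
        )
    url = image_url["url"]
    match = _DATA_URI.match(url)
    if match:
        # Data URI: split into media type and base64 payload.
        return {
            "type": "image",
            "source": {
                "type": "base64",
                "media_type": match["media_type"],
                "data": match["data"],
            },
        }
    # Anything else is treated as a remote URL.
    return {"type": "image", "source": {"type": "url", "url": url}}
```

Raising here turns a malformed internal block into an immediate, traceable error rather than a request that is quietly missing its image.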
    
    * address review: tighten return type, add OpenAI + data-URI tests
    
    - Narrow _auto_resolve_context_value return type to dict[str, str]
    - Add OpenAI-client regression tests for image_url dict passthrough
    - Cover both bare-URL and bare-data-URI rejection in Anthropic tests
    
    Made-with: Cursor
    nabinchha authored Apr 28, 2026
    05c2e8d
  2. docs: add VLM long-document understanding dev note and recipes (#579)

    * Add resources for long-document-understanding-dev-note
    
    * added links
    nabinchha authored Apr 28, 2026
    7c5a722
  3. e92c76b
  4. ci: raise agent audit turn limit and preserve logs (#571)

    * ci: raise agent audit turn limit and preserve logs
    
    The Friday test-health audit hit the 30-turn cap on its first-ever run
    (2026-04-24) and the agent log was discarded with the self-hosted
    runner. Heavier recipes need more room, and the next failure should be
    diagnosable.
    
    - Raise --max-turns from 30 to 50
    - Switch --output-format from text to stream-json so events are emitted
      during the run instead of only at process exit; prefix with
      stdbuf -oL -eL to line-buffer the pipe
    - Upload /tmp/claude-audit-log.txt and /tmp/audit-<suite>.md as an
      artifact (if: always(), 14-day retention) using the upload-artifact
      SHA already pinned in build-notebooks.yml
    
    Signed-off-by: Andre Manoel <amanoel@nvidia.com>
    
    * ci: disambiguate audit artifact name across run attempts
    
    actions/upload-artifact@v4+ rejects duplicate names within a workflow,
    and re-running a failed run reuses the same github.run_id. Append
    github.run_attempt so re-runs upload successfully instead of failing at
    the exact moment the artifact is most useful.
    
    Found by Codex review of #571.
    
    Signed-off-by: Andre Manoel <amanoel@nvidia.com>
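
The naming idea can be illustrated outside the workflow file with a short sketch. It assumes the default GitHub Actions environment variables `GITHUB_RUN_ID` and `GITHUB_RUN_ATTEMPT` (the environment counterparts of `github.run_id` and `github.run_attempt`). The `agent-audit-` prefix, the helper name, and the fallback values are hypothetical and not taken from the actual workflow.

```python
import os
from typing import Mapping, Optional


def audit_artifact_name(suite: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Build an artifact name that stays unique across re-runs.

    Hypothetical helper: the run ID is shared by every attempt of a
    workflow run, so the attempt number is appended to disambiguate.
    """
    env = os.environ if env is None else env
    run_id = env.get("GITHUB_RUN_ID", "local")       # same for all attempts
    run_attempt = env.get("GITHUB_RUN_ATTEMPT", "1")  # increments on re-run
    return f"agent-audit-{suite}-{run_id}-{run_attempt}"
```

With the attempt number in the name, a second attempt of the same run produces a different artifact name than the first, so the upload does not collide.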
    
    * ci: only upload agent log on failure
    
    Raise the bar for persisting the full verbose stream-json event log:
    we only need it when we're actually debugging a failure, and the audit
    report itself still lands in the step summary on success. Shrinks the
    window where tool inputs, read file contents, or other verbose-stream
    detail could end up in a 14-day artifact.
    
    Addresses the minor privacy finding from Codex review of #571.
    
    Signed-off-by: Andre Manoel <amanoel@nvidia.com>
    
    * ci: drop raw agent log from job summary
    
    With --output-format stream-json the previous tail -100 of the agent
    log emitted raw NDJSON into the GH Actions UI summary, which is
    unreadable. The audit report itself (/tmp/audit-<suite>.md) already
    carries the human-readable payload, and the full event stream is
    available as an on-failure artifact, so the raw tail was redundant and
    worse than nothing for the summary surface.
    
    Also rewords the fallback message to point at the artifact when no
    report lands (typically a failure).
    
    Signed-off-by: Andre Manoel <amanoel@nvidia.com>
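
For anyone who later downloads the on-failure artifact, a raw NDJSON stream can be condensed into something readable with a few lines of Python. The sketch below is not part of the workflow; it assumes each line is a JSON object with a "type" field, which is an assumption about the event schema, and `summarize_ndjson` is a hypothetical helper.

```python
import json
from collections import Counter
from typing import Iterable


def summarize_ndjson(lines: Iterable[str]) -> tuple[Counter, int]:
    """Count events per "type" in an NDJSON stream.

    Returns (counts, skipped), where skipped is the number of non-empty
    lines that were not JSON objects (for example, stray log output).
    """
    counts: Counter = Counter()
    skipped = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            skipped += 1
            continue
        if isinstance(event, dict):
            counts[str(event.get("type", "unknown"))] += 1
        else:
            skipped += 1
    return counts, skipped
```

A per-type count is usually enough to see whether a run ended with a final result event or stopped partway through.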
    
    ---------
    
    Signed-off-by: Andre Manoel <amanoel@nvidia.com>
    andreatgretel authored Apr 28, 2026
    482ab5a
  5. feat: update default nvidia-vision model to nemotron-3-nano-omni (#583)

    * update nvidia-vision
    
    * update temp
    nabinchha authored Apr 28, 2026
    93ae875