feat: support multiple images per column in image context by nabinchha · Pull Request #257 · NVIDIA-NeMo/DataDesigner

nabinchha · 2026-01-28T19:43:40Z

📋 Summary

Adds support for handling multiple images per column in image context configurations. Previously, each image context column could only reference a single image. This change enables columns to contain multiple images as a list or JSON-serialized array.

🔄 Changes

✨ Added

Support for list-of-strings and JSON-serialized list input formats in ImageContext.get_contexts() (models.py)
Comprehensive test coverage for various input formats: single string, list, JSON-serialized list, invalid JSON, and empty list (test_models.py)

🔧 Changed

Breaking API change: Renamed ModalityContext.get_context() → get_contexts() to return list instead of single dict (models.py) though this is mostly internal and not user facing.
Updated ImageContext.get_contexts() to parse and normalize various input formats to list of contexts (models.py)
Modified LLM completion generator to flatten multiple contexts using extend() instead of append() (llm_completion.py)
Reordered message construction to pack multimodal context before user text messages for better model processing (utils.py - commit 998d880)

🔍 Attention Areas

⚠️ Reviewers: Please pay special attention to the following:

packages/data-designer-config/src/data_designer/config/models.py - Breaking API change: get_context() → get_contexts() with list return type
The JSON parsing logic handles edge cases (non-list JSON objects, invalid JSON strings) by treating them as single values
Message ordering change (multimodal context now precedes text). This is based on feedback from folks working on VLM training.

🤖 Generated with AI

johnnygreco · 2026-01-28T19:55:56Z

+        {
+            "type": "image_url",
+            "image_url": {"url": "data:image/png;base64,image1base64", "format": "png"},
+        },
+        {
+            "type": "image_url",
+            "image_url": {"url": "data:image/png;base64,image2base64", "format": "png"},
+        },
+        {


maybe a dumb question, but is there any case in which you'd like to identify a particular context out of the group of contexts? Here I guess you have index but then you need to know the order.

Yes, you'd need to call out by the order... like "what are the difference between the scene in the 1st and the 3rd picture?"

eric-tramel

lgtm

nabinchha · 2026-01-28T20:24:57Z

I'll merge this shortly once I verify a test workflow I have looks gtg.

…texts-per-column

nabinchha added 2 commits January 28, 2026 12:31

allow image context column to have multiple images

3e16b02

pack multimodal context at the front before user text messages

998d880

nabinchha requested a review from a team as a code owner January 28, 2026 19:43

nabinchha requested a review from eric-tramel January 28, 2026 19:43

johnnygreco reviewed Jan 28, 2026

View reviewed changes

johnnygreco previously approved these changes Jan 28, 2026

View reviewed changes

eric-tramel previously approved these changes Jan 28, 2026

View reviewed changes

Fix edge case with numpy array

8501781

nabinchha dismissed stale reviews from eric-tramel and johnnygreco via 8501781 January 28, 2026 23:33

nabinchha requested review from eric-tramel and johnnygreco January 28, 2026 23:34

Merge branch 'main' into nmulepati/feat/255-handle-multiple-image-con…

b8a0b83

…texts-per-column

johnnygreco approved these changes Jan 28, 2026

View reviewed changes

nabinchha merged commit 3d86a38 into main Jan 28, 2026
46 checks passed

nabinchha deleted the nmulepati/feat/255-handle-multiple-image-contexts-per-column branch January 28, 2026 23:45

nabinchha mentioned this pull request Jan 29, 2026

Support including multiple images per column in the same llm generation with multi-modal context. #255

Closed

github-actions Bot mentioned this pull request Apr 28, 2026

docs: add VLM long-document understanding dev note and recipes #579

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support multiple images per column in image context#257

feat: support multiple images per column in image context#257
nabinchha merged 4 commits into
mainfrom
nmulepati/feat/255-handle-multiple-image-contexts-per-column

nabinchha commented Jan 28, 2026 •

edited

Loading

Uh oh!

johnnygreco Jan 28, 2026

Uh oh!

nabinchha Jan 28, 2026

Uh oh!

eric-tramel left a comment

Uh oh!

nabinchha commented Jan 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

nabinchha commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📋 Summary

🔄 Changes

✨ Added

🔧 Changed

🔍 Attention Areas

Uh oh!

johnnygreco Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

nabinchha Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

eric-tramel left a comment

Choose a reason for hiding this comment

Uh oh!

nabinchha commented Jan 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

nabinchha commented Jan 28, 2026 •

edited

Loading