Skip to content

[docs] refactor: Organize model docs by family#3908

Merged
yaoyu-33 merged 6 commits into
mainfrom
docs/model-collection-brand-layout
May 22, 2026
Merged

[docs] refactor: Organize model docs by family#3908
yaoyu-33 merged 6 commits into
mainfrom
docs/model-collection-brand-layout

Conversation

@yaoyu-33

@yaoyu-33 yaoyu-33 commented May 20, 2026

Copy link
Copy Markdown
Contributor

Summary

  • Reorganized docs model pages from docs/models/llm and docs/models/vlm into docs/models// paths.
  • Replaced README and docs navigation category buckets with family-organized model tables and family index pages.
  • Updated cross-links from docs, examples, skills, and model comments to the new paths.

Validation

  • git diff --check
  • git diff --cached --check
  • git grep -n "docs/models/llm\|docs/models/vlm\|models/llm\|models/vlm\|Large Language Models\|Vision Language Models\|Omni Models"
    • Only remaining hits are performance or paper-title prose outside the model collection.
  • git grep -n -i "brand" -- README.md docs examples skills src
    • Only remaining hit is the FontAwesome icon class fa-brands fa-github.
  • uv run --no-sync pre-commit run --all-files

Unit tests were not run per request; this is a docs/navigation refactor.

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@copy-pr-bot

copy-pr-bot Bot commented May 20, 2026

Copy link
Copy Markdown

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test 0964c9c

@claude

claude Bot commented May 20, 2026

Copy link
Copy Markdown
Contributor

Review - Clean docs-only refactor. All old docs/models/llm/ and docs/models/vlm/ references have been swept. No stale references remain in .md, .py, or .yaml files. Cross-links between model pages, examples, and skills are updated correctly. The Sphinx toctree in docs/index.md now lists brand index pages directly, which is consistent. Minor issue - README.md line 16 - The Nemotron-3 Nano Omni news entry text still says available on the nemotron_3_omni branch but the examples README link now points to main. Since the examples directory exists on main, the link is correct, but the surrounding prose is inconsistent. Consider updating the text to say the model is now available on main. This may be a pre-existing issue that this PR surfaced by fixing the link. Suggested test cases - No perf tests impacted.

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test e0ea428

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@yaoyu-33 yaoyu-33 changed the title [docs] refactor: Organize model docs by brand [docs] refactor: Organize model docs by family May 20, 2026
@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test 990003a

@yaoyu-33 yaoyu-33 added area:model Model implementations and HF bridge logic docs Documentation-only updates or documentation debt needs-review PR is ready for code review and waiting on a reviewer labels May 20, 2026
@yaoyu-33 yaoyu-33 added the docs-only With great power comes great responsibility. label May 21, 2026
Comment thread docs/models/ministral/index.md Outdated

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mistral and ministral is arguably the same family

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Folded Ministral into the Mistral docs family: moved the Ministral 3 guide under docs/models/mistral/, removed the separate Ministral family index/toctree entry, and updated docs/example links so Ministral 3 remains discoverable under Mistral.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Follow-up addressed: moved the Ministral 3 examples from examples/models/ministral/ministral3 to examples/models/mistral/ministral3, removed the empty examples/models/ministral directory, and updated the README/docs/skill references to the new Mistral-family examples path.

Comment thread README.md Outdated
| **Nemotron** | [Nemotron H](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotronh), [Nemotron Nano v2](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotronh), [Nemotron-3 Nano](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotronh), [Nemotron-3 Super](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotronh), [Llama Nemotron](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/llama_nemotron), [Nemotron Nano v2 VL](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotron_vl), [Nemotron-3 Nano Omni](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/nemotron_omni) | [Nemotron H recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotronh/nemotronh.py), [Nemotron Nano v2 recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotronh/nemotron_nano_v2.py), [Nemotron-3 Nano recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotronh/nemotron_3_nano.py), [Nemotron-3 Super recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotronh/nemotron_3_super.py), [Nemotron Nano v2 VL recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotron_vl/nemotron_nano_v2_vl.py), [Nemotron-3 Nano Omni recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/nemotron_omni/nemotron_omni.py) |
| **OLMoE** | [OLMoE](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/olmoe) | [recipes (7B)](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/olmoe/olmoe_7b.py) |
| **Qwen** | [Qwen2 / Qwen2.5](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen), [Qwen3](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen), [Qwen3-MoE](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen), [Qwen3 Next](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen), [Qwen2.5-VL](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_vl), [Qwen3-VL](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_vl), [Qwen3.5-VL](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_vl), [Qwen3.6-VL](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_vl), [Qwen2 Audio](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_audio), [Qwen2.5-Omni](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_omni), [Qwen3-Omni](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen_omni), [Qwen3-ASR](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/qwen3_asr) | [Qwen2 recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/qwen/qwen2.py), [Qwen3 recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/qwen/qwen3.py), [Qwen3-MoE recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/qwen/qwen3_moe.py), [Qwen3 Next recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/qwen/qwen3_next.py), [Qwen VL recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/recipes/qwen_vl), [Qwen Omni recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/recipes/qwen_omni), [Qwen examples](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/examples/models/qwen) |
| **Sarvam** | [Sarvam](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/sarvam) | [examples](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/examples/models/sarvam) |

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we link to the readme, if it exists? otherwise this table is a bit busy

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Simplified the README supported-models table: variants are now plain text, and the resources column links to family docs plus README entry points where available instead of long per-recipe link lists. Also updated the Nemotron-3 Nano Omni news item to say the recipes are available on main.

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test dd91f2b

@yaoyu-33 yaoyu-33 removed the docs-only With great power comes great responsibility. label May 21, 2026
Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test 4afcac2

Comment thread README.md Outdated
| **Mamba** | Mamba | [model bridge](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/mamba) |
| **MiniMax** | MiniMax-M2 / M2.5 / M2.7 | [examples README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/minimax/minimax_m2/README.md) |
| **Mistral** | Mistral, Ministral 3 (3B/8B/14B) | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/mistral/index.md), [Ministral 3 examples README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/mistral/ministral3/README.md) |
| **MiMo** | MiMo | [Megatron-MiMo training examples](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/examples/megatron_mimo) |

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not megatron-mimo, xiaomi-mimo

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated the README branding to Xiaomi-MiMo; the supported-models row now links to the new Xiaomi-MiMo docs page.

Comment thread README.md Outdated
| **GPT-OSS** | GPT-oss | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/gpt_oss/index.md), [examples README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/gpt_oss/README.md) |
| **Kimi** | Kimi K2, Kimi-K2.5-VL | [Kimi-K2.5-VL examples README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/kimi/kimi_k25_vl/README.md) |
| **Llama** | Llama 2, Llama 3 / 3.1 / 3.2 / 3.3 | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/llama/index.md), [recipes](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/recipes/llama) |
| **Mamba** | Mamba | [model bridge](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/src/megatron/bridge/models/mamba) |

@yaoyu-33 yaoyu-33 May 22, 2026

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove mamba

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed Mamba from the README supported-models table.

Comment thread README.md Outdated
| **Nemotron** | Nemotron H, Nemotron Nano v2, Nemotron-3 Nano, Nemotron-3 Super, Llama Nemotron, Nemotron Nano v2 VL, Nemotron-3 Nano Omni | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/nemotron/index.md), [Nemotron-3 README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/nemotron/nemotron_3/README.md), [Nemotron-3 Omni README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/nemotron/nemotron_3_omni/README.md) |
| **OLMoE** | OLMoE | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/olmoe/index.md), [recipe](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/src/megatron/bridge/recipes/olmoe/olmoe_7b.py) |
| **Qwen** | Qwen2 / Qwen2.5, Qwen3, Qwen3-MoE, Qwen3 Next, Qwen2.5-VL, Qwen3-VL, Qwen3.5-VL, Qwen3.6-VL, Qwen2 Audio, Qwen2.5-Omni, Qwen3-Omni, Qwen3-ASR | [model docs](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/qwen/index.md), [examples directory](https://github.com/NVIDIA-NeMo/Megatron-Bridge/tree/main/examples/models/qwen) |
| **Sarvam** | Sarvam | [examples README](https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/examples/models/sarvam/README.md) |

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we dont need last column. for hyperlink, just hyperlink the model doc hyperlink to model name

https://github.com/NVIDIA-NeMo/Megatron-Bridge/blob/main/docs/models/qwen/index.md to qwen.

If there is missing doc page for that model, please add in docs folder. Link the example directory in that model doc

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
@yaoyu-33

Copy link
Copy Markdown
Contributor Author

/ok to test b9619df

@yaoyu-33 yaoyu-33 added the docs-only With great power comes great responsibility. label May 22, 2026
@yaoyu-33 yaoyu-33 removed the docs-only With great power comes great responsibility. label May 22, 2026
@yaoyu-33 yaoyu-33 merged commit f19d8a9 into main May 22, 2026
101 checks passed
@yaoyu-33 yaoyu-33 deleted the docs/model-collection-brand-layout branch May 22, 2026 04:38
vasunvidia pushed a commit to vasunvidia/Megatron-Bridge that referenced this pull request Jun 10, 2026
Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
Signed-off-by: Vasudevan Rengasamy <vrengasamy@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:model Model implementations and HF bridge logic docs Documentation-only updates or documentation debt needs-review PR is ready for code review and waiting on a reviewer

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants