fix: disable mixed precision training until #13 is resolved by parthchadha · Pull Request #14 · NVIDIA-NeMo/RL

parthchadha · 2025-03-21T04:21:07Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation? Make sure to also update the NeMo Framework User Guide which contains the tutorials

Checklist when contributing

TBD

Additional Information

Related to # (issue)

Signed-off-by: Parth Chadha <pchadha@nvidia.com>

parthchadha · 2025-03-21T04:43:33Z

CI results from local run:

==== 105 passed, 1 skipped, 2 warnings in 332.84s (0:05:32) ====

Signed-off-by: Parth Chadha <pchadha@nvidia.com>

…t API

Address easy-batch items from PR NVIDIA-NeMo#2508 review.

- Move sinkhorn_one_dim / apply_canonicalization_if_enabled / clean_model_name_for_filename / project_token_likelihoods to nemo_rl/utils/x_token/_shared.py and import from there in the four projection-prep CLIs (#5).
- Drop unused sinkhorn (10-iter), debug_projection_map, and generate_projection_map from minimal_projection_generator.py (NVIDIA-NeMo#8).
- Move minimal_projection_generator.py's CLI parsing under `if __name__ == "__main__":` so the module is importable for the P1 dedup harness.
- Rename seq1 / seq2 (and the derivative s1_*/s2_*/used_seq*/joined_seq*/seg* names) to student_tokens / teacher_tokens throughout TokenAligner._align_single / _align_with_anchors / _align_dp; all call sites are internal (NVIDIA-NeMo#11).
- Replace **kwargs in TokenAligner._align_with_anchors with the five explicit keyword-only scoring knobs already declared on _align_dp, and pass them through named at the _align_single call site (NVIDIA-NeMo#12).
- Remove dead TokenAligner.load_projection_matrix + the private _load_projection_components plus the now-orphaned self._projection_* attributes; the live projection-load path is nemo_rl/algorithms/x_token/utils.py::{get_sparse_projection_matrix, get_topk_projection} added in 6336464 (NVIDIA-NeMo#13, NVIDIA-NeMo#14).
- git mv nemo_rl/algorithms/x_token/tokenalign.py -> token_aligner.py and update import sites in __init__.py, data/cross_tokenizer_collate.py, the four utils.x_token CLIs, and the xtoken-distillation guide; update Sphinx :mod: references in algorithms/x_token/utils.py (NVIDIA-NeMo#16).

Incidental: the CLIs previously called TokenAligner._canonical_token, which was never a class attribute (the helper is module-level); the new shared helper imports _canonical_token directly so the use_canonicalization branch isn't broken at runtime.

Signed-off-by: Adithya Hanasoge <avenkateshha@nvidia.com>
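
For readers unfamiliar with the `**kwargs` to keyword-only change mentioned in the commit message above, here is a minimal sketch of the general pattern. The function name `score_alignment` and the knobs `match_score` and `mismatch_penalty` are hypothetical, chosen only for illustration; they are not the actual TokenAligner signature or its five scoring knobs, which this page does not list.

```python
def score_alignment(student_tokens, teacher_tokens, *, match_score=1.0, mismatch_penalty=-1.0):
    """Toy position-by-position scorer (illustrative only, not TokenAligner).

    The bare `*` makes every parameter after it keyword-only, so a
    misspelled or positional knob raises TypeError instead of being
    silently absorbed the way **kwargs would absorb it.
    """
    total = 0.0
    for student, teacher in zip(student_tokens, teacher_tokens):
        # Reward equal tokens, penalize differing ones.
        total += match_score if student == teacher else mismatch_penalty
    return total


def misspelled_knob_is_rejected():
    """Return True if a typo in a knob name is caught at call time."""
    try:
        score_alignment(["a"], ["a"], match_scor=2.0)  # typo on purpose
    except TypeError:
        return True
    return False
```

With explicit keyword-only parameters, call sites must name each knob, which is what "pass them through named" in the commit message appears to refer to.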

fix: disable mixed precision training until #13 is resolved

6c580ca

Signed-off-by: Parth Chadha <pchadha@nvidia.com>

parthchadha force-pushed the pchadha/disable-mp branch from 50b4fc2 to 6c580ca March 21, 2025 04:22

parthchadha requested review from SahilJain314 and terrykong March 21, 2025 04:22

parthchadha commented Mar 21, 2025

Comment thread nemo_reinforcer/models/policy/hf_policy.py

terrykong approved these changes Mar 21, 2025

SahilJain314 approved these changes Mar 21, 2025

Merge branch 'main' into pchadha/disable-mp

ec39421

parthchadha merged commit 40f2292 into main Mar 21, 2025

parthchadha deleted the pchadha/disable-mp branch March 21, 2025 05:38

KiddoZhu pushed a commit that referenced this pull request May 6, 2025

fix: disable mixed precision training until #13 is resolved (#14)

dec2436

Signed-off-by: Parth Chadha <pchadha@nvidia.com>

parthchadha merged 2 commits into main from pchadha/disable-mp
