feat: evaluation implement by yuki-97 · Pull Request #16 · NVIDIA-NeMo/RL

yuki-97 · 2025-03-21T05:13:04Z

What does this PR do?

Add a one line overview of what this PR aims to accomplish.

Changelog

Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation? Make sure to also update the NeMo Framework User Guide which contains the tutorials

Checklist when contributing

TBD

Additional Information

#61

Signed-off-by: Yuki Huang <yukih@nvidia.com>

Signed-off-by: Yuki Huang <yukih@nvidia.com> Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

…t API

Address easy-batch items from PR NVIDIA-NeMo#2508 review.

- Move sinkhorn_one_dim / apply_canonicalization_if_enabled / clean_model_name_for_filename / project_token_likelihoods to nemo_rl/utils/x_token/_shared.py and import from there in the four projection-prep CLIs (#5).
- Drop unused sinkhorn (10-iter), debug_projection_map, and generate_projection_map from minimal_projection_generator.py (NVIDIA-NeMo#8).
- Move minimal_projection_generator.py's CLI parsing under `if __name__ == "__main__":` so the module is importable for the P1 dedup harness.
- Rename seq1 / seq2 (and the derivative s1_*/s2_*/used_seq*/joined_seq*/seg* names) to student_tokens / teacher_tokens throughout TokenAligner._align_single / _align_with_anchors / _align_dp; all call sites are internal (NVIDIA-NeMo#11).
- Replace **kwargs in TokenAligner._align_with_anchors with the five explicit keyword-only scoring knobs already declared on _align_dp, and pass them through named at the _align_single call site (NVIDIA-NeMo#12).
- Remove dead TokenAligner.load_projection_matrix + the private _load_projection_components plus the now-orphaned self._projection_* attributes; the live projection-load path is nemo_rl/algorithms/x_token/utils.py::{get_sparse_projection_matrix, get_topk_projection} added in 6336464 (NVIDIA-NeMo#13, NVIDIA-NeMo#14).
- git mv nemo_rl/algorithms/x_token/tokenalign.py -> token_aligner.py and update import sites in __init__.py, data/cross_tokenizer_collate.py, the four utils.x_token CLIs, and the xtoken-distillation guide; update Sphinx :mod: references in algorithms/x_token/utils.py (NVIDIA-NeMo#16).

Incidental: the CLIs previously called TokenAligner._canonical_token, which was never a class attribute (the helper is module-level); the new shared helper imports _canonical_token directly so the use_canonicalization branch isn't broken at runtime.

Signed-off-by: Adithya Hanasoge <avenkateshha@nvidia.com>
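The `**kwargs` replacement described in the commit message above can be sketched roughly as follows. This is a minimal illustration and not the actual `TokenAligner` code: the function bodies, the toy scoring scheme, and the knob names `match_score`, `mismatch_penalty`, and `gap_penalty` are all hypothetical, and only three knobs are shown rather than the five the commit mentions.

```python
def align_dp(student_tokens, teacher_tokens, *,
             match_score=1.0, mismatch_penalty=-1.0, gap_penalty=-0.5):
    """Toy global-alignment score, standing in for the real DP routine."""
    n, m = len(student_tokens), len(teacher_tokens)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap_penalty
    for j in range(1, m + 1):
        score[0][j] = j * gap_penalty
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = student_tokens[i - 1] == teacher_tokens[j - 1]
            diag = score[i - 1][j - 1] + (match_score if same else mismatch_penalty)
            score[i][j] = max(diag,
                              score[i - 1][j] + gap_penalty,
                              score[i][j - 1] + gap_penalty)
    return score[n][m]


def align_with_anchors(student_tokens, teacher_tokens, *,
                       match_score=1.0, mismatch_penalty=-1.0, gap_penalty=-0.5):
    """Hypothetical wrapper: explicit keyword-only knobs instead of **kwargs.

    A misspelled knob now raises TypeError at this call boundary instead of
    being forwarded (or silently dropped) further down.
    """
    return align_dp(
        student_tokens,
        teacher_tokens,
        match_score=match_score,
        mismatch_penalty=mismatch_penalty,
        gap_penalty=gap_penalty,
    )
```

With this shape, a call such as `align_with_anchors(s, t, gap_penalt=0.0)` fails immediately with a `TypeError`, which is the main practical benefit of naming the knobs explicitly.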

github-actions Bot added the Documentation Improvements or additions to documentation label Mar 21, 2025

yuki-97 requested review from SahilJain314, parthchadha and terrykong March 21, 2025 05:15

yuki-97 force-pushed the yukih/eval branch 6 times, most recently from 87f6419 to bbda80e Compare March 24, 2025 03:54

parthchadha reviewed Mar 24, 2025

Comment thread examples/configs/eval.yaml Outdated

Comment thread examples/run_grpo_math.py Outdated

yuki-97 force-pushed the yukih/eval branch 2 times, most recently from 4033c1a to 27b417d Compare March 25, 2025 03:22

terrykong reviewed Mar 25, 2025

Comment thread examples/run_grpo_math.py

Comment thread examples/run_grpo_math.py Outdated

Comment thread examples/run_sft.py

yuki-97 force-pushed the yukih/eval branch 3 times, most recently from c0a72bf to be87a25 Compare March 25, 2025 08:15

parthchadha reviewed Mar 25, 2025

Comment thread examples/configs/eval.yaml Outdated

Comment thread nemo_reinforcer/data/datasets.py Outdated

yuki-97 force-pushed the yukih/eval branch from ccfef59 to 8831f18 Compare March 25, 2025 15:38

parthchadha previously approved these changes Mar 25, 2025

parthchadha reviewed Mar 25, 2025

Comment thread docs/guides/eval.md

terrykong requested changes Mar 25, 2025

Comment thread nemo_reinforcer/models/generation/vllm.py Outdated

SahilJain314 reviewed Mar 25, 2025

Comment thread nemo_reinforcer/data/__init__.py

Comment thread nemo_reinforcer/data/llm_message_utils.py Outdated

terrykong previously approved these changes Mar 26, 2025

yuki-97 added 5 commits March 26, 2025 06:07

add evaluation implement and minimal doc

976f20e

Signed-off-by: Yuki Huang <yukih@nvidia.com>

add remap_problem_solution

9c1be17

Signed-off-by: Yuki Huang <yukih@nvidia.com>

set load_format=auto to avoid dummy loading in eval

5399426

Signed-off-by: Yuki Huang <yukih@nvidia.com>

remove useless keys; move prompt process outside

3e2321f

Signed-off-by: Yuki Huang <yukih@nvidia.com>

add MathDataConfig; add remap_dataset_keys; fix rebase

b60c94d

Signed-off-by: Yuki Huang <yukih@nvidia.com>

yuki-97 dismissed terrykong’s stale review via e3b23a5 March 26, 2025 06:58

remove setting pad_token_id

f0a5b54

Signed-off-by: Yuki Huang <yukih@nvidia.com>

yuki-97 force-pushed the yukih/eval branch from 4c26878 to f0a5b54 Compare March 26, 2025 14:54

yuki-97 added Run CICD and removed Run CICD labels Mar 26, 2025

remove duplicated keys in MathDataConfig

af39289

Signed-off-by: Yuki Huang <yukih@nvidia.com>

yuki-97 added Run CICD and removed Run CICD labels Mar 26, 2025

parthchadha approved these changes Mar 26, 2025

Merge branch 'main' into yukih/eval

0b4b103

parthchadha added Run CICD and removed Run CICD labels Mar 26, 2025

Merge branch 'main' into yukih/eval

fe93ba4

SahilJain314 approved these changes Mar 26, 2025

parthchadha enabled auto-merge (squash) March 26, 2025 23:53

parthchadha disabled auto-merge March 26, 2025 23:53

parthchadha enabled auto-merge (squash) March 26, 2025 23:53

Merge branch 'main' into yukih/eval

1c36f25

parthchadha added Run CICD and removed Run CICD labels Mar 27, 2025

parthchadha merged commit 8416915 into main Mar 27, 2025

parthchadha deleted the yukih/eval branch March 27, 2025 01:00

yuki-97 linked an issue Apr 1, 2025 that may be closed by this pull request

Eval Script #61

Closed

yfw pushed a commit that referenced this pull request Apr 2, 2025

feat: evaluation implement (#16)

4eb1d6d

Signed-off-by: Yuki Huang <yukih@nvidia.com> Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>

KiddoZhu pushed a commit that referenced this pull request May 6, 2025

feat: evaluation implement (#16)

6ce2ba6

Signed-off-by: Yuki Huang <yukih@nvidia.com>

feat: evaluation implement #16

parthchadha merged 11 commits into main from yukih/eval

4 participants
