Integrate SGLang into OpenRLHF. Non-Hybrid Engine Only #661

Open
zhaochenyang20 wants to merge 19 commits into OpenRLHF:main from zhaochenyang20:dev_pr

Conversation

@zhaochenyang20

This PR supports SGLang as an inference engine for the PPO actor and makes relevant changes. Detailed usage can be found in:

https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/rlhf/OpenRLHF/openrlhf-sglang.md


@fzyzcjy left a comment


Since @zhaochenyang20 asked me on Slack to review this, I very briefly glanced at the code (I just picked a random file, experience_maker.py) but was too tired to continue today.

Anyway, there is a small nit below, and also a small proposal, sgl-project/sglang#2818, that will hopefully be a little helpful.

all_outputs = sum(ray.get(all_output_refs), [])
assert len(all_outputs) == len(all_prompts)
pad_token_id, eos_token_id = self.tokenizer.pad_token_id, self.tokenizer.eos_token_id
try:

@fzyzcjy Jan 9, 2025


nit: it may be a bit better not to use try-except here, because:

  • if SGLang's or vLLM's output format changes in the future, a non-error may become an error and vice versa, and the behavior would change silently
  • a bare except (with no exception type specified) catches everything here, including real bugs

@zhaochenyang20 (Author)


I also do not like try-except, but do I have other choices? Like checking the type of the first output?


@fzyzcjy Jan 9, 2025


I guess one way would be to check whether config.engine == 'sglang' (and pass the config through to here), or maybe whether inference_engine.get_mode() == 'sglang' (and add a method on inference_engine that reports its mode; but that way may have a larger overhead if the engine lives on another Ray actor).
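
A minimal sketch of the first option, dispatching on an explicit backend flag instead of a bare try-except. The flag name and the per-engine output fields below are assumptions for illustration, not the actual OpenRLHF/vLLM/SGLang APIs:

def extract_token_ids(output, backend: str):
    # Hypothetical accessors; the real field names depend on the vLLM/SGLang versions in use.
    if backend == "vllm":
        return output.outputs[0].token_ids
    if backend == "sglang":
        return output["output_ids"]
    raise ValueError(f"unknown inference backend: {backend}")

This keeps format mismatches loud (an unknown backend raises immediately) instead of being silently swallowed by an except branch.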

Collaborator


Yes, this part should be refactored.

@catqaq
Collaborator

catqaq commented Jan 12, 2025

great job!

dummy_strategy.print = print
dummy_strategy.is_rank_0 = lambda: True
dummy_strategy.args = args
strategy = Empty()
Collaborator


Just set strategy to None for vLLM and SGLang.
Btw, for the reward model, please use the DeepSpeed strategy.
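
A minimal sketch of that suggestion, assuming the backend name is carried on args (the args.engine field is illustrative; get_strategy is the helper OpenRLHF's training scripts use, assuming it is importable here):

from openrlhf.utils import get_strategy

if args.engine in ("vllm", "sglang"):
    strategy = None  # inference-only engines don't need a training strategy
else:
    strategy = get_strategy(args)  # e.g. the DeepSpeed strategy, as used for the reward model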

@zhaochenyang20 (Author)


I just copied this from main.

@zhaochenyang20 (Author)

delete this print: print("os.environ['LOCAL_RANK']", os.environ["LOCAL_RANK"])

@zhaochenyang20 (Author)

Split SGLang and vLLM into two files and provide separate create_inference_engines functions.
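
A rough sketch of that layout, with one factory per backend module; the SGLang module name and the exact signatures are assumptions for illustration, not necessarily what this PR ends up with:

# vllm_engine.py -- factory for vLLM-backed Ray actors (illustrative signature)
def create_vllm_engines(num_engines: int, tensor_parallel_size: int, pretrain: str, seed: int):
    """Create num_engines Ray actors, each wrapping a vLLM engine."""
    ...

# sglang_engine.py -- factory for SGLang-backed Ray actors (illustrative signature)
def create_sglang_engines(num_engines: int, tensor_parallel_size: int, pretrain: str, seed: int):
    """Create num_engines Ray actors, each wrapping an SGLang engine."""
    ...

The caller then imports whichever factory matches the configured backend, which also gives a natural place to record the engine type for the dispatch discussed above.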

@zhaochenyang20 (Author)

delete this torch.cuda.synchronize()

@zhaochenyang20 changed the title from "[WIP] Integrate SGLang into OpenRLHF" to "Integrate SGLang into OpenRLHF. Non-Hybrid Engine Only" on Jan 28, 2025
@merrymercy

@xiaoxigua999 @hijkzzz This PR has been fully verified regarding accuracy and speed. Is it possible to merge this?
