[Bug #21941] YJIT: Fix local variables read as nil when EP escapes before YJIT is enabled by paracycle · Pull Request #16306 · ruby/ruby

paracycle · 2026-03-04T23:11:07Z

Summary

Fixes a bug where local variables are read as nil in JIT-compiled code when RubyVM::YJIT.enable is called mid-execution (e.g. from a Rack config.ru while Puma's server loop is already on the call stack).

Bug

The bug requires three ingredients:

1. A method is already on the call stack when RubyVM::YJIT.enable is called.
2. Before YJIT.enable, something causes the method's environment pointer (EP) to escape to the heap (e.g., creating a lambda/proc that captures the environment). This calls vm_make_env_each, which copies local variable values from the stack to a heap-allocated environment and updates cfp->ep to point to the heap copy. The original stack slots (accessed via vm_base_ptr(cfp)) become stale.
3. The method uses next inside a begin/rescue block wrapped in a begin/ensure block. This compiles to a throw instruction that enters YJIT through the jit_exception code path.

When vm_make_env_each runs, it calls rb_yjit_invalidate_ep_is_bp to notify YJIT that EP has escaped for this ISEQ. However, when YJIT was started with --yjit-disable (or not yet initialized), the INVARIANTS global is None, so rb_yjit_invalidate_ep_is_bp returns early as a no-op and the EP escape is never recorded.

Later, when YJIT compiles the jit_exception entry point, it checks iseq_escapes_ep(iseq) which returns false (no record of EP escape), so it assumes EP == BP and generates SP-based local variable access. SP-based access reads from vm_base_ptr(cfp) which points to the original (stale) stack location containing nil values, instead of cfp->ep which points to the heap-allocated environment with the real values.
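The sequence above can be sketched as a toy model. This is an illustration only, not YJIT's actual data structures: `Vm`, `IseqId`, `notify_ep_escape`, `enable_jit`, and `lost_escape_demo` are hypothetical names, and the tracker is reduced to a set of ISEQ ids. It shows how a notification sent before the tracker exists is lost, so a later lookup reports "no escape".

```rust
use std::collections::HashSet;

// Hypothetical stand-in for an ISEQ pointer.
type IseqId = u32;

// Toy model of the invariant tracker. `None` models "YJIT not initialized yet".
struct Vm {
    invariants: Option<HashSet<IseqId>>,
}

impl Vm {
    // Models the notification sent when an environment escapes to the heap.
    // When the tracker is absent this is a no-op, so the escape is never recorded.
    fn notify_ep_escape(&mut self, iseq: IseqId) {
        if let Some(escaped) = self.invariants.as_mut() {
            escaped.insert(iseq);
        }
    }

    // Models lazy enablement: tracking starts from an empty record.
    fn enable_jit(&mut self) {
        if self.invariants.is_none() {
            self.invariants = Some(HashSet::new());
        }
    }

    // Models the buggy lookup: "no record" is read as "never escaped".
    fn iseq_escapes_ep(&self, iseq: IseqId) -> bool {
        self.invariants
            .as_ref()
            .map_or(false, |escaped| escaped.contains(&iseq))
    }
}

// Escape first, enable second, then ask the tracker.
fn lost_escape_demo() -> bool {
    let mut vm = Vm { invariants: None };
    vm.notify_ep_escape(7); // dropped: nothing is listening yet
    vm.enable_jit();
    vm.iseq_escapes_ep(7)
}
```

In this model `lost_escape_demo()` returns `false` even though the escape happened, which is the condition under which the compiler would choose SP-based local access.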

Fix

Adds a runtime check in gen_entry_point: before compiling an entry, examine cfp->ep for the VM_ENV_FLAG_ESCAPED flag. If the flag is set but YJIT has no record of the escape, call rb_yjit_invalidate_ep_is_bp to register it. This causes subsequent code generation to use EP-based local access which correctly reads from the heap-allocated environment.

Test

Adds a regression test that reproduces the exact scenario: a lambda is created (causing EP escape), YJIT.enable is called mid-method, and next inside begin/rescue/ensure triggers the jit_exception path. Without the fix, i becomes nil after ~30 iterations when YJIT compiles the exception handler.

NOTE: The bug was found and the fix was made by Claude Opus 4.6 under my guidance and using the minimal reproduction script in the bug ticket.

XrXr · 2026-03-05T00:20:19Z

yjit/src/core.rs

+        let ep = unsafe { get_cfp_ep(cfp) };
+        let ep_flags = unsafe { (*ep).0 };
+        if ep_flags & (VM_ENV_FLAG_ESCAPED as usize) != 0 {
+            rb_yjit_invalidate_ep_is_bp(iseq);


This is wrong in two ways:

1. rb_yjit_invalidate_ep_is_bp() is only meant to be called by the interpreter and runtime code. The weird reentrance from here when we're in the compiler might create issues with &mut on global state. At minimum it's doing more work than it should -- we are not looking to invalidate anything here, just to change what iseq_escapes_ep() returns.

2. This makes the false assumption that the running CFP is the one being compiled. That's not necessarily the case, since YJIT can look at multiple iseqs during one compile run. This might fix iseq_escapes_ep() for the running iseq but we might have the same issue elsewhere.

We should figure out a way to have iseq_escapes_ep() be reliable in the face of delayed enablement, or use a more correct signal. I get the feeling that the correct way to interpret the return value from iseq_escapes_ep() is "yes" or "don't know", and YJIT was misinterpreting "don't know" as "never".
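One way to picture the "yes" / "don't know" reading is to make it explicit in a type. The sketch below is hypothetical and not taken from YJIT: `EpEscape`, `LocalAccess`, `choose_local_access`, and the `tracked_since_creation` parameter are all invented for illustration. The point is that "don't know" only permits the fast path when some other signal rules out an earlier, unobserved escape.

```rust
// Hypothetical two-valued answer: the tracker can confirm an escape,
// but the absence of a record is not proof that no escape happened.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum EpEscape {
    Yes,
    DontKnow,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum LocalAccess {
    SpRelative, // fast path, only valid while EP == BP
    ViaEp,      // always valid, reads through the environment pointer
}

// `tracked_since_creation` is an assumed extra signal: true only if escapes
// were being observed for the whole lifetime of the ISEQ.
fn choose_local_access(answer: EpEscape, tracked_since_creation: bool) -> LocalAccess {
    match (answer, tracked_since_creation) {
        // No recorded escape and nothing could have been missed: fast path is safe.
        (EpEscape::DontKnow, true) => LocalAccess::SpRelative,
        // A confirmed escape, or a window where escapes went unobserved.
        _ => LocalAccess::ViaEp,
    }
}
```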

Thanks for the review! You're right on both counts.

I've reworked the fix based on your insight that iseq_escapes_ep() returns "yes" and "don't know", not "yes" and "no".

The new approach:

Added a YJIT_ENABLED_AT_BOOT flag that's set in rb_yjit_init(true) (the --yjit path) but not in rb_yjit_enable() (the lazy path).

Changed iseq_escapes_ep() to use !yjit_enabled_at_boot_p() as the default for untracked ISEQs. When YJIT was enabled at boot, "not in map" means "never escaped" (safe). When YJIT was enabled lazily, "not in map" means "don't know" → conservatively report as escaped.

Removed the gen_entry_point hack entirely — no more calling rb_yjit_invalidate_ep_is_bp() from the compiler, no more assuming the running CFP is the one being compiled.

This is entirely within YJIT's Rust code. The EP==BP optimization is only lost for ISEQs that existed before lazy enablement and haven't had any block compiled yet — which is the correct conservative behavior since we can't know if their EP escaped before we started watching.
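A minimal sketch of the reworked lookup, under simplifying assumptions: `EscapeTracker`, `IseqId`, `record_escape`, and `record_compiled` are hypothetical names, and the per-ISEQ state is reduced to a single bool rather than whatever YJIT actually stores. Only the choice of default for untracked ISEQs is meant to mirror the description above.

```rust
use std::collections::HashMap;

// Hypothetical stand-in for an ISEQ pointer.
type IseqId = u32;

struct EscapeTracker {
    // Models the YJIT_ENABLED_AT_BOOT flag described above.
    enabled_at_boot: bool,
    // true = escape observed; false = tracked, no escape seen so far.
    tracked: HashMap<IseqId, bool>,
}

impl EscapeTracker {
    fn new(enabled_at_boot: bool) -> Self {
        EscapeTracker { enabled_at_boot, tracked: HashMap::new() }
    }

    // An escape notification that arrived while tracking was active.
    fn record_escape(&mut self, iseq: IseqId) {
        self.tracked.insert(iseq, true);
    }

    // A block was compiled for this ISEQ; keep any escape already recorded.
    fn record_compiled(&mut self, iseq: IseqId) {
        self.tracked.entry(iseq).or_insert(false);
    }

    // Untracked ISEQs default to "escaped" unless tracking began at boot.
    fn iseq_escapes_ep(&self, iseq: IseqId) -> bool {
        self.tracked
            .get(&iseq)
            .copied()
            .unwrap_or(!self.enabled_at_boot)
    }
}
```

With `EscapeTracker::new(false)` an ISEQ that has no entry is reported as escaped, while `EscapeTracker::new(true)` reports it as not escaped, so the conservative answer is confined to the lazy-enablement case.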

…fore YJIT is enabled

When YJIT is enabled lazily via RubyVM::YJIT.enable (or --yjit-disable), EP escape notifications from vm_make_env_each() are silently dropped because the YJIT invariants system is not yet initialized. This causes iseq_escapes_ep() to return false for ISEQs whose EP has already escaped, leading YJIT to generate SP-based local variable access that reads stale nil values instead of the correct heap-allocated environment.

Fix by tracking whether YJIT was enabled at boot. When it was not, iseq_escapes_ep() conservatively returns true for ISEQs with no tracking entry, since we cannot know whether their EP escaped before YJIT started observing.

matzbot requested a review from a team March 4, 2026 23:11

paracycle changed the title ~~YJIT: Fix local variables read as nil when EP escapes before YJIT is enabled~~ [Bug #124920] YJIT: Fix local variables read as nil when EP escapes before YJIT is enabled Mar 4, 2026

byroot mentioned this pull request Mar 4, 2026

Puma::Cluster#run': undefined method 'wait_readable' for nil puma/puma#3620

Closed

paracycle force-pushed the uk-fix-yjit-ep-escape branch from cc02218 to 86557d2 Compare March 4, 2026 23:22

paracycle changed the title ~~[Bug #124920] YJIT: Fix local variables read as nil when EP escapes before YJIT is enabled~~ [Bug #21941] YJIT: Fix local variables read as nil when EP escapes before YJIT is enabled Mar 4, 2026

paracycle force-pushed the uk-fix-yjit-ep-escape branch from 86557d2 to 0111e3e Compare March 4, 2026 23:26

XrXr requested changes Mar 5, 2026

paracycle force-pushed the uk-fix-yjit-ep-escape branch from 0111e3e to feba10a Compare March 5, 2026 01:06

paracycle requested a review from XrXr March 5, 2026 01:08

XrXr mentioned this pull request Mar 13, 2026

YJIT: Fix not reading locals from cfp->ep after YJIT.enable and exceptional entry #16381

Merged

XrXr closed this in #16381 Mar 16, 2026

paracycle wants to merge 1 commit into ruby:master from Shopify:uk-fix-yjit-ep-escape