Store array of original_insns on iseq by jhawthorn · Pull Request #5622 · ruby/ruby

jhawthorn · 2022-03-03T20:10:46Z

This aims to fix https://bugs.ruby-lang.org/issues/18269 as well as making GC iseq marking (and anywhere else we iterate through the instructions) faster by avoiding the hash lookup inside rb_vm_insn_addr2insn.

This is done by storing the original instruction values as a byte array on the iseq before they are translated for threading. We can then use those values directly instead of using the hash table from threaded to non-threaded.

The array is per-instruction rather than per-PC-address, so getting the instruction at a specific PC/offset address is awkward and this is designed for iterating the entire iseq. The advantage is that the array is more compact.

The downside of this is that we need to record an extra byte per-vm-instruction.

TODO:

Replace remaining uses of rb_vm_insn_addr2insn and similar (To avoid conflicts with YJIT's port, avoiding modifications there)
Check memory increase on a large app
Should this code path be disabled if token threading is used?
Add regression test for Bug18269

cc @nobu @k0kubun @jeremyevans from linked issue. What do you think of this approach?

k0kubun · 2022-03-04T04:12:24Z

vm_core.h


+    struct {
+        uint8_t *insns;
+        unsigned int size;


What's different between original_insns.size and iseq_size?

original_insns.size is the count of instructions in this iseq. iseq_size is the count of instructions + the total count of operands.

k0kubun · 2022-03-04T04:13:28Z

I can't think of a better approach to fix [Bug #18269], so it seems legit to me.

jhawthorn · 2022-12-09T22:10:50Z

This is still an option, but I really don't want to make iseq any larger 😅. I think for now we've worked around the issue in other ways.

jhawthorn added 5 commits March 3, 2022 11:41

Record byte array of original instructions

d323cdf

Use original_insns for original_iseq

9450686

Use original_insns for rb_iseq_each_value

824fcc5

Use original_insns in update_catch_except_flags

d6aa9ff

Fix static

d2bce2d

jhawthorn mentioned this pull request Mar 3, 2022

Remove get_insn_idx from iseq_inline_constant_cache struct #5623

Closed

k0kubun reviewed Mar 4, 2022

View reviewed changes

jhawthorn closed this Dec 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store array of original_insns on iseq#5622

Store array of original_insns on iseq#5622
jhawthorn wants to merge 5 commits intoruby:masterfrom
jhawthorn:original_insns

jhawthorn commented Mar 3, 2022 •

edited

Loading

Uh oh!

k0kubun Mar 4, 2022

Uh oh!

jhawthorn Mar 4, 2022

Uh oh!

k0kubun commented Mar 4, 2022

Uh oh!

jhawthorn commented Dec 9, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jhawthorn commented Mar 3, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k0kubun Mar 4, 2022

Choose a reason for hiding this comment

Uh oh!

jhawthorn Mar 4, 2022

Choose a reason for hiding this comment

Uh oh!

k0kubun commented Mar 4, 2022

Uh oh!

jhawthorn commented Dec 9, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jhawthorn commented Mar 3, 2022 •

edited

Loading