Winch aarch64 extend, wrap, popcnt, promote & demote by vulc41n · Pull Request #9114 · bytecodealliance/wasmtime

vulc41n · 2024-08-12T16:28:55Z

Hey 👋

This PR implements extend, wrap, popcnt, promote & demote instructions for winch targeting aarch64.

saulecabrera · 2024-08-13T11:52:07Z

I can help review this one.

saulecabrera · 2024-08-13T13:51:14Z

winch/codegen/src/isa/aarch64/asm.rs

+        let (signed, from_bits, to_bits) = match kind {
+            I64ExtendI32S => (true, 32, 64),
+            I64ExtendI32U => (false, 32, 64),
+            I32Extend8S => (true, 8, 32),
+            I32Extend16S => (true, 16, 32),
+            I64Extend8S => (true, 8, 64),
+            I64Extend16S => (true, 16, 64),
+            I64Extend32S => (true, 32, 64),
+        };


Could we add some methods to the ExtendKind implementation instead? Something like:

impl ExtendKind {
    fn signed(&self) -> bool { ... }
    fn from_bits(&self) -> u8 { ... }
    fn to_bits(&self) -> u8 { ... }
}
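A minimal sketch of what such methods could look like, assuming a stand-alone `ExtendKind` enum with the seven variants matched in the diff above. The enum definition here is a stand-in for illustration, not Winch's actual type, and the method bodies are simply derived from the `(signed, from_bits, to_bits)` tuples in that match.

```rust
/// Stand-in for Winch's `ExtendKind`; variant names are copied from the diff,
/// but this definition itself is an assumption made for the example.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum ExtendKind {
    I64ExtendI32S,
    I64ExtendI32U,
    I32Extend8S,
    I32Extend16S,
    I64Extend8S,
    I64Extend16S,
    I64Extend32S,
}

impl ExtendKind {
    /// Whether the extension is a sign extension.
    fn signed(&self) -> bool {
        // Only one variant in the match above is unsigned.
        !matches!(self, Self::I64ExtendI32U)
    }

    /// Width in bits of the source value.
    fn from_bits(&self) -> u8 {
        match self {
            Self::I32Extend8S | Self::I64Extend8S => 8,
            Self::I32Extend16S | Self::I64Extend16S => 16,
            Self::I64ExtendI32S | Self::I64ExtendI32U | Self::I64Extend32S => 32,
        }
    }

    /// Width in bits of the result.
    fn to_bits(&self) -> u8 {
        match self {
            Self::I32Extend8S | Self::I32Extend16S => 32,
            _ => 64,
        }
    }
}

fn main() {
    let kind = ExtendKind::I32Extend8S;
    println!(
        "{:?}: signed={}, {} -> {} bits",
        kind,
        kind.signed(),
        kind.from_bits(),
        kind.to_bits()
    );
}
```

With methods like these, the assembler can query the kind directly instead of destructuring a tuple at each call site.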

saulecabrera · 2024-08-13T14:23:44Z

winch/codegen/src/isa/aarch64/masm.rs

-        todo!()
+    fn popcnt(&mut self, context: &mut CodeGenContext, size: OperandSize) {
+        let src = context.pop_to_reg(self, None);
+        let tmp = context.reg_for_class(RegClass::Float, self);


To reduce register pressure, could we use the scratch float register here?

saulecabrera · 2024-08-13T14:29:11Z

tests/disas/winch/aarch64/i64_popcnt/fallback.wat

In the case of aarch64, I believe the fallback test is redundant, since we are not generating an alternative sequence of instructions as we do on x64.

saulecabrera · 2024-08-13T14:51:25Z

winch/codegen/src/isa/aarch64/masm.rs

+            OperandSize::S8 => {}
+            OperandSize::S16 => self.asm.addp_rrr(tmp, tmp, tmp, VectorSize::Size8x8),


If I'm not wrong, Wasm only defines {i32, i64}.popcnt, so I believe we only need to handle the S32 | S64 case and mark all the other operand sizes with unreachable!() instead of unimplemented!(), which also means that we can probably drop the addp_rrr implementation.

vulc41n · 2024-08-14

You're right, I used lower.isle as a reference, but CLIF is different from Wasm.

Regarding "handle all the other operand sizes with unreachable!() instead of unimplemented": I just removed the match; please tell me if you think an assertion is needed.
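To illustrate why no per-size match is needed once only 32-bit and 64-bit operands remain, here is a plain-Rust model of a byte-lane popcnt. It assumes the value is placed in the low 64 bits of a vector register, the bits are counted per byte lane, and the lanes are then summed (the across-lane add corresponds to the `addv` call visible in the diff; the per-byte count step is an assumption, since that part of the diff is not shown here). The function names are hypothetical.

```rust
/// Models a popcnt computed over eight u8 lanes.
/// This is a software model for illustration, not the code Winch emits.
fn popcnt_via_byte_lanes(value: u64) -> u64 {
    // View the 64-bit value as eight byte lanes.
    let lanes = value.to_le_bytes();
    // Count the set bits in each lane independently.
    let counts = lanes.map(|b| b.count_ones() as u8);
    // Sum across lanes; the maximum is 8 * 8 = 64, which fits in a u8.
    let total: u8 = counts.iter().sum();
    u64::from(total)
}

/// A 32-bit operand is zero-extended, so the upper four lanes contribute 0
/// and the same lane-wise sequence works unchanged.
fn popcnt32(value: u32) -> u32 {
    popcnt_via_byte_lanes(u64::from(value)) as u32
}

fn popcnt64(value: u64) -> u64 {
    popcnt_via_byte_lanes(value)
}

fn main() {
    println!("{}", popcnt32(0xF0F0_F0F0));
    println!("{}", popcnt64(u64::MAX));
}
```

Because the zero-extended upper lanes add nothing to the sum, the 32-bit and 64-bit cases can share one instruction sequence in this model.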

saulecabrera

One minor nit and this should be good to go.

saulecabrera · 2024-08-14T09:50:05Z

winch/codegen/src/isa/aarch64/masm.rs

+        self.asm.addv(tmp, tmp, VectorSize::Size8x8);
+        self.asm.mov_from_vec(tmp, src.into(), 0, OperandSize::S8);
+        context.stack.push(src.into());
+        context.free_reg(tmp);


The scratch register is manually tracked, i.e., free_reg is a no-op in this case, so we can remove this call.
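The point about `free_reg` can be pictured with a toy register pool. This is a hypothetical model, not Winch's register allocator: `RegPool`, its fields, and the register numbers are invented for the example. It only assumes that a scratch register sits outside the allocatable set, so returning it to the pool changes nothing.

```rust
/// Hypothetical toy pool: bit `i` of each mask stands for register `i`.
struct RegPool {
    /// Registers the pool is allowed to hand out.
    allocatable: u32,
    /// Registers currently available.
    free: u32,
}

impl RegPool {
    fn new(allocatable: u32) -> Self {
        Self { allocatable, free: allocatable }
    }

    /// Hands out the lowest-numbered free register, if any.
    fn alloc(&mut self) -> Option<u8> {
        if self.free == 0 {
            return None;
        }
        let reg = self.free.trailing_zeros() as u8;
        self.free &= !(1u32 << reg);
        Some(reg)
    }

    /// Returns a register to the pool. Registers outside the allocatable
    /// set (such as a scratch register) are ignored.
    fn free_reg(&mut self, reg: u8) {
        let bit = 1u32 << reg;
        if self.allocatable & bit != 0 {
            self.free |= bit;
        }
    }

    fn available(&self) -> u32 {
        self.free.count_ones()
    }
}

fn main() {
    const SCRATCH: u8 = 31; // assumed scratch register number for this toy
    let mut pool = RegPool::new(0b1111); // registers 0..=3 are allocatable
    let r = pool.alloc().unwrap();
    pool.free_reg(SCRATCH); // no effect: the scratch register is not in the pool
    println!("available after freeing scratch: {}", pool.available());
    pool.free_reg(r);
    println!("available after freeing r{}: {}", r, pool.available());
}
```

In this model, using the scratch register also leaves every allocatable register available to the rest of the code generator, which is the register-pressure argument made earlier in the review.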

saulecabrera

Thanks!

vulc41n requested review from a team as code owners August 12, 2024 16:28

vulc41n requested review from abrown and alexcrichton and removed request for a team August 12, 2024 16:28

vulc41n force-pushed the winch-aarch64-extend-wrap branch from 5a2afc0 to 638b4a5 Compare August 12, 2024 16:47

saulecabrera requested review from saulecabrera and removed request for abrown and alexcrichton August 13, 2024 11:52

saulecabrera reviewed Aug 13, 2024


saulecabrera reviewed Aug 14, 2024


vulc41n added 13 commits August 14, 2024 12:07

winch aarch64 extend

7b4d5df

winch aarch64 wrap

890e243

winch aarch64 popcnt

a42175a

winch aarch64 float promote & demote

0fd07f0

winch aarch64 extend tests

dc385a1

winch aarch64 wrap tests

e21d79e

winch aarch64 popcnt tests

31a2bd5

winch aarch64 promote & demote tests

3289ad2

winch ExtendKind methods

84758e2

winch aarch64 popcnt: use scratch register

e9cce59

winch aarch64 popcnt: remove fallback tests

bddbd62

winch aarch64 popcnt: remove unused sizes cases

faea62b

winch aarch64 tests: fix float loading

f2e2425

vulc41n force-pushed the winch-aarch64-extend-wrap branch from d896ad5 to f2e2425 Compare August 14, 2024 10:09

winch aarch64 popcnt: do not free scratch register

b48554b

saulecabrera approved these changes Aug 14, 2024


saulecabrera added this pull request to the merge queue Aug 14, 2024

Merged via the queue into bytecodealliance:main with commit 3bd6708 Aug 14, 2024

vulc41n deleted the winch-aarch64-extend-wrap branch August 14, 2024 12:26

saulecabrera merged 14 commits into bytecodealliance:main from vulc41n:winch-aarch64-extend-wrap
