Investigate SVE acceleration for RPO hash function

Rounds of [RPO hash function](https://github.com/0xPolygonMiden/crypto/blob/main/src/hash/rpo/mod.rs) have a very regular structure which should be amenable to vectorized computation. This is especially true for the [inverse alpha](https://github.com/0xPolygonMiden/crypto/blob/main/src/hash/rpo/mod.rs#L414) portion which applies ~70 identical operations to a state of 12 elements. This portion is by far the most time-consuming part of the hash function.

By using vectorized instructions, it may be possible to speed the hash function up by 2x - 3x (though, this needs to be confirmed). As one of our target machines for Miden VM is Graviton 3, which supports SVE extension, it would be great to see if can get this type of speed up there.

Ideally, we'd want to add a feature to this crate which, when enabled, would replace the current pure Rust code for either the entire RPO [permutation](https://github.com/0xPolygonMiden/crypto/blob/main/src/hash/rpo/mod.rs#L414), a single RPO [round](https://github.com/0xPolygonMiden/crypto/blob/main/src/hash/rpo/mod.rs#L345), are even just the [inverse alpha](https://github.com/0xPolygonMiden/crypto/blob/main/src/hash/rpo/mod.rs#L345) computation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate SVE acceleration for RPO hash function #158

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Investigate SVE acceleration for RPO hash function #158

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions