sha2: improve RISC-V Zknh backends by newpavlov · Pull Request #617 · RustCrypto/hashes

newpavlov · 2024-08-26T16:48:45Z

Annoyingly, RISC-V is really inconvenient when we have to deal with misaligned loads/stores. LLVM by default generates very inefficient code which loads every byte separately and combines them into a 32/64 bit integer. The ld instruction "may" support misaligned loads and for Linux user-space it's even guaranteed, but it can be (and IIUC often in practice is) "extremely slow", so we should not rely on it while writing performant code.

After asking around, it looks like this mess is here to stay, so we have no choice but to work around it. To do that this PR introduces two separate paths for loading block data: aligned and misaligned. The aligned path should be the most common one. In the misaligned path we have to rely on inline assembly since we have to load some bits outside of the block.

Additionally, this PR makes inlining in the riscv-zknh backend less aggressive, which makes generated binary code 3-4 times smaller at the cost of one additional branch.

Generated assembly for RV64:

SHA-256, unrolled: https://rust.godbolt.org/z/GxPM8PE3P (2278 bytes)
SHA-256, compact: https://rust.godbolt.org/z/4KWrcve9E (538 bytes)
SHA-512, unrolled: https://rust.godbolt.org/z/Th8ro8Tbo (2278 bytes)
SHA-512: compact: https://rust.godbolt.org/z/dqrv48ax3 (530 bytes)

newpavlov · 2024-08-26T17:13:32Z

+    unsafe {
+        asm!(
+            "ld {left}, 0({bp})",
+            "srl {left}, {left}, {off1}",


This asm! block (and several others like it) read bits outside of the block (e.g. here we read off1 bits before block). In my understanding, this is not an UB. All bits are guaranteed to reside on the same page as at least one byte of the block, meaning that page faults should be impossible on such edge loads. The garbage bits get eliminated by the shift which is intentionally part of the assembly block, so outside of this asm block we can observe only bits which belong to block.

UPD: IRLO discussion

nazar-pc

llvm/llvm-project#150263 led me to this PR (the link there wasn't permalink, so it took some effort to chase the PR).

Would it be worth adding a separate case and simplifying block loads when +unaligned-scalar-mem feature is specified explicitly?

I have a virtual target that supports this and LLVM already produces A LOT more compact code when this feature is enabled, so not needing to do extra alignment checks + even less code should be even more beneficial. I do use Zknh support in sha2 crate specifically.

newpavlov · 2026-05-28T01:18:49Z

I plan to completely remove the unaligned load hack in a future release and blame LLVM and RISC-V spec for terrible codegen with default compilation options.

nazar-pc · 2026-05-28T01:22:53Z

Oh, does that also mean +unaligned-scalar-mem will naturally result in more efficient code in that case by accident? I see why it is unpleasant to maintain such hacks downstream.

newpavlov · 2026-05-28T15:58:21Z

I wouldn't say "by accident", but yes.

newpavlov · 2026-06-03T19:36:23Z

@nazar-pc
See #879

newpavlov added 6 commits August 26, 2024 19:02

sha2: improve RISC-V Zknh backend

b6f265f

Use less aggressive inlining in the riscv-zknh backend

8fb3a99

relax target feature requirements

18db623

Fix compile error cfg

0acd099

fix CI

2326f77

Use asm!-based opaque load instead of volatile read

5ad4f58

newpavlov commented Aug 26, 2024

View reviewed changes

newpavlov requested a review from tarcieri August 26, 2024 17:13

newpavlov added 4 commits August 26, 2024 20:45

fix opaque load

87c8b87

fix opaque load

8386abc

Expose only load_block from the util modules

1dec071

fix

88bae59

newpavlov merged commit b2312fa into master Aug 27, 2024

newpavlov deleted the sha2/riscv_unaligned branch August 27, 2024 09:56

nazar-pc reviewed May 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sha2: improve RISC-V Zknh backends#617

sha2: improve RISC-V Zknh backends#617
newpavlov merged 10 commits into
masterfrom
sha2/riscv_unaligned

newpavlov commented Aug 26, 2024 •

edited

Loading

Uh oh!

newpavlov Aug 26, 2024 •

edited

Loading

Uh oh!

nazar-pc left a comment •

edited

Loading

Uh oh!

newpavlov commented May 28, 2026

Uh oh!

nazar-pc commented May 28, 2026

Uh oh!

newpavlov commented May 28, 2026

Uh oh!

newpavlov commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

newpavlov commented Aug 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov Aug 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nazar-pc left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

newpavlov commented May 28, 2026

Uh oh!

nazar-pc commented May 28, 2026

Uh oh!

newpavlov commented May 28, 2026

Uh oh!

newpavlov commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

newpavlov commented Aug 26, 2024 •

edited

Loading

newpavlov Aug 26, 2024 •

edited

Loading

nazar-pc left a comment •

edited

Loading