Optimize compression input copy by malik672 · Pull Request #240 · leanEthereum/leanVM

malik672 · 2026-05-31T07:37:56Z

Summary

replace two slice copies with a single flattened contiguous copy in symetric::compress
keep the change scoped to crates/backend/symetric/src/compression.rs

For the concrete optimized instantiation compress::<KoalaBear, Poseidon1KoalaBear16, 8, 16>:

Old input-copy shape:

New input-copy shape:

LLVM IR shows one contiguous input-side copy: memcpy 64
AArch64 asm still lowers that 64-byte copy into two 32-byte vector load/store pairs on this target, but the extra staging buffer in the old helper goes away

Overall optimized function effect:

old IR: two 32-byte copies into state, then an extra 64-byte copy into the compression buffer
new IR: one 64-byte copy directly into the compression buffer
old asm frame size: 224 bytes
new asm frame size: 160 bytes

Source change:

state[..2 * CHUNK].copy_from_slice(input.as_flattened());

Optimize compression input copy

43f5f8a

TomWambsgans merged commit d559dc2 into leanEthereum:main May 31, 2026
3 checks passed