| author | iximeow <me@iximeow.net> | 2025-06-02 08:03:12 +0000 | 
|---|---|---|
| committer | iximeow <me@iximeow.net> | 2025-06-02 08:03:12 +0000 | 
| commit | 276172a5a888165f82075eba48bd6f79246c2dcc (patch) | |
| tree | dc0e018f59fdbe18d9323bff22edcb530e1d2967 /src/armv8 | |
| parent | 614d7e8bc6325dd3e632b3e83da627fa6ff9f31a (diff) | |
some armv7 decode helpers are trivial functions but didn't inline

both from_u8 and the build function here compiled to truly trivial code:
four instructions (mov rdi, rax; cmp 0xlim, rax; jae panic; ret) in the
hot path, and constrained register choice on the caller side. inlining
these makes for a *smaller* armv7 decoder, on the order of 5kb down from
5.5kb. in the process it also gets about 40% faster (400mb/s to 560mb/s).

inlining decode_into, then, really just helps the standalone decoder
benchmark case. this moves decode throughput from 560mb/s to 724mb/s.
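
as a rough illustration of the kind of helper described above, here is a minimal sketch, not the code from this commit. `Reg`, `number`, and `decode_rd` are hypothetical names, and the choice of `#[inline]` rather than `#[inline(always)]` is an assumption. the attribute is a hint: it makes the function body available to callers in other crates, but the compiler still decides whether to inline.

```rust
/// hypothetical stand-in for a register operand; not the crate's actual type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Reg {
    bits: u8,
}

impl Reg {
    /// a trivial bounds-checked constructor: one compare, one branch to a
    /// panic, and a move. as an out-of-line call this costs more than the
    /// work it does, and pins the argument and result to fixed registers.
    #[inline]
    pub fn from_u8(bits: u8) -> Reg {
        if bits > 0b1111 {
            panic!("register number out of range");
        }
        Reg { bits }
    }

    /// read the register number back out.
    #[inline]
    pub fn number(&self) -> u8 {
        self.bits
    }
}

/// hypothetical hot-path caller: extracts a 4-bit field from an instruction
/// word. once from_u8 is inlined here, the mask makes the range check
/// provably dead, so the optimizer is free to drop it.
pub fn decode_rd(word: u32) -> Reg {
    Reg::from_u8(((word >> 12) & 0b1111) as u8)
}
```

the point of the sketch is the caller side: with the helper inlined, the check can fold away against the mask and the register allocator is no longer constrained by a call boundary, which is consistent with the decoder getting smaller rather than larger.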
Diffstat (limited to 'src/armv8')
0 files changed, 0 insertions, 0 deletions
