Speed-up SHA-1 NEON assembly implementation
* cipher/sha1-armv7-neon.S: Tweak implementation for speed-up.
Benchmark on Cortex-A8 1008Mhz:
New:
| nanosecs/byte mebibytes/sec cycles/byte
SHA1 | 7.04 ns/B 135.4 MiB/s 7.10 c/B
Old:
| nanosecs/byte mebibytes/sec cycles/byte
SHA1 | 7.79 ns/B 122.4 MiB/s 7.85 c/B
- Signed-off-by: Jussi Kivilinna <jussi.kivilinna@iki.fi>